-
Notifications
You must be signed in to change notification settings - Fork 4
Pipeline philosophy
Single-cell data analysis cannot be done through a one-shot pipeline. Indeed, the parameters used in the pipeline strongly depend on the quality of the data and the biological variability (due to the type of tissue studied). So, you have to estimate them based on the previous steps. For example, for the filter thresholds of poor quality droplets or empty droplets, the quality of the droplets in the sample should be studied.
Thus, there are many interruptions in the pipeline that must be performed, materialized by steps:
- Alignment_countTable_GE,
- Droplets_QC_GE,
- Filtering_GE,
- Norm_DimRed_Eval_GE,
- Clust_Markers_Annot_GE,
- Cerebro,
- Alignment_countTable_ADT,
- Adding_ADT,
- Alignment_annotations_TCR_BCR,
- Adding_TCR,
- Adding_BCR.
This step system also makes it easy to rerun specific steps to test multiple parameters and compare results, without having to rerun the entire pipeline.
Also, it allows to choose the type of analysis to be carried out according to the type of data (gene expression, cell surface proteins, immune repertoire profiling, etc.), and thus to be able to carry out multiomics analysis, all combined in a single pipeline of analysis.
To help distinguish between different types of data easier, a tag is added as a suffix to the name of the samples:
- "_GE": for Gene Expressions
- "_ADT": for Antibody-Derived Tags (Cell Surface Proteins corresponding to the CITE-seq part)
- "_TCR": for T Cell Receptor expression
- "_BCR": for B Cell Receptor expression
If the tag is already at the end of your sample, it will not be added.
Resources of the Theory of single cell RNA-seq
v1.3
Pipeline details
Configuration
-
Parameter file
- Steps
- Alignment_countTable_GE
- Droplets_QC_GE
- Filtering_GE
- Norm_DimRed_Eval_GE
- Clust_Markers_Annot_GE
- Cerebro
- Alignment_countTable_ADT
- Adding_ADT
- Alignment_annotations_TCR_BCR
- Adding_TCR
- Adding_BCR
- Int_Norm_DimRed_Eval_GE
- Int_Clust_Markers_Annot_GE
- Int_Adding_ADT
- Int_Adding_TCR
- Int_Adding_BCR
- Grp_Norm_DimRed_Eval_GE
- Grp_Clust_Markers_Annot_GE
- Grp_Adding_ADT
- Grp_Adding_TCR
- Grp_Adding_BCR
- Additional files
Results help
- Arborescence of all results
-
Observations and weird results
- Not a threshold by emptyDrops
- Large and small cells into the same sample
- emptyDrops does't work well
- More than 15% mitochondrial RNA while I filtered them out at 15%
- Impact of empty droplets on umap
- Choose the right number of dimensions
- Be careful with the colors, they are sometimes misleading
- Impact of bias correction on umap
Complete Examples of school cases
Individual analysis :
1 sample (scRNA-seq + ADT + TCR + BCR)
Grouped/Integrated analysis :
2 samples (scRNA-seq + ADT + TCR + BCR)