This goal of this pipeline is to facilitate reproducible processing of 10X Genomics data through a standardized workflow.
- Download raw sequencing data, check MD5sums, untar, run bcl2fastq
- SLURM array for GEX alignment
- SLURM array for CITEseqCount (dependent on completion of stage 2)
- SLURM array for TCR/BCR alignment (independent of stages 2 and 3)
- Update read structure in bcl2fastq for dual indices
- Specify sample_sheet_creator_dual_index.R for dual index sample sheet creation
- Build capacity for multiome (i.e. paired scRNAseq and paired scATACseq) analysis - cellranger ARC required
- Automate backup to Pitt Box and/or AWS
- Aggregate web summaries from GEX and TCR/BCR alignment and run_info from CITEseqCount