Single-cell RNA seq analysis of Multiple Sclerosis CSF & blood

Preamble

This directory contains scripts used to analyse scRNAseq data from the Cambridge and TUM cohorts.
Single cell sequencing data were generated using 10X 5' and VDJ single cell sequencing technology from CSF and PBMC samples in a cohort of people with Multiple Sclerosis and other neurological disease controls.
All analyses were run on the Cambridge Slurm HPC.
Unless specified, all scripts were run in R/4.0.3 or R/4.1.0.
You can access the paper in Cell Reports Medicine here.
Raw data (Seurat objects containing 5' gene expression and VDJ receptor sequencing data for B and T cells) can be downloaded via Zenodo here.
Code contributors: Ben Jacobs and Christiane Gasperi

Code

Deconvolution and basic qc

sbatch gex_deconvolution_qc_step1.sh

This script runs Rscript gex_deconvolution_qc.R which performs the following QC steps on each batch of GEX data

Filtering by RNA count & MT%
Ambient RNA correction with SoupX
Doublet identification
Per-batch normalisation with SCTransform

Integration

sbatch integration_icelake.sh

This script runs Rscript integration.R which integrates the post-qc datasets using Harmony.

UMAP

sbatch umap_icelake.sh

This script runs Rscript umap.R which performs UMAP and Louvain clustering across a range of parameters.

Cluster biomarkers

sbatch cluster_biomarkers_icelake.sh

This script runs Rscript find_cluster_biomarkers_and_update_pheno.R which does the following

Cleans phenotypes
Makes some plots exploring clustering
Compares annotations of cell types across different methods (SingleR, Azimuth, Celltypist)
Calculates cluster-specific biomarkers

CellTypist annotation

sbatch celltypist.sh

Which splits clusters with Rscript celltypist_prep.R and then runs celltypist in each cluster.

Update cluster IDs

sbatch update_clusters.sh

Which runs Rscript update_cluster_labels.R to update cluster IDs

DE & DA

To run DE and DA using the broad clusters:

sbatch de_icelake.sh

Runs DE tests with Rscript de_da_tests_phenotypes.R
Summary plots then made with Rscript de_summary_plots.R

GSEA

Then to run GSEA on the broad clusters with those DE results:

sbatch gsea.sh

Pathway analysis

sbatch pathway_analysis.sh

Cell-cell communication

sbatch ccc_per_sample.sh

Which runs LIANA on a per-sample basis
Rscript Rscript ccc_overall.R then combines and explores these results

Prepare for Immcantation/Dandelion

These scripts prepare the VDJ data for QC with dandelion

Rscript dandelion_preparation.R TCR
Rscript dandelion_preparation.R BCR
Rscript dandelion_preparation2.R TCR
Rscript dandelion_preparation2.R BCR
Rscript make_dandelion_metafile.R

And then to run dandelion pre-processing:

sbatch dandelion_bcr.sh
sbatch dandelion_tcr.sh

Filter GEX based on VDJ QC

These scripts then filter the GEX data based on the QC'd VDJ data:

sbatch dandelion_filtering_tcr.sh
sbatch dandelion_filtering.sh

Analyse VDJ data

Rscript bcr_analysis.R
Rscript tcr_analysis.R

eQTL analysis

./eQTL_analysis/MasterScript.sh contains the eQTL pipeline and refers to scripts in ./eQTL_analysis/.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Single-cell RNA seq analysis of Multiple Sclerosis CSF & blood

Preamble

Code

Deconvolution and basic qc

Integration

UMAP

Cluster biomarkers

CellTypist annotation

Update cluster IDs

DE & DA

GSEA

Pathway analysis

Cell-cell communication

Prepare for Immcantation/Dandelion

Filter GEX based on VDJ QC

Analyse VDJ data

eQTL analysis

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
eQTL_analysis		eQTL_analysis
README.md		README.md
bcr_analysis.R		bcr_analysis.R
bcr_rep_analysis.R		bcr_rep_analysis.R
ccc_per_sample.R		ccc_per_sample.R
ccc_per_sample.sh		ccc_per_sample.sh
celltypist.sh		celltypist.sh
celltypist_prep.R		celltypist_prep.R
cluster_biomarkers_icelake.sh		cluster_biomarkers_icelake.sh
dandelion_bcr.sh		dandelion_bcr.sh
dandelion_bcr_filter_to_donor.R		dandelion_bcr_filter_to_donor.R
dandelion_filtering.sh		dandelion_filtering.sh
dandelion_filtering_tcr.sh		dandelion_filtering_tcr.sh
dandelion_preparation.R		dandelion_preparation.R
dandelion_preparation2.R		dandelion_preparation2.R
dandelion_tcr.sh		dandelion_tcr.sh
de_da_tests_phenotypes.R		de_da_tests_phenotypes.R
de_icelake.sh		de_icelake.sh
de_summary_plots.R		de_summary_plots.R
find_cluster_biomarkers_and_update_pheno.R		find_cluster_biomarkers_and_update_pheno.R
gex_deconvolution_step1.R		gex_deconvolution_step1.R
gex_deconvolution_step1.sh		gex_deconvolution_step1.sh
gsea.R		gsea.R
gsea.sh		gsea.sh
integration.R		integration.R
integration_icelake.sh		integration_icelake.sh
make_dandelion_metafile.R		make_dandelion_metafile.R
pathway_analysis.R		pathway_analysis.R
pathway_analysis.sh		pathway_analysis.sh
process_data_for_sharing.R		process_data_for_sharing.R
process_data_for_sharing.sh		process_data_for_sharing.sh
tcr_analysis.R		tcr_analysis.R
tcr_rep_analysis.R		tcr_rep_analysis.R
umap.R		umap.R
umap_icelake.sh		umap_icelake.sh
update_cluster_labels.R		update_cluster_labels.R
update_clusters.sh		update_clusters.sh

Folders and files

Latest commit

History

Repository files navigation

Single-cell RNA seq analysis of Multiple Sclerosis CSF & blood

Preamble

Code

Deconvolution and basic qc

Integration

UMAP

Cluster biomarkers

CellTypist annotation

Update cluster IDs

DE & DA

GSEA

Pathway analysis

Cell-cell communication

Prepare for Immcantation/Dandelion

Filter GEX based on VDJ QC

Analyse VDJ data

eQTL analysis

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages