CoSTseq (Co-transcriptional structure tracking)

Data processing pipeline and analysis code for handling CoSTseq and DMS-MaPseq sequencing data sets.

Installing the CoSTseq package

CoSTseq uses Snakemake and a custom conda environment. In addition, the following software packages need to be installed and accessible for Snakemake: fastp, STAR, UMICollapse, RNAstructure, samtools. Set the path to each software package in the config/snake_CoST.yaml file.

# clone repository
git clone https://github.com/NeugebauerLab/CoSTseq.git
cd CoSTseq

# create conda environment
conda env create --name CoST --file=CoST_env.yml # create new environment from template
conda activate CoST # activate

# install python package
pip install -e .

Running the data processing pipeline

After completing the configuration process, the pipeline can be executed using the following command:

conda activate CoST
snakemake -c 16 -d smk_rundir/test --configfile config/snake_CoST.yaml --resources mem_mb=32000 --rerun-incomplete --use-conda

Re-generating analyses and figures from "Rapid folding of nascent RNA regulates eukaryotic RNA biogenesis"

To re-generate the analyses, first run the analysis pipeline on deposited raw data, or download processed files from GEO accession number GSE254264. Save the processed data files in the data directory. Analysis code for each individual figure is available in jupyter notebooks in the notebooks directory and can be executed directly.

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
config		config
data		data
notebooks		notebooks
scripts		scripts
src		src
CoST_env.yml		CoST_env.yml
LICENSE		LICENSE
README.md		README.md
Snakefile		Snakefile
cmap.txt		cmap.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CoSTseq (Co-transcriptional structure tracking)

Installing the CoSTseq package

Running the data processing pipeline

Re-generating analyses and figures from "Rapid folding of nascent RNA regulates eukaryotic RNA biogenesis"

About

Uh oh!

Releases 1

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CoSTseq (Co-transcriptional structure tracking)

Installing the CoSTseq package

Running the data processing pipeline

Re-generating analyses and figures from "Rapid folding of nascent RNA regulates eukaryotic RNA biogenesis"

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages