GitHub - chhetribsurya/Encode_wgbs_pipe: WGBS DNA methylation pipeline for ENCODE consortium samples

Branches Tags

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
job_wait_scripts		job_wait_scripts
summary_report_ofall_LIBS		summary_report_ofall_LIBS
Readme.txt		Readme.txt
bismark_pipeline_main.sh		bismark_pipeline_main.sh
bismark_pipeline_main_oldcluster.sh		bismark_pipeline_main_oldcluster.sh
call_fastq_split.sh		call_fastq_split.sh
call_insert_size_plots_using_samtools.sh		call_insert_size_plots_using_samtools.sh
call_mergeUnsorted_dedup_files_for_methExtraction.sh		call_mergeUnsorted_dedup_files_for_methExtraction.sh
call_sort_for_coverage.sh		call_sort_for_coverage.sh
call_trim_galore_bismark_alignment.sh		call_trim_galore_bismark_alignment.sh
check_successful_completion.sh		check_successful_completion.sh
copy_wgbs_file_from_flowlane.sh		copy_wgbs_file_from_flowlane.sh
fastq_split.sh		fastq_split.sh
final_samtools_insertsize.R		final_samtools_insertsize.R
insert_size_plots_using_samtools.sh		insert_size_plots_using_samtools.sh
mergeUnsorted_dedup_files_for_methExtraction.sh		mergeUnsorted_dedup_files_for_methExtraction.sh
oneliner_summarise_python_bismark_qc_analysis.sh		oneliner_summarise_python_bismark_qc_analysis.sh
python_bismark_qc_analysis.py		python_bismark_qc_analysis.py
sort_for_coverage.sh		sort_for_coverage.sh
trim_galore_bismark_alignment.sh		trim_galore_bismark_alignment.sh

Repository files navigation

## WGBS Pipeline

[Description]
Basically, this is a bismark_pipeline for paired-end fastqs. The idea would be to split the fastq files in to smaller ones (around 18 million reads) and parallelize the job for QC using trim galore, and then simultaneously align in parallel with the bowtie2 instances on different nodes, which would  provide us the bam alignment quicker. Eventually, we could merge those unsorted bam files back to remove the PCR duplicates, and further run on methylation_extractor or coverage metrics on those merged de-duplicated bams.

[Usage:]
./bismark_pipeline_main.sh 

[List of variables]:
Following variables must be changed in 'bismark_pipeline_main.sh'

OUTPUT_LOC, 
INPUT_LOC, 
LIB_LIST, 
GENOME_PATH, 
BISMARK_PATH, 
SAMTOOLS_PATH, 
TRIMGALORE_PATH, 
BOWTIE_PATH, 
CORE_NUM