Miscellaneous bioinformatics

Navigating common challenges in microbial ecology.

./Cutadapt

How to trim primer sequences from Illumina 16S rRNA gene reads. Code is currently available for the V4 and V4-V5 regions.
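As a rough sketch of what primer trimming for paired-end V4 reads can look like, the function below builds a Cutadapt command. The primers are the widely used 515F/806R pair and are an assumption, as are the function name and file layout; the code in ./Cutadapt may use different primers and options. The command is printed rather than run, so it can be inspected first.

```shell
#!/usr/bin/env bash
# Sketch: build a Cutadapt command that trims V4 primers from paired-end reads.
# Assumed primers (515F/806R); swap in the primers used for your library.
FWD_PRIMER="GTGCCAGCMGCCGCGGTAA"
REV_PRIMER="GGACTACHVGGGTWTCTAAT"

# Usage: v4_trim_cmd <R1.fastq.gz> <R2.fastq.gz> <output_dir>
v4_trim_cmd() {
  local r1="$1" r2="$2" out_dir="$3"
  # -g / -G: primer expected at the start of read 1 / read 2
  # --discard-untrimmed: discard reads without a primer match
  #                      (see --pair-filter for how pairs are handled)
  # -o / -p: trimmed output for read 1 / read 2
  printf 'cutadapt -g %s -G %s --discard-untrimmed -o %s -p %s %s %s\n' \
    "$FWD_PRIMER" "$REV_PRIMER" \
    "$out_dir/$(basename "$r1")" "$out_dir/$(basename "$r2")" \
    "$r1" "$r2"
}
```

Once the printed line looks right, it can be run by piping it to `bash`, for example `v4_trim_cmd raw/s1_R1.fastq.gz raw/s1_R2.fastq.gz trimmed | bash` (paths without spaces assumed).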

./DADA2

1. filterAndTrim_bigData.R. At the filter-and-trim step, processes samples in groups, one group at a time, instead of all samples simultaneously. This reduces run time and computing resources and helps avoid crashes.
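The batching idea can be sketched independently of R. The helper below is hypothetical and is not the logic of filterAndTrim_bigData.R: it only splits a list of sample names into groups of a fixed size, so that each group could be passed to a filtering step one at a time.

```shell
#!/usr/bin/env bash
# Sketch: split sample names (one per line on stdin) into batches of a fixed size.
# Each output line is one batch: the batch number, then its samples, tab-separated.
# Usage: batch_samples <batch_size> < samples.txt
batch_samples() {
  local size="$1"
  awk -v n="$size" '
    { line = (NR % n == 1 || n == 1) ? sprintf("%d\t%s", ++b, $0) : (line "\t" $0) }
    NR % n == 0 { print line; line = "" }
    END { if (line != "") print line }
  '
}
```

Each printed line can then be read in a loop and passed to whatever command processes a single group.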

catFASTQ.sh

Concatenate FASTQ files with identical names. Its original purpose was to combine files from two sequencing runs (on full and nano Illumina flow cells) on the same samples.
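A minimal sketch of this kind of concatenation, assuming the two runs sit in separate directories and that matching files share exactly the same name. The function name and directory layout are illustrative and may differ from what catFASTQ.sh actually does.

```shell
#!/usr/bin/env bash
# Sketch: for every FASTQ file in run1_dir that has a same-named file in
# run2_dir, write the concatenation of the two to out_dir under that name.
# Usage: cat_matching_fastq <run1_dir> <run2_dir> <out_dir>
cat_matching_fastq() {
  local run1="$1" run2="$2" out="$3" f name
  mkdir -p "$out"
  for f in "$run1"/*.fastq*; do
    [ -e "$f" ] || continue              # the glob matched nothing
    name=$(basename "$f")
    if [ -f "$run2/$name" ]; then
      cat "$f" "$run2/$name" > "$out/$name"
    else
      echo "no match in $run2 for $name, skipping" >&2
    fi
  done
}
```

Gzip-compressed FASTQ files can be concatenated the same way, because concatenated gzip streams form a valid gzip file.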

demultiplexFASTQ.sh

Trim primers and sort reads according to their unique barcodes. Mothur can also do this, but it merges paired-end reads in the process. This script was originally written to sort reads from multiple isolate 16S rRNA genes, sequenced simultaneously, based on unique oligos on the 5' ends of the primers. It will be updated to take a list of file names as input.
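To illustrate the sorting step, here is a simplified sketch that assigns single-end reads to samples by an exact barcode match at the 5' end and trims the barcode off. It is not the repository script: the function name, the tab-separated barcode table, and the `unassigned.fastq` output are assumptions, and it handles neither primer trimming nor mismatches.

```shell
#!/usr/bin/env bash
# Sketch: sort single-end FASTQ reads into one file per barcode, based on an
# exact match at the 5' end of the read, and trim the barcode off.
# barcodes.tsv (hypothetical): <sample_name><TAB><barcode_sequence>, one per line.
# Usage: demux_by_barcode <barcodes.tsv> <reads.fastq> <out_dir>
demux_by_barcode() {
  local barcodes="$1" reads="$2" out="$3"
  mkdir -p "$out"
  awk -F'\t' -v out="$out" '
    NR == FNR { bc[$2] = $1; next }            # first file: barcode -> sample
    FNR % 4 == 1 { header = $0; next }
    FNR % 4 == 2 { seq = $0; next }
    FNR % 4 == 3 { plus = $0; next }
    {                                          # fourth line: quality string
      qual = $0
      file = out "/unassigned.fastq"
      for (b in bc) {
        if (index(seq, b) == 1) {              # barcode at the very start
          seq = substr(seq, length(b) + 1)
          qual = substr(qual, length(b) + 1)
          file = out "/" bc[b] ".fastq"
          break
        }
      }
      print header "\n" seq "\n" plus "\n" qual > file
    }
  ' "$barcodes" "$reads"
}
```

If one barcode is a prefix of another, which of the two is matched is arbitrary in this sketch, so barcodes of equal length are assumed.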

downloadMultipleSRA_series.sh

Download multiple files from the NCBI Sequence Read Archive. Use when the runs of interest are numbered consecutively, which is typical for BioProjects (e.g., runs in project PRJNA597057 range from SRR10755563 to SRR10755886).
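A sketch of how such a series can be expanded into individual accessions and downloaded. It assumes the SRA Toolkit (`prefetch`, `fasterq-dump`) is installed and on the PATH; the function names are illustrative and the repository script may be organized differently.

```shell
#!/usr/bin/env bash
# Sketch: print every run accession in a numeric series,
# e.g. SRR10755563 through SRR10755886.
# Assumes the numeric part has no leading zeros.
# Usage: sra_series <prefix> <first_number> <last_number>
sra_series() {
  local prefix="$1" i="$2" last="$3"
  while [ "$i" -le "$last" ]; do
    printf '%s%s\n' "$prefix" "$i"
    i=$((i + 1))
  done
}

# Download each run in the series and convert it to FASTQ.
# Usage: download_series <prefix> <first_number> <last_number>
download_series() {
  local run
  for run in $(sra_series "$@"); do
    prefetch "$run" && fasterq-dump --split-files "$run"
  done
}
```

For the example project, `sra_series SRR 10755563 10755886` lists the accessions without downloading anything, which is a cheap way to check the range before calling `download_series` with the same arguments.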

downloadMultipleSRA_text.sh

Download multiple files from the NCBI Sequence Read Archive. Use when the runs of interest are not named in a series. List the run accessions, one per line, in a text file called "runs.txt". For example:

lou$ head runs.txt
ERR2129782
ERR2129783
ERR2129800
ERR2129801
ERR2129803
ERR2129872
ERR2129873
ERR2129875
ERR2129891
ERR2129909
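Reading such a list can be sketched as follows. This assumes the SRA Toolkit is installed; the function name and the `DRY_RUN` switch are hypothetical and are not taken from the repository script.

```shell
#!/usr/bin/env bash
# Sketch: download every run accession listed in a text file, one per line.
# Set DRY_RUN=1 to print the commands instead of running them.
# Usage: download_from_list <runs.txt>
download_from_list() {
  local list="$1" run
  # Read the list on file descriptor 3 so the download tools cannot consume it.
  while IFS= read -r run <&3 || [ -n "$run" ]; do
    run=$(printf '%s' "$run" | tr -d '[:space:]')   # drop stray whitespace and CRs
    [ -n "$run" ] || continue                       # skip blank lines
    if [ "${DRY_RUN:-0}" = "1" ]; then
      echo "prefetch $run && fasterq-dump --split-files $run"
    else
      prefetch "$run" && fasterq-dump --split-files "$run"
    fi
  done 3< "$list"
}
```

Running `DRY_RUN=1 download_from_list runs.txt` first shows which accessions would be fetched.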
