Template of pipeline using cgatcore.
- Clone the new repository to the computer where you wish to run the pipeline.
- Create a Conda environment named
pipeline-envusing the fileenvironment.yml. - Run the pipeline!
- On a High-Performance Computing (HPC) cluster,
python pipeline.py make full -v 5, to use the Distributed Resource Management Application API (DRMAA). - On a local machine
python pipeline.py make full -v 5 --local.
- On a High-Performance Computing (HPC) cluster,
- Create a new repository from this one, using the
Use as templatebutton on GitHub.- That way, your new repository starts its own commit history, where you can record your own changes!
- Only fork this repository if you wish to contribute updates to the template pipeline itself.
- Clone the new repository to the computer where the pipeline will be run.
- Edit the file
pipeline.pyto define the pipeline workflow.- Add import statements to the
Importssection. - Edit the section
Configurationas needed, if needed. - Replace the contents of the section
Workflowwith your own pipeline steps. - The
Main executionsection should be left as-is
- Add import statements to the
- Edit the file
environment.ymlto define a Conda environment for running this pipeline. - Edit the configuration of the pipeline as needed, in the file
config.yml. - Edit the configuration of the pipeline as needed, in the file
config.yml. - Run the pipeline!
- On a High-Performance Computing (HPC) cluster,
python pipeline.py make full -v 5, to use the Distributed Resource Management Application API (DRMAA). - On a local machine
python pipeline.py make full -v 5 --local.
- On a High-Performance Computing (HPC) cluster,