Use new sequence type in alignment. by tmooney · Pull Request #741 · genome/analysis-workflows

tmooney · 2019-07-23T20:32:36Z

This is #740 plus an implementation of that concept in the pipelines that include the standard alignment subworkflows. I had to make an adapter subworkflow because Cromwell was not staging the file inputs when they were pulled as arguments directly leading to errors about files not being found.

I've tested this on the somatic_exome.yaml example but not run any full-scale pipelines yet.

apaul7

should sequence_align_and_tag_adapter.cwl be under definitions/subworkflows/ and not definitions/tools/?

tmooney · 2019-08-01T19:23:27Z

My snarky answer would be "it shouldn't exist at all!", but I moved it under subworkflows 😄.

apaul7

+1

jasonwalker80

In general, +1.

One comment, the new type is sequence_data and it has two items, sequence and readgroup. In the workflows, the inputs are always named sequence even though they are sequence_data inputs. One suggestion for consideration is to use tumor_sequence_data instead of tumor_sequence to avoid the confusion between the sequence_data input vs. the actual sequence property, ie. BAM or FASTQ, and the readgroup property.

NOTE: I realize my differentiation between input vs. property may not be correct. I'm happy to discuss in slack in more detail if this is confusing.

tmooney added 5 commits July 18, 2019 12:50

New type to represent sequence data.

0ab1c2a

New tool to align sequence data regardless of filetype.

69ecfe2

Subworkflow to perform alignment on "sequence_data".

eece0dc

Use new sequence type for standard DNA alignment process.

6259aa5

Use an adapter to circumvent issues in staging files.

b2b638d

tmooney force-pushed the dna_pipelines_use_sequence_type branch from 2914f78 to b2b638d Compare July 23, 2019 21:26

apaul7 reviewed Jul 23, 2019

View reviewed changes

Move adapter under subworkflows.

398f7da

apaul7 approved these changes Aug 1, 2019

View reviewed changes

jasonwalker80 approved these changes Aug 7, 2019

View reviewed changes

tmooney merged commit 60edaf6 into genome:master Aug 13, 2019

tmooney deleted the dna_pipelines_use_sequence_type branch August 13, 2019 18:02

chrisamiller mentioned this pull request Sep 5, 2019

Cram support #609

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use new sequence type in alignment.#741

Use new sequence type in alignment.#741
tmooney merged 6 commits intogenome:masterfrom
tmooney:dna_pipelines_use_sequence_type

tmooney commented Jul 23, 2019

Uh oh!

apaul7 left a comment

Uh oh!

tmooney commented Aug 1, 2019

Uh oh!

apaul7 left a comment

Uh oh!

jasonwalker80 left a comment •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments

Conversation

tmooney commented Jul 23, 2019

Uh oh!

apaul7 left a comment

Choose a reason for hiding this comment

Uh oh!

tmooney commented Aug 1, 2019

Uh oh!

apaul7 left a comment

Choose a reason for hiding this comment

Uh oh!

jasonwalker80 left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments

jasonwalker80 left a comment •

edited

Loading