Context
Snakemake has removed the --stats option in v8, so I'm looking into the --report option for long term workflow stats.
The Snakemake report must be generated after the workflow has finished. I thought this would be as simple as attaching/downloading an old AWS Batch job then running nextstrain build . --report.
When I did this for ncov-ingest, I saw a bunch of warnings along the lines of:
Missing metadata for file data/gisaid/metadata.tsv. Maybe metadata was deleted or it was created using an older version of Snakemake. This is a non critical warning.
I then realized we are explicitly excluding Snakemake state in the downloads from AWS Batch:
|
# We don't want the remote Snakemake state to interfere locally… |
|
".snakemake/", |
|
|
|
# Ignore Python bytecode |
|
"*.pyc", |
|
"__pycache__/", |
|
]) |
|
|
|
included = path_matcher([ |
|
# But we do want the Snakemake logs to come over. |
|
".snakemake/log/", |
|
]) |
Possible solutions
- Include
.snakemake/metadata in the downloads from AWS Batch so that users can generate the Snakemake report locally.
- Automatically generate the Snakemake report within the AWS Batch job so that users can download the rendered report
[2] definitely seems like the nicer option and maybe should be applied across all runtimes for nextstrain build?
Context
Snakemake has removed the
--statsoption in v8, so I'm looking into the--reportoption for long term workflow stats.The Snakemake report must be generated after the workflow has finished. I thought this would be as simple as attaching/downloading an old AWS Batch job then running
nextstrain build . --report.When I did this for ncov-ingest, I saw a bunch of warnings along the lines of:
I then realized we are explicitly excluding Snakemake state in the downloads from AWS Batch:
cli/nextstrain/cli/runner/aws_batch/s3.py
Lines 113 to 124 in 8ed779c
Possible solutions
.snakemake/metadatain the downloads from AWS Batch so that users can generate the Snakemake report locally.[2] definitely seems like the nicer option and maybe should be applied across all runtimes for
nextstrain build?