Skip to content
This repository was archived by the owner on Jul 28, 2025. It is now read-only.
This repository was archived by the owner on Jul 28, 2025. It is now read-only.

Large Bam files fail to copy into the output directory #733

@simalicrum

Description

@simalicrum

Describe the bug
Running the workflow here on a >100GB dataset: https://mondrian-scwgs.github.io/mondrian/#/

Workflow steps appear to complete successfully until the end of the last workflow task where a 77GB output Bam should be copied into the output directory to complete the run. All other output files are copied into the output directory.

The successfully completed analysis output Bam is successfully built but never copied from the /cromwell-executions/ container.

Trigger file is moved into the 'failed' directory with the error "CromwellFailed" in the json output.

Steps to Reproduce
Running the workflow here on a large dataset: https://mondrian-scwgs.github.io/mondrian/#/ with a large output file in the /cromwell-executions/ workflow directory.

Expected behavior
Large output files should be copied into specified output location.

Deployment details: (any information you can provide would be helpful):
Cromwell on Azure 4.5 deployment with no changes to configuration.

Screenshots
Drilled down into AKS workload container 'cromwell' and found the following Exceptions:

image
image

Additional context
Workflow runs with smaller output files using the identical workflow files complete successfully.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions