Skip to content

Trigger runner process locked with multiple Workflow triggers #50185

@ArvidMartensRenson

Description

@ArvidMartensRenson

Apache Airflow version

3.0.0

If "Other Airflow 2 version" selected, which one?

No response

What happened?

I am running a few dags that can start multiple workflow triggers. When the amount of triggers is low, below 10 or so; all works fine but whenever it goes higher the trigger runner seems to get stuck. I traced it back to one of the dag_count calls on RuntimeTaskInstance getting stuck/taking too long which locks all the other triggers and the syncing of the trigger runner as the SUPERVISOR_COMMS object stays in its locked state. The triggers seems to be started at the same time.

I already tried to solve the problem by introducing a timeout on the count calls but that made the trigger runner crash as the incoming messages are getting mixed up (response of a cancelled call finally getting through). Only solution I see right now is to perform API calls on the public endpoint for the dag count method calls.

What you think should happen instead?

No response

How to reproduce

Have more than twenty or so workflow triggers running started at the same, I suppose they poll states at about the same time.

Operating System

debian 12

Versions of Apache Airflow Providers

No response

Deployment

Official Apache Airflow Helm Chart

Deployment details

No response

Anything else?

No response

Are you willing to submit PR?

  • Yes I am willing to submit a PR!

Code of Conduct

Metadata

Metadata

Assignees

Labels

affected_version:3.0Issues Reported for 3.0area:Triggererarea:corekind:bugThis is a clearly a bugpriority:highHigh priority bug that should be patched quickly but does not require immediate new release

Type

No type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions