Skip to content

Conversation

@AutomationDev85
Copy link
Contributor

Hi airflow community,
this is my fifth PR and be happy to work DAG parsing runtime. We faced an issue with slow dag parsing if DAG has huge amount of dagruns and DB query is slow down the run time during DAG parsing. Then the DAG parsing was skipped due to long running query and this is not required if DAG is not scheduled. I think DAGs which are not scheduled have biggest change to have huge number dagruns and this save runtime during this use case.

This PR will check is the DAG is not scheduled and then skips the DB query which are not required to be executed.

@AutomationDev85 AutomationDev85 changed the title Added runtime save if dag is not scheduleable Save DAG parsing time if dag is not scheduleable Apr 27, 2023
@AutomationDev85 AutomationDev85 force-pushed the feature/optimize-scheduler-run-time-5 branch from 747a0bb to ec38a14 Compare June 13, 2023 09:55
Copy link
Member

@uranusjr uranusjr left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just one suggestion to clarify a comment. Logic looks good to me.

@eladkal eladkal added this to the Airflow 2.7.0 milestone Jul 8, 2023
@eladkal eladkal added the type:improvement Changelog: Improvements label Jul 8, 2023
@potiuk potiuk merged commit a99721d into apache:main Jul 9, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

area:core type:improvement Changelog: Improvements

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants