AIP-67 - Multi Team: Update Celery Executor to support multi team #60675
dheerajturaga merged 16 commits into apache:main
Conversation
Updating the Celery executor to read from team-based config and also support multiple instances running concurrently. The latter is the largest source of changes. Much of the Celery configuration (both the Airflow config and the Celery config) was module based, and modules are cached and shared in Python, so the majority of the changes move that module-level config code to be function based (while also trying to maintain backwards compatibility). The way Celery tasks are sent to workers changed as a consequence: since sending tasks is parallelized across multiple processes (which do not share memory with the parent), the send-task logic now re-creates a Celery app in the subprocesses, because the pickling and unpickling that Python does to pass state to the subprocesses was not reliably producing the correct Celery app objects.
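To make the refactor concrete, here is a minimal sketch of the pattern described above, not the provider's actual code: a function-based app factory plus re-creation of the Celery app inside the sending subprocesses. The _get_config helper, its env-var fallback, and all signatures are illustrative assumptions; only the create_celery_app name comes from the commit history.

```python
from __future__ import annotations

import os
from concurrent.futures import ProcessPoolExecutor
from functools import partial

from celery import Celery


def _get_config(section: str, key: str, team_id: str | None) -> str:
    # Hypothetical stand-in for a team-aware config lookup; the real provider
    # goes through Airflow's configuration machinery instead.
    suffix = f"__{team_id.upper()}" if team_id else ""
    return os.environ.get(f"AIRFLOW__{section.upper()}__{key.upper()}{suffix}", "")


def create_celery_app(team_id: str | None = None) -> Celery:
    # Build the app at call time so each executor instance (and each team)
    # gets its own configuration, instead of freezing it once at module import.
    return Celery(
        "airflow.providers.celery",
        broker=_get_config("celery", "broker_url", team_id),
        backend=_get_config("celery", "result_backend", team_id),
    )


def _send_task_in_subprocess(task_tuple, team_id: str | None = None):
    # Runs in a child process: re-create the Celery app here rather than rely on
    # pickled parent state, which did not reliably round-trip the app object.
    key, command, queue, task_name = task_tuple
    app = create_celery_app(team_id)
    result = app.send_task(task_name, args=[command], queue=queue)
    return key, result.id


def send_tasks(task_tuples, team_id: str | None = None, parallelism: int = 4):
    send = partial(_send_task_in_subprocess, team_id=team_id)
    with ProcessPoolExecutor(max_workers=parallelism) as pool:
        return list(pool.map(send, task_tuples))
```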
dheerajturaga
left a comment
Thanks for the enhancements! The current implementation introduces static type errors that will fail our CI pipeline. I have provided inline patches to correct the type signatures and ensure compliance with our MyPy configuration.
providers/celery/src/airflow/providers/celery/executors/celery_executor_utils.py
providers/celery/tests/integration/celery/test_celery_executor.py
@o-nikolas I have tested the general functionality of the celery worker with your changes, as well as the CLI. Things are working as expected. However, I don't see how I can test the --team flag.
Fallback to global conf if we're not running on 3.2+ airflow
As inferred by the presence of the correct ExecutorConf methods being available.
Thanks for the thorough review @dheerajturaga! I've addressed those issues (slightly differently for one than your suggested patch). I'm currently struggling with the back-compat tests. It's slow/difficult because those tests do not run successfully in breeze on my laptop, so I have to push to the PR to test each change. As far as testing with the --team flag goes: for that you have to have a full multi-team setup, which we don't have great documentation for yet (coming soon). The most helpful testing is actually on the back-compat side (testing with Airflow 2.11 and 3.1.X).
providers/celery/src/airflow/providers/celery/executors/celery_executor.py
providers/celery/src/airflow/providers/celery/executors/celery_executor_utils.py
dheerajturaga
left a comment
@o-nikolas, I have run backward compatibility checks. Things work well in 2.11.0; however, when I tried this with 3.1.3 I found issues. There is an incomplete API contract in the ExecutorConf class between Airflow versions.
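For illustration, a rough sketch of the capability-probing fallback discussed in this thread: probe for the team-aware method before using it, otherwise fall back to the global lookup. The team_get name and its signature are assumptions standing in for whatever the newer ExecutorConf surface actually exposes; this is not the provider's final code.

```python
def lookup(conf, section: str, key: str, team_id: str | None = None) -> str:
    # "team_get" stands in for whichever team-aware getter newer Airflow's
    # ExecutorConf exposes; probing for it keeps older Airflow working.
    if team_id and hasattr(conf, "team_get"):
        return conf.team_get(section, key, team_id=team_id)
    # Older Airflow (2.11, 3.1.x): only the global lookup is available.
    return conf.get(section, key)
```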
dheerajturaga
left a comment
Hopefully this is the final set of changes needed to be consistent. Everything else looks good.
providers/celery/src/airflow/providers/celery/cli/definition.py
providers/celery/src/airflow/providers/celery/cli/celery_command.py
Co-authored-by: Dheeraj Turaga <dheerajturaga@gmail.com>
The request for changes was left after the fixes were already applied.
dheerajturaga
left a comment
Awesome! Thanks so much for your patience, I know there was a lot of back and forth. Current changes look good!
No worries @dheerajturaga! I appreciate the thorough review and testing :) I made sure to commit one of your suggestions to get you tagged as a co-author for your efforts!
…ache#60675)

* Update Celery Executor to support multi team

  Updating the Celery executor to read from team based config and also support multiple instances running concurrently. The latter is the largest source of changes. Much of the celery configuration (both Airflow config and Celery config) was module based. Modules are cached and shared in Python. So the majority of the changes are moving that module level config code to be function based (while trying to also maintain backwards compatibility). The way Celery tasks are sent to workers also changed as a consequence of this. Since sending tasks is parallelized with multiple processes (which do not share memory with the parent) the send task logic now re-creates a celery app in the sub processes (since the pickling and unpickling that python does to try pass state to the sub processes was not reliably creating the correct celery app objects).

* Fixes from PR CI
* Mypy sometimes makes code worse
* Fallback to global conf if we're not running on 3.2+ airflow

  As inferred by the presence of the correct ExecutorConf methods being available.

* More backcompat
* Skip multi team tests if not 3.2
* 3.2 not 3.1
* Add type annotation for create_celery_app
* Conditional or TYPE_CHECKING imports of ExercutorConfig
* More type fixups
* Use explicit version compat checks rather than trying to infer
* Test back compat on celery command
* fixup
* Apply suggestions from code review

  Co-authored-by: Dheeraj Turaga <dheerajturaga@gmail.com>

* New exception for unit test

---------

Co-authored-by: Dheeraj Turaga <dheerajturaga@gmail.com>
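As a rough illustration of the "use explicit version compat checks rather than trying to infer" item in the commit list above, a provider-style version gate might look like the sketch below. The 3.2 cutoff comes from the commit titles; the constant and function names are assumptions, not the actual provider code.

```python
from packaging.version import Version

from airflow import __version__ as AIRFLOW_VERSION

# Explicit cutoff instead of probing for methods; base_version strips any
# pre-release suffix so ".dev0" builds still compare correctly.
AIRFLOW_V_3_2_PLUS = Version(Version(AIRFLOW_VERSION).base_version) >= Version("3.2.0")


def supports_team_config() -> bool:
    # True on Airflow 3.2+, where the team-aware config surface exists;
    # callers fall back to the global configuration otherwise.
    return AIRFLOW_V_3_2_PLUS
```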
…n test

The test constructed ExecuteTask and TaskInstance via model_construct(), which bypasses Pydantic validation. Fields added or made required by apache#50825 (dag_version_id, pool_slots) and inherited from BaseDagBundleWorkload (token, dag_rel_path, bundle_info, log_path) were missing. This went unnoticed until apache#60675 changed task dispatch to run in ProcessPoolExecutor subprocesses where mock patches don't apply, causing the real execute_workload (with full schema validation) to run on the worker.
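A minimal illustration of the model_construct() pitfall described here, using a stand-in Pydantic model rather than Airflow's real ExecuteTask/TaskInstance classes:

```python
from pydantic import BaseModel, ValidationError


class Workload(BaseModel):
    token: str
    dag_rel_path: str
    pool_slots: int


# Normal construction validates and fails fast on the missing fields.
try:
    Workload(dag_rel_path="dags/example.py")
except ValidationError as err:
    print(f"validated construction fails: {err.error_count()} errors")

# model_construct() skips validation entirely, so the same incomplete payload
# is accepted here and only surfaces as an error later, e.g. when the worker
# side re-validates the workload with the full schema.
unchecked = Workload.model_construct(dag_rel_path="dags/example.py")
print(unchecked.dag_rel_path)  # "dags/example.py", despite missing required fields
```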
Was generative AI tooling used to co-author this PR?
Cline
{pr_number}.significant.rst or {issue_number}.significant.rst, in airflow-core/newsfragments.