Add documentation for state mismatch between db and executor #41593

RNHTTR · 2024-08-19T15:27:15Z

This was discussed at length in this comment to #40468

potiuk

I lke the approach "something wrong" is good enough, it's great that this is actionable.

eladkal · 2024-08-19T16:23:38Z

docs/apache-airflow/troubleshooting.rst

 Troubleshooting
 ===============

+Obscure scheduling failures


I rather we won't place this here.

Our goal should be shorting this doc not adding more stuff to it.

I rather we explain the mechanisem of scheduler-executor dynamic in the relevant section of the docs and then explain the limits/edge cases.

This particular log message ("could not queue task %s (still running after %d attempts)") is essentially useless. The troubleshooting page adds context that we can't fit into a log message. It'd be difficult to provide this context in a way users can find in the normal documentation.

But you need to consider that people land in the doc without hitting the message you are refering to.

I wonder if maybe the message should refer to github discussion arround the problem. I am not a fan of troubleshooting pages. If this is a bug/limitation/problem we don't have a good solution for then it's an open issue.

eladkal · 2024-08-19T16:24:00Z

docs/apache-airflow/troubleshooting.rst

+----------------------------------------------------
+
+This indicates that when the scheduler queried the Airflow database, it observed that the task instance had one status according to the Airflow metadata database,
+but a different status according to the executor. A common example is when the query returned to the scheduler, the task instance was in the ``queued`` status,


What query?

i will clarify which query.

eladkal · 2024-08-19T16:24:13Z

docs/apache-airflow/troubleshooting.rst

+but a different status according to the executor. A common example is when the query returned to the scheduler, the task instance was in the ``queued`` status,
+but the status according to the executor was ``running``.
+
+This mismatch must have persisted for multiple attempts. When this happens, Airflow will not attempt to queue the task. It's possible that something has gone wrong


Attempts=airflow retries?

no -- good call that this needs to be clarified

github-actions · 2024-10-15T00:15:10Z

This pull request has been automatically marked as stale because it has not had recent activity. It will be closed in 5 days if no further activity occurs. Thank you for your contributions.

RNHTTR added 2 commits August 19, 2024 11:25

record db/executor state mismatch in troubleshooting.rst

0d5bd19

record db/executor state mismatch in troubleshooting.rst

54c7b56

RNHTTR requested review from XD-DENG, ashb, hussein-awala, kaxil, o-nikolas, pierrejeambrun and potiuk as code owners August 19, 2024 15:27

boring-cyborg bot added area:Executors-core LocalExecutor & SequentialExecutor kind:documentation labels Aug 19, 2024

potiuk approved these changes Aug 19, 2024

View reviewed changes

eladkal reviewed Aug 19, 2024

View reviewed changes

hussein-awala approved these changes Aug 21, 2024

View reviewed changes

pierrejeambrun approved these changes Aug 29, 2024

View reviewed changes

github-actions bot added the stale Stale PRs per the .github/workflows/stale.yml policy file label Oct 15, 2024

github-actions bot closed this Oct 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add documentation for state mismatch between db and executor #41593

Add documentation for state mismatch between db and executor #41593

Uh oh!

RNHTTR commented Aug 19, 2024

Uh oh!

potiuk left a comment

Uh oh!

eladkal Aug 19, 2024 •

edited

Loading

Uh oh!

RNHTTR Aug 19, 2024

Uh oh!

eladkal Aug 19, 2024

Uh oh!

eladkal Aug 19, 2024

Uh oh!

RNHTTR Aug 19, 2024

Uh oh!

eladkal Aug 19, 2024

Uh oh!

RNHTTR Aug 19, 2024

Uh oh!

github-actions bot commented Oct 15, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Add documentation for state mismatch between db and executor #41593

Add documentation for state mismatch between db and executor #41593

Uh oh!

Conversation

RNHTTR commented Aug 19, 2024

Uh oh!

potiuk left a comment

Choose a reason for hiding this comment

Uh oh!

eladkal Aug 19, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

RNHTTR Aug 19, 2024

Choose a reason for hiding this comment

Uh oh!

eladkal Aug 19, 2024

Choose a reason for hiding this comment

Uh oh!

eladkal Aug 19, 2024

Choose a reason for hiding this comment

Uh oh!

RNHTTR Aug 19, 2024

Choose a reason for hiding this comment

Uh oh!

eladkal Aug 19, 2024

Choose a reason for hiding this comment

Uh oh!

RNHTTR Aug 19, 2024

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Oct 15, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

eladkal Aug 19, 2024 •

edited

Loading