Skip to content

Conversation

@crusaderky
Copy link
Collaborator

@crusaderky crusaderky commented Mar 10, 2022

Broken out of #5922

  • The {"op": "task-finished"} worker message was sending unused keys "status" and "thread", which were silently swallowed by the **kwargs of transition_processing_memory and transition_waiting_memory.
  • The {"op": "task-erred"} worker message was sending unused keys "status", "thread", and "startstops", which were silently swallowed by the **kwargs of transition_processing_erred.

@crusaderky crusaderky requested a review from fjetter March 10, 2022 18:18
@crusaderky crusaderky self-assigned this Mar 10, 2022
@crusaderky crusaderky marked this pull request as ready for review March 10, 2022 18:23
@crusaderky crusaderky marked this pull request as draft March 10, 2022 18:45
@github-actions
Copy link
Contributor

github-actions bot commented Mar 10, 2022

Unit Test Results

       8 files   -        4         8 suites   - 4   3h 1m 17s ⏱️ - 2h 41m 53s
2 636 tests  -        3  2 525 ✔️  -      31    83 💤 +    3  28 +25 
6 542 runs   - 6 435  6 185 ✔️  - 6 151  301 💤  - 336  56 +52 

For more details on these failures, see this check.

Results for commit cfcf7d6. ± Comparison against base commit f9d2f91.

♻️ This comment has been updated with latest results.

@crusaderky
Copy link
Collaborator Author

Blocked by #5932. As long as it's there I can't tell if I'm introducing regressions or not.

Copy link
Member

@fjetter fjetter left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think the tests

FAILED distributed/dashboard/tests/test_scheduler_bokeh.py::test_task_stream
FAILED distributed/dashboard/tests/test_scheduler_bokeh.py::test_task_stream_n_rectangles
FAILED distributed/dashboard/tests/test_scheduler_bokeh.py::test_task_stream_clear_interval
FAILED distributed/dashboard/tests/test_scheduler_bokeh.py::test_lots_of_tasks
FAILED distributed/diagnostics/tests/test_eventstream.py::test_eventstream - ...
FAILED distributed/diagnostics/tests/test_task_stream.py::test_TaskStreamPlugin
FAILED distributed/diagnostics/tests/test_task_stream.py::test_get_task_stream_save

are failing because we remove this information. If we want to clean this up further, we should probably submit a metadata kwarg in the task-erred/finished messages to make this explicit instead of implicit

Comment on lines -2388 to -2389
if ts.startstops:
smsg["startstops"] = ts.startstops
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think this is still used for our task stream. Erred tasks should be visualized as black bars. If this is not the case, this might be a regression

"op": "task-erred",
"status": "error",
"key": ts.key,
"thread": self.threads.get(ts.key),
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I believe this is used in the dashboard to assign the startstops to the proper lane

"status": "OK",
"key": ts.key,
"nbytes": ts.nbytes,
"thread": self.threads.get(ts.key),
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

same here, I believe this is required

@crusaderky
Copy link
Collaborator Author

crusaderky commented Mar 25, 2022

I think this seemingly innocent cleanup - which turns out to break everything everywhere - highlights a fundamendal design problem, which is that Worker is sending data to Scheduler but there is no straighforward way to figure out where, if anywhere, it is used - short of trying to blindly remove it and see if any tests break.

Anyway this is on hold for now. I'll see at a later date if there is anything I can salvage out of this.

@crusaderky
Copy link
Collaborator Author

This is ancient and the effort required in trying to salvage any of it would not justify the returns. Closing.

@crusaderky crusaderky closed this Aug 10, 2022
@crusaderky crusaderky deleted the swallow_kwargs branch August 10, 2022 09:54
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants