Fix `test_balance_expensive_tasks` and improve helper functions in `test_steal.py` #7253

hendrikmakait · 2022-11-03T16:50:31Z

This PR improves the helper functions used in a number of test_balance* tests by simplifying the logic. Inspired by #7250.

Might fix #7137 (flaky test: test_balance_multiple_to_replica).

cc @crusaderky

Tests added / passed
Passes pre-commit run --all-files

github-actions · 2022-11-03T18:18:17Z

Unit Test Results

See test report for an extended history of previous test failures. This is useful for diagnosing flaky tests.

      24 files ±0       24 suites ±0 10h 12m 11s ⏱️ + 20m 57s
  3 319 tests ±0   3 211 ✔️ +1   105 💤 - 1 3 ❌ ±0
39 132 runs ±0 37 242 ✔️ +1 1 887 💤 - 1 3 ❌ ±0

For more details on these failures, see this check.

Results for commit c0a1abb. ± Comparison against base commit 6015a2c.

♻️ This comment has been updated with latest results.


crusaderky · 2022-11-08T12:42:45Z

distributed/tests/test_steal.py

                config or {},
                {
-                    "distributed.scheduler.unknown-task-duration": "1s",
+                    "distributed.scheduler.default-task-durations": default_task_durations,


Why is this change needed?

hendrikmakait · 2022-11-08

This was changed so that the test no longer relies on stealing tasks of unknown duration. Instead, we now provide default durations for all relevant tasks. This concern was initially voiced here:

> The test implicitly relies on tasks of unknown duration to be stolen (#5572). It should be changed not to rely on this specific use case.

#7243 (comment)
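As a rough illustration of the approach described above, the sketch below builds a configuration that assigns an explicit default duration to each relevant task prefix instead of relying on `distributed.scheduler.unknown-task-duration`. The helper name `build_steal_config` and the prefixes `slow-task` and `fast-task` are hypothetical; only the configuration key comes from the diff.

```python
from __future__ import annotations


def build_steal_config(
    durations: dict[str, float], config: dict | None = None
) -> dict:
    """Merge explicit per-prefix task durations into a test config.

    ``durations`` maps a task prefix to a duration in seconds.
    The helper and the prefixes used below are illustrative assumptions,
    not the code from this PR.
    """
    # Express each duration as a string such as "1s" or "0.5s".
    default_task_durations = {
        prefix: f"{seconds}s" for prefix, seconds in durations.items()
    }
    # User-supplied settings come first; the durations key is set last.
    return {
        **(config or {}),
        "distributed.scheduler.default-task-durations": default_task_durations,
    }


cfg = build_steal_config({"slow-task": 1, "fast-task": 0.5})
print(cfg)
```

The resulting mapping could then be passed to `dask.config.set`, so that every task in the test has a known duration before stealing is evaluated.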

crusaderky · 2022-11-08T12:51:43Z

distributed/tests/test_steal.py

+                list(key_split(key[5:]))  # Remove "task-" prefix
+                for key in w.data.keys()
+                if key.startswith("task-")
+            ]


Could you explain the reasoning behind this change?

hendrikmakait · 2022-11-08

The tests now check where the tasks were eventually executed after stealing took place. This is the behavior we want to test here, and it relies less on internals than the previous version, which checked the scheduler's worker state after stealing but while tasks were still processing.
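A minimal sketch of the idea, assuming plain dictionaries as stand-ins for each worker's `data` and a hypothetical key naming scheme of the form `task-<dependencies>-<index>`. The function `task_placement` and the trailing-index stripping are simplifications for illustration; the snippet in the diff uses `key_split` instead.

```python
from __future__ import annotations


def task_placement(worker_data: list[dict]) -> list[list[list[str]]]:
    """Report, per worker, which tasks ended up in its data.

    Each task is represented by the list of characters in its name,
    after removing the "task-" prefix and a trailing "-<index>" part.
    """
    placement = []
    for data in worker_data:
        tasks = []
        for key in data:
            if not key.startswith("task-"):
                continue  # Skip dependencies and other non-task keys
            name = key[len("task-"):]
            # Simplified stand-in for key_split: drop the trailing index
            prefix = name.rsplit("-", 1)[0]
            tasks.append(list(prefix))
        placement.append(tasks)
    return placement


# Two hypothetical workers after all tasks have finished
workers = [{"a": 1, "task-ab-0": 2}, {"task-a-1": 3}]
print(task_placement(workers))
```

Because the check only looks at results held by each worker once computation has finished, it does not depend on the scheduler's intermediate processing state.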


hendrikmakait · 2022-11-11T10:00:47Z

CI seems to be broken, I'm not sure what's causing this.

crusaderky · 2022-11-14T12:52:38Z

I fixed a trivial issue but test_balance_expensive_tasks is still failing

hendrikmakait · 2022-11-14T15:07:28Z

> I fixed a trivial issue but test_balance_expensive_tasks is still failing

Thanks for fixing the pre-commit issue, @crusaderky, that one totally tripped me up because I only checked the code on the branch but not the diff against main. I managed to fix test_balance_expensive_tasks by further adjusting the test setup helpers and slightly changing the tests. This was necessary since occupancy slightly changed with the new test helpers. This should be good for another review now.

hendrikmakait · 2022-11-14T17:15:36Z

After `test_balance_expensive_tasks` flaked 25% of the time on CI, I investigated and found that heartbeats affect the cost estimation; for example, adding `await asyncio.sleep(1)` before the calls to `steal.balance()` reliably fails the test. I've adjusted the code to avoid heartbeats, and the test has not flaked in 2000 local runs. For comparison, it previously flaked once in 500 runs. As long as we don't see any problems on CI, I'd finally consider this problem fixed.
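To put the local numbers in perspective, the short calculation below estimates how likely 2000 clean runs would be if the old flake rate still applied. It assumes that runs are independent and that the earlier rate was exactly one failure in 500 runs; both are simplifying assumptions, not measurements from this PR.

```python
# Assumed previous flake rate: one failure per 500 runs
old_flake_rate = 1 / 500
runs = 2000

# Probability of zero failures in `runs` independent runs at the old rate
p_no_flake = (1 - old_flake_rate) ** runs
print(f"{p_no_flake:.4f}")
```

Under these assumptions the probability is below 2%, so 2000 clean runs are reasonable evidence that the local flake rate dropped, although they say nothing about CI-specific conditions.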

hendrikmakait · 2022-11-14T17:25:39Z

...still investigating after CI keeps failing.

hendrikmakait added 5 commits November 3, 2022 16:15

Refactor assert_balanced

e9928dc

Add explanation

4dfaee8

Minor

21a6c91

Simplify _run_dependency_balance_test

9bc0afc

Configure default durations instead of unknown durations

ee13aa1

hendrikmakait changed the title ~~Improve robustness of helper functions in test_steal~~ Improve robustness of helper functions in test_steal.py Nov 3, 2022

hendrikmakait added 5 commits November 3, 2022 20:00

Trigger CI

a777cd4

Deterministic ordering for stealable

3a4f2d5

Retries

89ba73d

Revert "Deterministic ordering for stealable"

ba83af2

This reverts commit 3a4f2d5.

Minor reordering and FIXMEs

ee4ae2a

hendrikmakait self-assigned this Nov 4, 2022

crusaderky reviewed Nov 8, 2022

View reviewed changes

hendrikmakait and others added 2 commits November 8, 2022 14:33

Update distributed/tests/test_steal.py

0801df9

Co-authored-by: crusaderky <crusaderky@gmail.com>

Trigger CI

a5c5f1e

crusaderky added 2 commits November 14, 2022 12:25

Merge branch 'main' into improve-test-balance

d70b9a9

fix CI

548deed

Fix test_balance_expensive_tasks

ea54c98

hendrikmakait marked this pull request as draft November 14, 2022 16:07

Avoid heartbeats

3f53fda

hendrikmakait added 2 commits November 25, 2022 15:57

Merge branch 'main' into improve-test-balance

0b23d8f

Fix test

2365f7a

hendrikmakait marked this pull request as ready for review November 28, 2022 10:48

Add assertion to avoid mistakes in the future

7bcb4d1

hendrikmakait requested a review from crusaderky December 20, 2022 15:19

hendrikmakait added 2 commits December 20, 2022 16:57

Merge branch 'main' into improve-test-balance

cbbae16

Merge branch 'main' into improve-test-balance

c0a1abb

hendrikmakait changed the title ~~Improve robustness of helper functions in test_steal.py~~ Fix test_balance_expensive_tasks and improve helper functions in test_steal.py Jan 23, 2023

crusaderky approved these changes Jan 24, 2023

View reviewed changes

crusaderky merged commit 99d4112 into dask:main Jan 24, 2023
