-
-
Notifications
You must be signed in to change notification settings - Fork 748
Do not mark tests xfailed if cluster doesn't come up in time #5824
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Do not mark tests xfailed if cluster doesn't come up in time #5824
Conversation
Unit Test Results 12 files + 1 12 suites +1 7h 27m 38s ⏱️ + 52m 50s For more details on these failures, see this check. Results for commit c6f8f6c. ± Comparison against base commit d2d76c0. ♻️ This comment has been updated with latest results. |
87454e5 to
c6f8f6c
Compare
|
It appears something is actually wrong. The ubu/py3.9 job is running for more than an hour without any stdout |
|
The timed out job actually finished running the test suite as expected without problems, all green. https://github.com/dask/distributed/runs/5248080147?check_suite_focus=true However, is hang during teardown Anybody seen this before? |
|
This stuck test job appears to be an old problem, see #2925 #3885 for an attempt to implement the hotfix What's happening is that the weakref finalizer of a SSH process tries to close it but that apparently never finishes. This behaviour is confusing me since based on the above issue I would expect it to return but keep a zombie process around |
|
I'll move on with this change since I believe raising the exceptions explicitly is valuable and I don't see how the ssh problem could connect to the changes I'm proposing but if it happens again after merging we can revert this again. |
Not the most elegant solution but should do the trick