Skip to content

Conversation

@madsbk
Copy link
Contributor

@madsbk madsbk commented Nov 17, 2022

Separate the handling of futures with the argument of a SubgraphCallable and futures within the SubgraphCallable itself.

Closes #7289
Closes #7303

  • Tests added / passed
  • Passes pre-commit run --all-files

@github-actions
Copy link
Contributor

github-actions bot commented Nov 17, 2022

Unit Test Results

See test report for an extended history of previous test failures. This is useful for diagnosing flaky tests.

       15 files  ±  0         15 suites  ±0   6h 44m 21s ⏱️ - 15m 51s
  3 226 tests +  5    3 141 ✔️ +  4    84 💤 ±0  1 +1 
23 852 runs  +35  22 942 ✔️ +34  909 💤 ±0  1 +1 

For more details on these failures, see this check.

Results for commit dd7479a. ± Comparison against base commit 69b74f9.

♻️ This comment has been updated with latest results.

@mrocklin
Copy link
Member

This does appear to fix the issue that I was seeing. I'm curious, what is your understanding of the problem and what did you do here to fix it?

@madsbk
Copy link
Contributor Author

madsbk commented Nov 17, 2022

This does appear to fix the issue that I was seeing. I'm curious, what is your understanding of the problem and what did you do here to fix it?

We need to do two things when unpacking futures surrounding SubgraphCallable:

  • We have to unpack all futures in the graph of the SubgraphCallable and add them to the input keys of the SubgraphCallable.
  • Also, we have to unpack futures in the arguments given to the SubgraphCallable. Since they are not used within the SubgraphCallable itself, they do not need to be added to the input keys of the SubgraphCallable, which is what we use to do.

@madsbk madsbk marked this pull request as ready for review November 17, 2022 15:11
@mrocklin
Copy link
Member

@fjetter is this something that you and your team can help to review?

@mrocklin
Copy link
Member

Also I hope that eventually something like #6028 makes things like this irrelevant

@fjetter
Copy link
Member

fjetter commented Nov 21, 2022

Changes to the code appear to be straightforward. Can we add a test for #7289 please? It sounds like we should be able to test this by asserting that the object is not in the output multiple times, shouldn't we?

@madsbk
Copy link
Contributor Author

madsbk commented Nov 21, 2022

Changes to the code appear to be straightforward. Can we add a test for #7289 please? It sounds like we should be able to test this by asserting that the object is not in the output multiple times, shouldn't we?

@mrocklin do you have a small reproducer? Otherwise, I can try to make a synthetic one.

@mrocklin
Copy link
Member

mrocklin commented Nov 21, 2022 via email

@madsbk
Copy link
Contributor Author

madsbk commented Nov 22, 2022

@fjetter added a test: dd7479a

Copy link
Member

@fjetter fjetter left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @madsbk !

@fjetter fjetter merged commit cff33d5 into dask:main Nov 24, 2022
@madsbk madsbk deleted the unpack_remotedata_cleanup branch December 7, 2022 13:20
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Numba serialization is slow sometimes

3 participants