Conversation

@mrocklin (Member)

This is just a proof of concept right now (many things fail), but it shows that we can ship graphs directly from the client to the scheduler without dealing with any of the pack/unpack machinery.

We pickle the entire graph and ship it to the scheduler, where we then do all of the graph manipulation that we used to do on the client.
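
Roughly, the flow looks like this (a minimal sketch with illustrative names; pack_for_scheduler and update_graph_from_bytes are not the PR's actual API, and the sketch assumes the HighLevelGraph is picklable):

    import pickle

    # Client side: pickle the HighLevelGraph untouched, together with the keys
    # the client wants back, and send the bytes to the scheduler.
    def pack_for_scheduler(hlg, keys):
        return pickle.dumps((hlg, list(keys)))

    # Scheduler side: unpickle, then do the materialization and per-layer
    # processing that previously happened on the client.
    def update_graph_from_bytes(payload):
        hlg, keys = pickle.loads(payload)
        dsk = dict(hlg)  # materialize layers into plain tasks
        # ... cull to the requested keys, resolve annotations, build task state ...
        return dsk, keys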

cc @rjzamora @madsbk @ian-r-rose

@mrocklin (Member, Author)

So far I am surprised by how far I was able to get with relatively few lines of code changed.

There is, I'm aware, a whole army of monsters waiting for me. Still, this feels pretty OK so far.

mrocklin added a commit to mrocklin/dask that referenced this pull request Mar 30, 2022
In dask/distributed#6028 we propose shipping the
entire graph to the scheduler with pickle.  This removes the need for
this custom protocol.
@mrocklin (Member, Author)

Dask PR here: #6028

@github-actions bot (Contributor) commented Mar 31, 2022

Unit Test Results

15 files ±0    15 suites ±0    7h 2m 6s ⏱️ −2s
2,730 tests +3:    2,645 ✔️ −1    81 💤 ±0     4 ❌ +4
20,201 runs +14:   19,242 ✔️ +11   937 💤 −19   22 ❌ +22

For more details on these failures, see this check.

Results for commit 517cd25. ± Comparison against base commit 5b6a64a.

♻️ This comment has been updated with latest results.

@mrocklin (Member, Author)

OK, here is a class of tests that we're going to have problems with:

    @gen_cluster(client=True)
    async def test_robust_unserializable(c, s, a, b):
        class Foo:
            def __getstate__(self):
                raise MyException()
    
        with pytest.raises(MyException):
>           future = c.submit(identity, Foo())

We're genuinely opening up the scheduler to more client issues. I think the rewards outweigh the risks, but there's definitely a regression here in terms of hardening.

@ian-r-rose self-requested a review March 31, 2022 16:26
@mrocklin (Member, Author)

OK, I spent a few hours hunting down a few of the test failures. Moving the graph handling onto the scheduler has a few effects on local behavior: some problems are only detected later in the process, which changes when users get negative feedback after doing something wrong. In practice these changes are small and, I think, not very likely to annoy people. They do exist though.

As an example, if someone has a future from another client, or a cancelled future, we don't actually learn about it until after the graph is constructed and things are resolved on the client. Whereas previously they would learn about this failure on a submit call, they now learn about it on a result call. This is a degradation of experience, but not a huge one.
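
A hypothetical illustration of that shift (not code from this PR; it assumes an async client c and a trivial inc function):

    f = c.submit(inc, 1)
    await c.cancel([f])

    # Before: packing the graph on the client noticed the cancelled dependency,
    # so the failure surfaced on the submit call itself.
    g = c.submit(inc, f)

    # After: the graph is only resolved once it reaches the scheduler, so the
    # problem surfaces when the result is requested.
    await g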

There are a few other things I can clean up (there is a long tail here) but in principle I think that the heavy parts of this are done. I would welcome feedback on if this is a direction that we want to pursue.

TODO on this PR

  • Fixup failing tests
  • Merge update_graph/update_graph_hlg
  • Jointly merge in the dask/dask PR

Future Work

  • Merge TaskGroups with Layers
  • See how best we can simplify existing layers logic
  • Remove cull, and replace with some nicer create_graph method

@ian-r-rose (Collaborator) left a comment


Here are some early thoughts; I'm still mulling this over. Correctly forwarding annotations from layers to tasks (especially in the presence of any optimizations) still feels fragile, though the approach you've taken here makes sense.

Pickling certainly seems like a win from the client perspective, as well as for the implementer of HLG layers. It does transfer some pain to the scheduler (which ultimately affects fewer maintainers, I suppose).

self.client_desires_keys(keys=keys, client=client)
return

dsk = dict(graph)
Collaborator

If we're moving more graph logic onto the scheduler, I wonder if it would make sense to also start doing some low-level graph optimization here (while retaining any output keys that we need for the result).

It would probably be tricky to get that right in the next block, however, since key names might be rewritten.
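
For example, something along these lines could run after materialization, preserving the requested output keys (a sketch only, not part of this PR; graph and keys are the values already available in update_graph):

    from dask.optimization import cull, fuse

    # Drop tasks that aren't needed to produce the requested output keys, then
    # fuse linear chains while telling fuse() to keep those keys intact.
    dsk = dict(graph)
    dsk, dependencies = cull(dsk, keys)
    dsk, dependencies = fuse(dsk, keys=keys, dependencies=dependencies)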

Member Author

Ideally, low-level graph optimization mostly goes away, I think. But in principle I agree that it can be done here. We'll want to be mindful of annotations, but we would need to do that anyway.

I'm hopeful though that high level graph optimization removes the need here.

with dask.config.set(optimization__fuse__active=False):
    x = await x.persist()

assert all({"workers": (a.address,)} == ts.annotations for ts in s.tasks.values())
Collaborator

Agree it's better to check a single source of truth here (though if I understand the above correctly, the annotations should still be there).

As far as I can tell, annotations still don't work here, correct? The intent seems reasonable to me, though.
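
For context, a sketch of how the annotation in the assert above would typically be attached on the client (assuming the public dask.annotate API; a is the worker fixture from the test, and this is not code from the PR):

    import dask
    import dask.array as da

    # Annotations set here are stored on the HighLevelGraph layers; with this PR
    # they must survive pickling and end up on the scheduler's TaskState.annotations.
    with dask.annotate(workers=[a.address]):
        x = da.ones(10, chunks=(5,))

    x = await x.persist()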

Member Author

It isn't as clear to me that this is desired. I don't mind pulling these out, personally. It's an open design choice, I think.

if layer.annotations and "retries" in layer.annotations:
    retries = retries or {}
    d = process(layer.annotations["retries"], keys=layer, string_keys=None)
    retries.update(d)  # TODO: there is an implicit ordering here
Collaborator

The ordering makes sense to me -- less specific to more specific.

Collaborator

Do you still consider these TODO? I think this ordering is fine (even better if it's documented somewhere)
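
For illustration, a hedged sketch of where the layer-by-layer ordering comes from (using the public dask.annotate API; not taken from this PR):

    import dask
    import dask.array as da

    # Broad annotation attached to the layers of the base collection...
    with dask.annotate(retries=2):
        x = da.ones((10, 10), chunks=(5, 5))

    # ...and a more specific one attached to the layers created afterwards.
    with dask.annotate(retries=5):
        total = x.sum()

    # The loop above updates a single retries mapping layer by layer, so the
    # order in which layers are visited determines which annotation wins if the
    # same key is covered more than once.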

@ian-r-rose (Collaborator)

A thought I had today (which might be off base):

Previously, using GPU-enabled Dask clusters could be somewhat painful if you don't have a GPU on your local machine (cf rapidsai/cudf#3661). That was one of the motivating use-cases of tools like afar. In this PR, the scheduler is doing much more graph manipulation, and in particular it is serializing tasks for workers. Does this mean that the scheduler would also need to have a GPU in those cases?

@jakirkham (Member)

That's already the case: the scheduler can deserialize things, or be directly involved with serialization, in some cases. This isn't desirable and is something we would ideally fix, but it is likely no worse with this PR than it already is. Others more in the weeds here should feel free to correct me if I've missed something.

@mrocklin (Member, Author)

Does this mean that the scheduler would also need to have a GPU in those cases?

yes

@mrocklin (Member, Author)

Heads-up, my plan with this PR is to wait until the stability work is done and we have strong evidence that it's solid. Then we can mint a new version that we're comfortable with and sit with that for a while.

Afterwards I'll work to merge this in and shake things up some more :)

@rjzamora (Member)

Regarding this comment in the HLG-roadmap issue: I'd like to help get this PR over the line, but my understanding of the client/scheduler code is quite limited compared to my understanding of dask/dask.

@mrocklin - Could you summarize what you expect the current state of this PR to be? Anything you know to be broken and/or missing? I noticed that many tests fail when I use the nuke-hlg branches in both distributed and dask, and that there are some minor conflicts with main. I will try to get these tests passing, but help/advice is very welcome.

@mrocklin (Member, Author)

@fjetter heads up, it looks like @rjzamora is becoming active here. In principle I think this is probably a good direction to go in (confidence 80% or so?), though it's likely to cause mild havoc.

@rjzamora (Member)

it looks like @rjzamora is becoming active here.

Yes - I started pushing on this a bit yesterday, and ~80% seems like a reasonable confidence level. On the dask/dask and HLG-development side, this is a huge win. However, we are making some clear trade-offs by always pickling HLGs (and there are probably issues we are not even considering yet).

One change that would probably make me feel more confident is if we preserved the old HLG-packing code path for the special case of materialized layers, and provided an option to materialize layers before shipping the hlg. We would still want to remove the dask_distributed_pack methods, but could preserve the pickle-free packing/unpacking logic somewhere. This would provide an escape hatch for users who are running into problems with pickle (maybe they have mismatched python environments, or no gpu on their scheduler, etc.).
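
A rough sketch of what that escape hatch could look like (the config key and helper name below are hypothetical, not part of this PR):

    import pickle
    import dask

    def pack_graph(hlg, keys):
        # Hypothetical client-side hook: optionally materialize the HighLevelGraph
        # into a plain dict of tasks before pickling, for users whose schedulers
        # can't safely unpickle layer objects (mismatched environments, no GPU, ...).
        if dask.config.get("distributed.client.materialize-graph", default=False):
            graph = dict(hlg)   # flatten all layers on the client
        else:
            graph = hlg         # ship layers as-is; materialize on the scheduler
        return pickle.dumps((graph, list(keys)))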

@mrocklin (Member, Author)

One change that would probably make me feel more confident is if we preserved the old HLG-packing code path for the special case of materialized layers, and provided an option to materialize layers before shipping the hlg. We would still want to remove the dask_distributed_pack methods, but could preserve the pickle-free packing/unpacking logic somewhere. This would provide an escape hatch for users who are running into problems with pickle (maybe they have mismatched python environments, or no gpu on their scheduler, etc.).

I'm inclined towards radical simplification here. I don't think that we should do more half-measures here. I think that we should burn things down. (but others may disagree)

@rjzamora (Member)

I'm inclined towards radical simplification here. I don't think that we should do more half-measures here. I think that we should burn things down. (but others may disagree)

I'll submit something to your branch and see what you think. It seems that we can run the exact same code to materialize and process the HLG on either the client or the scheduler. So, I don't see a huge benefit in avoiding the option to run it on the client (yet).

@mrocklin (Member, Author)

Could you summarize what you expect the current state of this PR to be? Anything you know to be broken and/or missing? I noticed that many tests fail when I use the nuke-hlg branches in both distributed and dask, and that there are some minor conflicts with main. I will try to get these tests passing, but help/advice is very welcome

I think that I had almost everything running smoothly. There was some trickiness around Variables/Semaphores that I think I worked out almost completely but not entirely.

@fjetter (Member) commented Aug 29, 2022

So far, I only skimmed this PR.

At first glance, it looks like we're putting more emphasis on Client.current, which makes me a bit nervous. The Client.current mechanism, and specifically the default-client mechanism, is very complex and at times pretty unreliable, especially in poorly isolated environments (many threads, asyncio, etc.).

This is a small thing, really, but I think we should use Client.current(allow_default=False) which actually returns None if there is no default client registered. I think this entire API should be cleaned up.
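
A small sketch of the suggested pattern; the allow_default keyword is taken from the description above, so treat the exact signature as an assumption:

    from distributed import Client

    # Prefer the explicit lookup that returns None over the default-client
    # fallback, so poorly isolated environments (threads, asyncio) fail loudly
    # instead of silently picking up an unrelated client.
    client = Client.current(allow_default=False)
    if client is None:
        # No client registered in this context; handle futures without one.
        ...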

Generally, if this is all about the serializability of Futures, maybe we should figure out how to (de-)serialize futures cleanly without touching client code? I might not have understood the problem, yet. Any pointers appreciated.


One change that would probably make me feel more confident is if we preserved the old HLG-packing code path for the special case of materialized layers, and provided an option to materialize layers before shipping the hlg. We would still want to remove the dask_distributed_pack methods, but could preserve the pickle-free packing/unpacking logic somewhere. This would provide an escape hatch for users who are running into problems with pickle (maybe they have mismatched python environments, or no gpu on their scheduler, etc.).

I'm inclined towards radical simplification here. I don't think that we should do more half-measures here. I think that we should burn things down. (but others may disagree)

How far would the "radical simplifications" go? Would this allow for simplifications in distributed.protocol.serialized, e.g. can we get rid of Serialized?

@mrocklin (Member, Author) commented Oct 11, 2022 via email
