Conversation

@mrocklin mrocklin commented Dec 3, 2020

Normally we have one thread pool executor for deserializing data
and a separate thread pool executor in the worker for execution.
Sometimes this presents a problem because some data types want to be
used in the thread in which they were created. One example of this is
TensorFlow graphs, but there are others.

One way to resolve this is to reuse the same executor in both
situations, and ensure that it has only one thread. This means that
execution and deserialization will block each other (not great) but that
user data will always be operated on in one thread only.
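
As a rough illustration of the idea (plain asyncio and concurrent.futures only, not the actual dask internals):

```python
import asyncio
from concurrent.futures import ThreadPoolExecutor

# Conceptual sketch: one single-threaded executor shared by deserialization
# and execution, so thread-affine objects (e.g. TensorFlow graphs) are always
# touched from the same thread.
shared_executor = ThreadPoolExecutor(max_workers=1)

async def handle(frames, deserialize, execute):
    loop = asyncio.get_running_loop()
    # Deserialize on the shared worker thread ...
    obj = await loop.run_in_executor(shared_executor, deserialize, frames)
    # ... then execute on that very same thread.
    return await loop.run_in_executor(shared_executor, execute, obj)
```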

This commit implements that, and drafts up a small test. However, it is
still broken because Dask can be clever in some situations and
deserialize directly on the event loop. This happens for a few reasons
today:

  1. For small messages we deserialize on the event loop for performance
     reasons. The user can control this with the
     `distributed.comm.offload` configuration value.

     I recommend a value of 1, meaning a single byte (see the sketch after
     this list).

  2. The scheduler-client comms intentionally do not offload today
     (grep for the `allow_offload=False`).
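
As a point of reference, here is a minimal sketch of how that threshold could be set, assuming standard `dask.config` usage (a value of 1 means one byte, so essentially everything gets offloaded):

```python
import dask

# Sketch: lower the offload threshold so that essentially every message is
# deserialized off the event loop rather than on it.
dask.config.set({"distributed.comm.offload": 1})
```

Note that, as discussed further down in this thread, the threshold ends up being read at startup time, so in practice it would need to be set before the worker starts.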

mrocklin and others added 2 commits December 3, 2020 10:18
@jrbourbeau

Nice, this is really interesting. We also don't offload deserialization of tasks on the worker:

```python
def _deserialize(function=None, args=None, kwargs=None, task=no_value):
    """ Deserialize task inputs and regularize to func, args, kwargs """
    if function is not None:
        function = loads_function(function)
    if args:
        args = pickle.loads(args)
    if kwargs:
        kwargs = pickle.loads(kwargs)
    if task is not no_value:
        assert not function and not args and not kwargs
        function = execute_task
        args = (task,)
    return function, args or (), kwargs or {}
```

so for this to work, we'll need to add some additional offloading logic there, for example:

```diff
-function, args, kwargs = _deserialize(*ts.runspec)
+# Offload deserializing large tasks
+offload_threshold = get_offload_threshold()
+if sizeof(ts.runspec) > offload_threshold:
```
Member

I want to think about if there's a better way to do this. This may lead to a slowdown if we start submitting task deserialization to a busy offloading thread pool.

```diff
-if FRAME_OFFLOAD_THRESHOLD and allow_offload:
+# Offload serializing large frames to improve event loop responsiveness.
+offload_threshold = get_offload_threshold()
+if offload_threshold and allow_offload:
```
Member Author


Hrm, this may be expensive to do on every message. Thoughts?

@jrbourbeau jrbourbeau Dec 4, 2020


It looks like this adds ~3µs:

```python
In [1]: import dask

In [2]: from distributed.comm.utils import get_offload_threshold

In [3]: %timeit get_offload_threshold()
2.96 µs ± 44.7 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)
```

This change was so we could set `distributed.comm.offload` at runtime (i.e. with `dask.config.set(distributed__comm__offload=1):`) instead of just at startup time. Though I agree that, since this is run for every message, the increased overhead may not be worth the extra flexibility.
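
For context, a hedged sketch of what such a per-message lookup might look like (the actual helper in this PR may differ); the configuration read on every call is what adds the ~3 µs measured above:

```python
import dask
from dask.utils import parse_bytes

def get_offload_threshold() -> int:
    # Re-read the configuration on every call so that dask.config.set(...)
    # at runtime takes effect immediately, at the cost of a few microseconds
    # per message.
    threshold = dask.config.get("distributed.comm.offload")
    if isinstance(threshold, str):
        threshold = parse_bytes(threshold)
    return threshold
```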

Member


I reverted the addition of `get_offload_threshold` and we now just pull in `distributed.comm.offload` at startup time.
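
That is, roughly the following, read once at import time rather than per message (a sketch of the startup-time approach, not necessarily the exact code in the PR):

```python
import dask
from dask.utils import parse_bytes

# Read the threshold once at import/startup time.  Changing the config
# afterwards has no effect, but nothing extra runs on the per-message hot path.
OFFLOAD_THRESHOLD = dask.config.get("distributed.comm.offload")
if isinstance(OFFLOAD_THRESHOLD, str):
    OFFLOAD_THRESHOLD = parse_bytes(OFFLOAD_THRESHOLD)
```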

mrocklin and others added 4 commits December 7, 2020 18:38
Otherwise I think that we open ourselves to some slowdown on the event
loop and the possibility of deadlocks (maybe?)
```diff
-function, args, kwargs = _deserialize(*ts.runspec)
+# Offload deserializing large tasks
+if sizeof(ts.runspec) > OFFLOAD_THRESHOLD:
+    function, args, kwargs = await offload(_deserialize, *ts.runspec)
```
Member


+1 this is definitely nicer


mrocklin commented Dec 9, 2020

OK. This seems ok to me. Merging in.

@mrocklin mrocklin merged commit d54388c into dask:master Dec 9, 2020
@mrocklin mrocklin deleted the worker-offload-executor branch December 9, 2020 00:23
@jakirkham

cc @Carreau (who IIRC had a similar use case in the past)
