ARROW-12560: [C++] Add scheduling option for Future callbacks #10258

westonpace · 2021-05-06T09:34:28Z

Previously a future's callbacks would always run synchronously, either as part of Future::MarkFinished or as part of Future::AddCallback. Executor::Transfer made it possible to schedule continuations on a new thread but it would only take effect if the transferred future's callbacks were added before the source future finished. There are times when the desired behavior is to spawn a new thread task even if the source future is finished already.

This PR adds three scheduling options:

Never - The default (and existing) behavior, never spawn a new task
IfUnfinished - Spawn a new task only if the future isn't already finished when the callback is added
Always - Always spawn a new task, on both finished and unfinished futures, regardless of destination thread pool idleness.

The Never option doesn't make any sense for transferring so the transfer only has two choices (always or if unfinished).

github-actions · 2021-05-06T09:34:48Z

https://issues.apache.org/jira/browse/ARROW-12560

westonpace · 2021-05-06T09:34:56Z

CC @pitrou I think you referenced this in your latest execution engine PR.

pitrou · 2021-05-10T13:40:15Z

I'm not sure why you're suggesting to add so much sophistication. To me there are only two interesting options: "always" and "if unfinished". So we could have Transfer (transfer always) vs. TransferUnfinished.

westonpace · 2021-05-25T00:33:07Z

Since I'm working on work stealing at the thread pool level I agree that idle is no longer needed. I've cleaned this up and rebased. It's much simpler than it was before.

westonpace · 2021-05-25T00:36:51Z

Also, I ran into a bit of trouble with the future callback's weak reference to the future. Before we could just assume it was valid since all callbacks were completed before MarkFinished was completed. Now, it is possible for a future to schedule a callback and that callback to far outlive the call to MarkFinished. So now when a callback is scheduled (run on an executor) we make a copy of the FutureImpl's shared_ptr to keep it alive until that callback has a chance to run.

pitrou

Sorry for the delay. This looks good to me on the principle.

cpp/src/arrow/util/future.cc

pitrou · 2021-06-01T08:25:45Z

cpp/src/arrow/util/future.cc

pitrou · 2021-06-01T08:26:58Z

cpp/src/arrow/util/future.cc

Why "a copy"? It's not clear to me where a copy is being made.

The copy is a few lines down when we call shared_from_this. I'll move the comment and make it more explicit.

If it's only the shared_ptr copy, then I'm not sure it's worth mentioning.

I think the more important thing is that we are intentionally extending the lifetime of the future. I reworded the comment a bit and dropped the "copy". I can always remove it if we want.

cpp/src/arrow/util/future.cc

pitrou · 2021-06-01T08:28:38Z

cpp/src/arrow/util/future.cc

The coding conventions prohibit passing mutable lrefs. You could make this a CallbackRecord&&, for example.

pitrou · 2021-06-01T08:38:09Z

cpp/src/arrow/util/future.h

We should avoid using ALL_CAPS names, because of potential clashes with macros (this is a common issue with Windows headers, unfortunately).

Hmm, technically the style guide prefers kAlways but I see Always used more often in Arrow. Although some of the gandiva code uses kAlways. (https://google.github.io/styleguide/cppguide.html#Enumerator_Names). Any preference?

Always sounds fine to me.

Ok. I'll make a PR to add this to the style guide docs as well.

cpp/src/arrow/util/future_test.cc

pitrou · 2021-06-01T08:40:43Z

cpp/src/arrow/util/future_test.cc

It's a bit weird to have this in a private test file, and the mock executor in a .h.

My rationale was only that DelayedExecutor is only used in future_test.cc while MockExecutor is used in future_test.cc and thread_pool_test.cc but I see your point. I'll move this into test_common.h.

pitrou · 2021-06-01T08:41:03Z

cpp/src/arrow/util/future_test.cc

NEVER is never tested?

Added a test.

pitrou · 2021-06-01T08:45:21Z

cpp/src/arrow/util/thread_pool_test.cc

Nit, but it would probably be nicer to be able to spell this as TransferAlways(fut).

westonpace · 2021-06-02T04:06:47Z

@pitrou Don't worry about the delay, I've been plenty busy elsewhere. I have a just a few follow-up questions and then I'll make the changes.

westonpace · 2021-06-05T02:37:20Z

Ok, I've addressed the comments and this is ready for review again.

…hould schedule a new thread task. Previously callbacks always ran synchronously.

…inished has completed so the previous 'rely on WeakFuture being valid' trick no longer worked

…ing.

… shadow a typed enum

pitrou · 2021-06-07T13:30:52Z

Thanks for the update @westonpace . I'll merge once CI passes.

github-actions bot added the Component: C++ label May 6, 2021

westonpace marked this pull request as draft May 6, 2021 09:35

westonpace force-pushed the feature/ARROW-12560--c-investigate-utilizing-aggressive-thread-task branch from c95244a to e992bb6 Compare May 24, 2021 22:05

westonpace marked this pull request as ready for review May 25, 2021 00:33

pitrou requested changes Jun 1, 2021

View reviewed changes

westonpace force-pushed the feature/ARROW-12560--c-investigate-utilizing-aggressive-thread-task branch 3 times, most recently from c038210 to a4bcf0f Compare June 4, 2021 22:41

westonpace requested a review from pitrou June 5, 2021 02:37

westonpace added 8 commits June 7, 2021 15:26

ARROW-12560: Added the ability to specify whether a Future callback s…

1b8a830

…hould schedule a new thread task. Previously callbacks always ran synchronously.

ARROW-12560: Callbacks scheduled using ScheduleAlways run after MarkF…

30e1c65

…inished has completed so the previous 'rely on WeakFuture being valid' trick no longer worked

ARROW-12560: Lint

db73b85

ARROW-12560: Build errors on Windows

689d958

ARROW-12560: WIP

4174fa5

ARROW-12560: Addressing PR comments

2ceb1a5

ARROW-12560: Moved ShouldSchedule to a scoped enum. Fix compiler warn…

7297aed

…ing.

ARROW-12560: Mingw was getting confused and allowing a method name to…

85418af

… shadow a typed enum

pitrou changed the title ~~ARROW-12560: [C++] Investigate utilizing aggressive thread task creation when adding callback to finished future~~ ARROW-12560: [C++] Add scheduling option for Future callbacks Jun 7, 2021

Remove unused includes

747c498

pitrou force-pushed the feature/ARROW-12560--c-investigate-utilizing-aggressive-thread-task branch from be5aa79 to 747c498 Compare June 7, 2021 13:30

pitrou approved these changes Jun 7, 2021

View reviewed changes

pitrou closed this in e7b6c4a Jun 7, 2021

westonpace deleted the feature/ARROW-12560--c-investigate-utilizing-aggressive-thread-task branch January 6, 2022 08:17

asfimport mentioned this pull request Jun 7, 2021

[C++] Investigate utilizing aggressive thread task creation when adding callback to finished future #28320

Closed

ARROW-12560: [C++] Add scheduling option for Future callbacks #10258

ARROW-12560: [C++] Add scheduling option for Future callbacks #10258

Uh oh!

Conversation

westonpace commented May 6, 2021 • edited by pitrou Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented May 6, 2021

Uh oh!

westonpace commented May 6, 2021

Uh oh!

pitrou commented May 10, 2021

Uh oh!

westonpace commented May 25, 2021

Uh oh!

westonpace commented May 25, 2021

Uh oh!

pitrou left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

westonpace commented Jun 2, 2021

Uh oh!

westonpace commented Jun 5, 2021

Uh oh!

pitrou commented Jun 7, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

westonpace commented May 6, 2021 •

edited by pitrou

Loading