
Conversation

@mingmwang
Contributor

Which issue does this PR close?

Closes #1805.

Rationale for this change

Add a new streaming-style, push-based shuffle implementation.

What changes are included in this PR?

  1. New streaming shuffle reader implementation
  2. New PushPartition gRPC call in Arrow Flight (see the sketch below)
  3. All-at-once stage scheduler

Are there any user-facing changes?

No
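
To make the data flow concrete, here is a minimal sketch (not the PR's actual code) in which in-process tokio channels stand in for the PushPartition gRPC call; all names (`RecordBatch`, `run_map_task`, `run_reduce_task`) are hypothetical:

```rust
// Minimal sketch, not the PR's actual code: in-process channels model the
// PushPartition gRPC call so the push-based data flow is visible.
use tokio::sync::mpsc;

// Stand-in for an Arrow RecordBatch.
#[derive(Debug)]
struct RecordBatch(Vec<u64>);

// Map side: push each output batch straight to its target partition instead
// of spilling it to a shuffle file for the reader to pull later.
async fn run_map_task(partition_senders: Vec<mpsc::Sender<RecordBatch>>) {
    let n = partition_senders.len() as u64;
    for i in 0..8u64 {
        let partition = (i % n) as usize;
        // In the PR this send would be a PushPartition call to the executor
        // hosting the downstream task.
        partition_senders[partition]
            .send(RecordBatch(vec![i]))
            .await
            .expect("receiver dropped");
    }
}

// Reduce side: the streaming shuffle reader consumes batches as they arrive,
// so the reduce stage starts before the map stage has finished.
async fn run_reduce_task(mut rx: mpsc::Receiver<RecordBatch>) {
    while let Some(batch) = rx.recv().await {
        println!("received {:?}", batch);
    }
}

#[tokio::main]
async fn main() {
    let (tx, rx) = mpsc::channel(16);
    let reduce = tokio::spawn(run_reduce_task(rx));
    run_map_task(vec![tx]).await; // dropping the sender closes the stream
    reduce.await.unwrap();
}
```

Because the consumer must already be running when the producer pushes, every stage of the plan has to be scheduled together, which is why the all-at-once stage scheduler is part of the same change.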

```rust
// Re-plan the input execution plan and create all-at-once query stages.
// For now we simply depend on the stage count to decide whether to create
// all-at-once or normal stages; in the future we can use a more
// sophisticated way to decide which way to go.
if stages.len() > 1 && stages.len() <= 4 {
```
Contributor

If I understand the original design correctly, the "all-at-once" plan will only get scheduled when there are sufficient task slots available to run the entire plan. So should this be a function of the total number of partitions?

Contributor Author

> If I understand the original design correctly, the "all-at-once" plan will only get scheduled when there are sufficient task slots available to run the entire plan. So should this be a function of the total number of partitions?

Yes, you are right. But currently the scheduler server doesn't have a clear view of how many task slots are available, so here I just add a simple check on the stage count. After @yahoNanJing refactors the scheduler state to keep more CPU/task info in the in-memory state, we can add more sophisticated check logic.
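
As a hedged illustration of that future check (names like `QueryStage`, `partition_count`, and `available_task_slots` are assumptions, not APIs from this PR), the decision could compare the plan's total partition count against the cluster's free task slots:

```rust
// Hypothetical sketch of the "more sophisticated" check discussed above:
// only pick all-at-once scheduling when the cluster can hold the whole plan.
struct QueryStage {
    partition_count: usize,
}

fn should_schedule_all_at_once(stages: &[QueryStage], available_task_slots: usize) -> bool {
    // Every partition of every stage needs a task slot at the same time,
    // so the total partition count must fit in the free slots.
    let total_partitions: usize = stages.iter().map(|s| s.partition_count).sum();
    stages.len() > 1 && total_partitions <= available_task_slots
}
```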

Comment on lines +173 to +177

```rust
// let schema = &self.schema;
// let rx = self.batch_receiver.lock().unwrap().pop().unwrap();
// let join_handle = tokio::task::spawn(async move {});
// Ok(RecordBatchReceiverStream::create(schema, rx, join_handle))
```
Member

Suggested change: remove the commented-out lines above.

Comment on lines +67 to +69
info!("planning query stages for job {}", job_id);
let (modified_plan, mut stages) = self
.plan_query_stages_internal(job_id, execution_plan.clone())
Member

I think this block is only used in the else branch below, when all-at-once mode is disabled?
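
A hedged sketch of the restructuring this comment suggests, using placeholder types rather than the PR's real ones:

```rust
// Placeholder types standing in for the real planner structures.
struct Plan;
struct Stage;

struct StagePlanner;

impl StagePlanner {
    fn plan_all_at_once(&self, _job_id: &str, _plan: &Plan) -> Vec<Stage> {
        vec![] // all-at-once stage planning
    }

    fn plan_query_stages_internal(&self, _job_id: &str, _plan: &Plan) -> Vec<Stage> {
        vec![] // normal (pull-based) stage planning
    }

    fn plan_query_stages(&self, job_id: &str, plan: Plan, all_at_once: bool) -> Vec<Stage> {
        if all_at_once {
            // The intermediate planning pass is skipped entirely on this path.
            self.plan_all_at_once(job_id, &plan)
        } else {
            // Only run the normal stage planner when all-at-once is disabled,
            // so its work is not wasted on the all-at-once path.
            self.plan_query_stages_internal(job_id, &plan)
        }
    }
}
```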

@alamb
Contributor

alamb commented Apr 15, 2022

Marking as draft (so it is easier to see which PRs are waiting for review).

@andygrove
Member

Closing this PR since it has not been updated in a long time. Feel free to re-open if this is still being worked on.

@andygrove closed this Nov 16, 2022


Development

Successfully merging this pull request may close these issues.

[Ballista] Streaming style push-based shuffle and All-at-once stage scheduling in Ballista
