[BEAM-9822] Disable grouping when streaming #11532

nielm · 2020-04-27T08:14:18Z

Grouping adds significant latency and memory use.
When streaming, end-to-end pipeline latency is important, and many worker threads are executed, meaning that OOM's can frequently occur.

This PR disables grouping by default in streaming mode, ensuring lower memory use and faster end-end latency.

Note, this PR is dependent on PR #11528

Post-Commit Tests Status (on master branch)

Lang	SDK	Apex	Dataflow	Gearpump	Samza
Go		---	---	---	---
Java
Python		---		---	---
XLang	---	---	---	---	---

Pre-Commit Tests Status (on master branch)

---	Java	Python	Go	Website
Non-portable
Portable	---		---	---

See .test-infra/jenkins/README for trigger phrase, status and link of all Jenkins jobs.

allenpradeep · 2020-05-01T19:58:38Z

...ava/io/google-cloud-platform/src/main/java/org/apache/beam/sdk/io/gcp/spanner/SpannerIO.java

I was wondering if this condition needs to be based on the input passed to this stage or based on some parameter from the user?

it's kinda both!
If the source is unbounded (streaming) - and the groupingFactor has not been specified by the user, then default to no grouping.

Is there any chance that someone using SpannerIO in a streaming pipeline is relying on the default grouping factor being 1000? I'm concerned this backwards-incompatible change could break them. Would it be sufficient to just give users the option to disable batching by setting the grouping factor to 1?

They already can set groupingFactorb to 1 if they want...
Breaking backward compatibility: unlikely.

The default of 1000 causes OOMs when using streaming, with wide windows, and high throughput... When this happens, it is not always obvious that grouping is the issue...

With smaller windows/less throughput, it is much less likely that a group will be filled, (groups are bounded by bundles, which are bounded by windows)., So it is unlikely that anyone ever got to fill the group with 1000 batches.

They already can set groupingFactorb to 1 if they want...

Ha yeah sorry that was unclear. At the time I thought that groupingFactor = 1 enabled the optimization in #11529, so I was wondering if this was really necessary since users could just enable them by setting grouping factor manually. But I see now that grouping is separate from batching. And its disabling batching that enables your other PR.

allenpradeep · 2020-05-06T17:36:20Z

LGTM.

nielm · 2020-05-18T23:45:29Z

Retest this please

TheNeuralBit · 2020-05-18T23:55:52Z

Retest this please

TheNeuralBit · 2020-05-18T23:59:17Z

Looks like you need to run spotless to auto-format. You can use ./gradlew spotlessApply to do that locally (may need to do it on the other PRs as well)

Grouping adds significant latency and memory use, and when streaming this causes both OOMs and high pipeline latencies.

nielm · 2020-05-19T11:07:22Z

Retest this please

nielm · 2020-05-19T11:07:45Z

Looks like you need to run spotless to auto-format.
Done - sorry!

TheNeuralBit · 2020-05-19T15:31:02Z

Retest this please

TheNeuralBit · 2020-05-19T15:37:46Z

Retest this please

TheNeuralBit · 2020-05-19T15:48:27Z

...ava/io/google-cloud-platform/src/main/java/org/apache/beam/sdk/io/gcp/spanner/SpannerIO.java

+                                  .orElse(
+                                      input.isBounded() == IsBounded.BOUNDED
+                                          ? DEFAULT_GROUPING_FACTOR
+                                          : 1),


It would be nice if this were another constant. We could have DEFAULT_GROUPING_FACTOR_BOUNDED and DEFAULT_GROUPING_FACTOR_UNBOUNDED. It doesn't need to be done here, could be in a follow-up PR.

Will do in a separate pr

TheNeuralBit

LGTM, thanks! I'll go ahead and merge this, but can you add a note about this in CHANGES.md in another PR?

probot-autolabeler bot added gcp io java labels Apr 27, 2020

nielm mentioned this pull request Apr 27, 2020

[BEAM-9822] Simplify pipeline when batching is disabled. #11529

Merged

nielm changed the title ~~Disable grouping streaming~~ []BEAM-9822] Disable grouping streaming Apr 27, 2020

nielm force-pushed the disableGroupingStreaming branch 2 times, most recently from 8b56f44 to 865a11b Compare April 27, 2020 11:14

nielm changed the title ~~[]BEAM-9822] Disable grouping streaming~~ [BEAM-9822] Disable grouping when streaming Apr 27, 2020

nielm mentioned this pull request Apr 29, 2020

[BEAM-10047] Merge the stages 'Gather and Sort' and 'Create Batches' #11570

Merged

nielm force-pushed the disableGroupingStreaming branch from 865a11b to 573bfb9 Compare May 1, 2020 11:22

allenpradeep reviewed May 1, 2020

View reviewed changes

nielm force-pushed the disableGroupingStreaming branch from 573bfb9 to cecb959 Compare May 3, 2020 10:14

allenpradeep approved these changes May 6, 2020

View reviewed changes

nielm force-pushed the disableGroupingStreaming branch 2 times, most recently from 9e56c08 to 3cc214b Compare May 18, 2020 23:42

Disable grouping by default when streaming.

d64df6a

Grouping adds significant latency and memory use, and when streaming this causes both OOMs and high pipeline latencies.

nielm force-pushed the disableGroupingStreaming branch from 3cc214b to d64df6a Compare May 19, 2020 11:07

TheNeuralBit reviewed May 19, 2020

View reviewed changes

TheNeuralBit approved these changes May 19, 2020

View reviewed changes

TheNeuralBit merged commit 3f2d648 into apache:master May 19, 2020

[BEAM-9822] Disable grouping when streaming #11532

[BEAM-9822] Disable grouping when streaming #11532

Uh oh!

Conversation

nielm commented Apr 27, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Post-Commit Tests Status (on master branch)

Pre-Commit Tests Status (on master branch)

Uh oh!

allenpradeep May 1, 2020

Choose a reason for hiding this comment

Uh oh!

nielm May 18, 2020

Choose a reason for hiding this comment

Uh oh!

TheNeuralBit May 19, 2020

Choose a reason for hiding this comment

Uh oh!

nielm May 19, 2020

Choose a reason for hiding this comment

Uh oh!

TheNeuralBit May 19, 2020

Choose a reason for hiding this comment

Uh oh!

allenpradeep commented May 6, 2020

Uh oh!

nielm commented May 18, 2020

Uh oh!

TheNeuralBit commented May 18, 2020

Uh oh!

TheNeuralBit commented May 18, 2020

Uh oh!

nielm commented May 19, 2020

Uh oh!

nielm commented May 19, 2020

Uh oh!

TheNeuralBit commented May 19, 2020

Uh oh!

TheNeuralBit commented May 19, 2020

Uh oh!

TheNeuralBit May 19, 2020

Choose a reason for hiding this comment

Uh oh!

nielm May 19, 2020

Choose a reason for hiding this comment

Uh oh!

TheNeuralBit left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

nielm commented Apr 27, 2020 •

edited

Loading