[BEAM-14166] Performance improvements for RowWithGetter #17172

mosche · 2022-03-24T13:47:30Z

Push field type based logic in RowWithGetters down into FieldValueGetters to remove any branching and allow for better inlining and switch to TreeMap for caching to minimize memory footprint. See benchmark results below.

To not modify the behavior of getValues(), related calls are delegated to the original getter as is.

Thank you for your contribution! Follow this checklist to help us incorporate your contribution quickly and easily:

Choose reviewer(s) and mention them in a comment (R: @username).
Format the pull request title like [BEAM-XXX] Fixes bug in ApproximateQuantiles, where you replace BEAM-XXX with the appropriate JIRA issue, if applicable. This will automatically link the pull request to the issue.
Update CHANGES.md with noteworthy changes.
If this contribution is large, please file an Apache Individual Contributor License Agreement.

See the Contributor Guide for more tips on how to make review process smoother.

To check the build health, please visit https://github.com/apache/beam/blob/master/.test-infra/BUILD_STATUS.md

GitHub Actions Tests Status (on master branch)

See CI.md for more information about GitHub Actions CI.

mosche · 2022-03-24T13:49:05Z

R: @TheNeuralBit
R: @reuvenlax
R: @kennknowles

reuvenlax · 2022-03-24T15:25:41Z

By default, getValue results in generated bytecode to fetch the value. Is this cache really more efficient than the generated bytecode?

TheNeuralBit · 2022-03-24T15:32:50Z

By default, getValue results in generated bytecode to fetch the value. Is this cache really more efficient than the generated bytecode?

Maybe some microbenchmarks could help here

mosche · 2022-03-24T15:44:55Z

@reuvenlax certainly a valid question and I'm happy to discuss what's worth caching.

Though, the key point here is, that there's already a cache in place for array, iterable and map types. Processing for these is currently not ideal. For instance, looking at an array field the following steps happen on getValue:

Invoke getter, the generated byte code wraps the array in a list
Then lazily transform the list elements (Lists.transform) and cache the result

The problem is really on the 2nd access. Step 1) is repeated, but step 2) uses the cache discarding all the work of step 1).

This PR mostly addresses some of the inconsistencies using the existing cache and reduces the overhead of the cache by choosing better fitted data structures to store cached values.

mosche · 2022-03-24T15:55:19Z

Considering that elements of ARRAY types and ITERABLE types are lazily transformed, the value of caching is rather small (questionable?) as it happens on demand.

For MAP types it's a different story as the entire map gets rewritten on transform.

And if a cache is used, then any composite ROW type should be cached. Otherwise, if re-created every time, the cache is obviously pointless.

mosche · 2022-03-25T15:11:15Z

I started doing some jmh benchmarks for RowWithGetter. I'll follow up with next week on this...

reuvenlax · 2022-03-28T16:47:08Z

Benchmarks of this code are tricky to do, since the cost of codegen tends to dominate microbenchmarks.

mosche · 2022-03-29T08:22:41Z

I've pushed my benchmark code for reference, let me know if you have any suggestions. To prevent the issue you've mentioned @reuvenlax, I'm setting up a relatively large number of rows as JMH state before the actual benchmark invocation.

To establish a baseline, I'm looking at master first. Here are some initial results with some minimal changes to RowWithGetters to make these benchmarks meaningful:

Disable caching for the first run.
Change initialisation of the cache data structure to lazy init so associated costs are considered in the benchmark.

These numbers are certainly not very much in favour of the status quo.
I'm still iterating on improvements, but from what I've seen so far far there's lots that can be done.

mosche · 2022-04-01T14:36:08Z

I've drilled down into this a bit and I think I've got some interesting finding's to share @reuvenlax & @TheNeuralBit.

Investigating a few approaches, I would to suggest to push field type based logic in RowWithGetters down into FieldValueGetters to remove any branching and allow for better inlining, see code & benchmark.

I also looked a bit into costs of caching, the picture isn't as clear there. The costs of initializing any data structure facilitating a cache is certainly high compared to the costs of calling getters. One finding though was that TreeMap didn't perform any worse than HashMap. Given the much lower memory footprint that might be a good pick then. Also, using materialized Pojo lists helped to improve the performance gain from caching (compared to lazy transforms using Lists.transform).

On the other hand, I'm not sure what the original motivation for adding a field value cache in RowWithGetters was. Is it just about performance?

Some visualizations for a few selected runs:

mosche · 2022-04-11T15:34:31Z

Run Java PreCommit

…reeMap to reduce memory footprint of field value cache.

mosche · 2022-04-12T07:45:24Z

Run Java PreCommit

mosche · 2022-04-12T09:39:32Z

Run Java PreCommit

mosche · 2022-04-12T14:30:20Z

@TheNeuralBit @reuvenlax Please let me know how / if you wanna proceed here, so I can wrap up my initial PR #16947 accordingly. In any case, it was a very interesting dive into Beam Schemas. Thanks for all the pointers.

mosche · 2022-04-19T14:57:42Z

@TheNeuralBit @reuvenlax ping

aromanenko-dev · 2022-04-27T13:22:18Z

@TheNeuralBit @reuvenlax
Kind ping on this

TheNeuralBit · 2022-05-11T21:46:54Z

Run SQL PostCommit

TheNeuralBit · 2022-05-11T21:47:01Z

Run Java PreCommit

TheNeuralBit

Thanks @mosche for the extensive benchmarking, and apologies for the atrocious review latency!

I'm ok with this if it passes current tests, given the extensive benchmarking. A couple of things:

I'd still like to get some final feedback on the approach from @reuvenlax, but I acn go ahead and merge next Monday if that doesn't happen this week.
It could be interesting to run your benchmarks continuously to 1) monitor for regressions, and 2) give us a convenient way to test other potential improvements. Maybe file a jira for that if you don't have time now?

TheNeuralBit · 2022-05-11T21:47:15Z

sdks/java/core/src/main/java/org/apache/beam/sdk/schemas/FieldValueGetter.java

  @Nullable
  ValueT get(ObjectT object);

+  default @Nullable Object getRaw(ObjectT object) {


nit: maybe getUntyped is a more appropriate name here?

Also could you clarify why we need this?

Thanks so much for having a look @TheNeuralBit 🙏

getRaw() was based on a conversation with @reuvenlax.

getValues() is maybe poorly named - might be better called getRawValues. What you're looking for is probably the getBaseValues() method.
getValues is mostly used in code that knows exactly what it's doing for optimization purposes. It goes along with the attachValues method, which is similarly tricky to use. It's there to enable 0-copy code, but not necessarily intended for general consumption.

RowWithGetters.getValues() returns the "raw" unmodified result of the getters:

public List<Object> getValues() { return getters.stream().map(g -> g.get(getterTarget)).collect(Collectors.toList()); }

As I am pushing down the transformation of the getter result into the getter itself, I needed a way to bypass that in order to maintain the current semantics of getValues(). Let me know if the name makes sense given that context.

Yeah that makes sense, thanks. Could you add some of this context in a comment there?

@TheNeuralBit I've opened a new PR to add the missing comment, sorry for the delay.
#21982

reuvenlax · 2022-05-11T22:07:09Z

Let me take another look. IIRC I had some concerns, but it's possible that the extensive benchmarking addressed those.

…

On Wed, May 11, 2022 at 2:58 PM Brian Hulette ***@***.***> wrote: ***@***.**** approved this pull request. Thanks @mosche <https://github.com/mosche> for the extensive benchmarking, and apologies for the atrocious review latency! I'm ok with this if it passes current tests, given the extensive benchmarking. A couple of things: - I'd still like to get some final feedback on the approach from @reuvenlax <https://github.com/reuvenlax>, but I acn go ahead and merge next Monday if that doesn't happen this week. - It could be interesting to run your benchmarks continuously to 1) monitor for regressions, and 2) give us a convenient way to test other potential improvements. Maybe file a jira for that if you don't have time now? ------------------------------ In sdks/java/core/src/main/java/org/apache/beam/sdk/schemas/FieldValueGetter.java <#17172 (comment)>: > @@ -33,5 +33,9 @@ @nullable ValueT get(ObjectT object); + default @nullable Object getRaw(ObjectT object) { nit: maybe getUntyped is a more appropriate name here? Also could you clarify why we need this? — Reply to this email directly, view it on GitHub <#17172 (review)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AFAYJVOBUJMEVNFJNON7Y7DVJQUOVANCNFSM5RRHFBKQ> . You are receiving this because you were mentioned.Message ID: ***@***.***>

mosche · 2022-05-17T07:52:38Z

@reuvenlax Kind ping, did you already get the chance to have another look at this? Just to make it clear, this is not about caching ... that was just a minor almost irrelevant part of this

aaltay · 2022-05-27T02:54:34Z

@reuvenlax - Would you be able to take another look?

TheNeuralBit · 2022-05-28T00:10:48Z

I think this is fine to go ahead and merge. From what I can tell Reuven's original concern was that this would start caching more types than were already being cached in RowWithGetters, but this is just moving around the caching logic.

aromanenko-dev · 2022-05-30T08:06:35Z

@TheNeuralBit Thanks for moving forward and finally merge it!

mosche · 2022-05-30T08:28:18Z

@TheNeuralBit Thanks a lot 🙏 I still owe you the comment on FieldValueGetter.getRaw, i'll follow up shortly!

mosche · 2022-06-07T12:23:15Z

Resolves #21634

reuvenlax · 2022-10-11T07:47:55Z

Will look at it today or tomorrow - currently on a flight.

…

On Tue, May 17, 2022 at 3:52 AM Moritz Mack ***@***.***> wrote: @reuvenlax <https://github.com/reuvenlax> Kind ping, did you already get the chance to have another look at this? Just to make it clear, this is not about caching ... that was just a minor almost irrelevant part of this — Reply to this email directly, view it on GitHub <#17172 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AFAYJVOUKSLU2SLR3QEQZ3TVKNF5LANCNFSM5RRHFBKQ> . You are receiving this because you were mentioned.Message ID: ***@***.***>

github-actions bot added the java label Mar 24, 2022

mosche mentioned this pull request Mar 24, 2022

[BEAM-13416] Introduce Schema provider for AWS model classes extending SdkPojo #16947

Merged

4 tasks

mosche force-pushed the BEAM-14166-RowWithGetter branch from 81faf59 to 9ddc834 Compare March 24, 2022 13:56

mosche force-pushed the BEAM-14166-RowWithGetter branch from 9ddc834 to c06f88b Compare April 1, 2022 15:41

[BEAM-14166] Push logic in RowWithGetters down into getters and use T…

f57b9f2

…reeMap to reduce memory footprint of field value cache.

mosche force-pushed the BEAM-14166-RowWithGetter branch from c06f88b to f57b9f2 Compare April 11, 2022 18:05

mosche mentioned this pull request Apr 22, 2022

[WIP][BEAM-8715] Bump Avro version to 1.9.2 #17372

Closed

4 tasks

aaltay requested review from TheNeuralBit and reuvenlax May 5, 2022 14:56

TheNeuralBit approved these changes May 11, 2022

View reviewed changes

mosche changed the title ~~[BEAM-14166] Performance / Cache improvements to RowWithGetter~~ [BEAM-14166] Performance improvements for RowWithGetter May 17, 2022

TheNeuralBit merged commit 7c47893 into apache:master May 28, 2022

mosche deleted the BEAM-14166-RowWithGetter branch June 7, 2022 12:24

mosche mentioned this pull request Jun 22, 2022

[#21634] Add comments on FieldValueGetter. #21982

Merged

4 tasks

mosche mentioned this pull request Aug 8, 2023

Conversion from Avro GenericRecords to Beam Rows takes too much time #20894

Open

[BEAM-14166] Performance improvements for RowWithGetter #17172

[BEAM-14166] Performance improvements for RowWithGetter #17172

Uh oh!

Conversation

mosche commented Mar 24, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

GitHub Actions Tests Status (on master branch)

Uh oh!

mosche commented Mar 24, 2022

Uh oh!

reuvenlax commented Mar 24, 2022

Uh oh!

TheNeuralBit commented Mar 24, 2022

Uh oh!

mosche commented Mar 24, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mosche commented Mar 24, 2022

Uh oh!

mosche commented Mar 25, 2022

Uh oh!

reuvenlax commented Mar 28, 2022

Uh oh!

mosche commented Mar 29, 2022

Uh oh!

mosche commented Apr 1, 2022

Uh oh!

mosche commented Apr 11, 2022

Uh oh!

mosche commented Apr 12, 2022

Uh oh!

mosche commented Apr 12, 2022

Uh oh!

mosche commented Apr 12, 2022

Uh oh!

mosche commented Apr 19, 2022

Uh oh!

aromanenko-dev commented Apr 27, 2022

Uh oh!

TheNeuralBit commented May 11, 2022

Uh oh!

TheNeuralBit commented May 11, 2022

Uh oh!

TheNeuralBit left a comment

Choose a reason for hiding this comment

Uh oh!

TheNeuralBit May 11, 2022

Choose a reason for hiding this comment

Uh oh!

mosche May 12, 2022

Choose a reason for hiding this comment

Uh oh!

TheNeuralBit May 12, 2022

Choose a reason for hiding this comment

Uh oh!

mosche Jun 22, 2022

Choose a reason for hiding this comment

Uh oh!

reuvenlax commented May 11, 2022 via email

Uh oh!

mosche commented May 17, 2022

Uh oh!

aaltay commented May 27, 2022

Uh oh!

TheNeuralBit commented May 28, 2022

Uh oh!

aromanenko-dev commented May 30, 2022

Uh oh!

mosche commented May 30, 2022

Uh oh!

mosche commented Jun 7, 2022

Uh oh!

reuvenlax commented Oct 11, 2022 via email

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

mosche commented Mar 24, 2022 •

edited

Loading

mosche commented Mar 24, 2022 •

edited

Loading