
@sjvanrossum sjvanrossum commented Mar 4, 2025

The offset gap ratio may artificially shrink the backlog if consumers can't catch up to the tail of an expiring topic. This may cause runners to trigger a downscaling event which worsens the issue.

MovingAvg has been modified to atomically write the accumulated state, since concurrent plain loads/stores of longs/doubles may tear. The numUpdates field is only used by the writer and can be kept non-volatile, but the update method ensures that plain loads/stores of numUpdates are ordered relative to acquiring loads and releasing stores of avg. To prevent false sharing, I've padded the class, since there may be tens to hundreds of instances of the accumulator and updates happen per consumed record.
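A minimal sketch of the release/acquire publication scheme described above (hypothetical names and window constant, not Beam's actual MovingAvg): the writer publishes avg with a releasing store, readers use an acquiring load, and the writer-only numUpdates field stays plain because the release on avg orders it as well.

```java
import java.lang.invoke.MethodHandles;
import java.lang.invoke.VarHandle;

// Single-writer moving average with a safely published running value.
class MovingAvgSketch {
  private static final VarHandle AVG;
  static {
    try {
      AVG = MethodHandles.lookup().findVarHandle(MovingAvgSketch.class, "avg", double.class);
    } catch (ReflectiveOperationException e) {
      throw new ExceptionInInitializerError(e);
    }
  }

  private double avg;     // published via AVG.setRelease, read via AVG.getAcquire
  private int numUpdates; // writer-only, plain loads/stores suffice

  public void update(double quantity) {
    double prev = (double) AVG.getAcquire(this);
    int n = Math.min(numUpdates + 1, 1000); // invented window cap for illustration
    numUpdates = n;                         // plain store, ordered by the release below
    AVG.setRelease(this, prev + (quantity - prev) / n);
  }

  public double get() {
    return (double) AVG.getAcquire(this);
  }
}
```

A concurrent reader calling get() either sees the value from before or after an in-flight update(), never a torn double.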

The JMH benchmark I've added shows a slight improvement in average time per op (lower ns/op) for both reads and writes compared to the current implementation.

Results of task :sdks:java:io:kafka:jmh:jmh on a t2d-standard-60 Cloud Workstation:

Benchmark                                 Mode  Cnt       Score       Error  Units
KafkaIOUtilsBenchmark.Atomic              avgt   15   50693.751 ±  4937.770  ns/op
KafkaIOUtilsBenchmark.Atomic:atomicRead   avgt   15    5577.357 ±  1135.962  ns/op
KafkaIOUtilsBenchmark.Atomic:atomicWrite  avgt   15  140926.539 ± 14847.095  ns/op
KafkaIOUtilsBenchmark.Plain               avgt   15   65018.754 ±  9814.457  ns/op
KafkaIOUtilsBenchmark.Plain:plainRead     avgt   15    6658.736 ±   288.912  ns/op
KafkaIOUtilsBenchmark.Plain:plainWrite    avgt   15  181738.789 ± 29403.883  ns/op

Note that this test likely does not highlight the effect of padding since it doesn't construct a large pool of accumulators.
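For reference, class-hierarchy padding of the kind mentioned in the description can be sketched like this (a hypothetical layout; real cache-line sizes vary and the padding used in this PR may have differed):

```java
// Pad fields in superclasses so the hot fields sit on their own cache line
// and neighboring accumulators in a pool don't false-share.
class PrePad { long p00, p01, p02, p03, p04, p05, p06, p07; }      // ~64 bytes before
class HotFields extends PrePad { double avg; long numUpdates; }    // the contended state
class PaddedMovingAvg extends HotFields {
  long q00, q01, q02, q03, q04, q05, q06, q07;                     // ~64 bytes after
}
```

The superclass/subclass split is the classic trick to stop the JVM from reordering the padding fields next to each other; jdk.internal.vm.annotation.@Contended achieves the same but requires JVM flags.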


Thank you for your contribution! Follow this checklist to help us incorporate your contribution quickly and easily:

  • Mention the appropriate issue in your description (for example: addresses #123), if applicable. This will automatically add a link to the pull request in the issue. If you would like the issue to automatically close on merging the pull request, comment fixes #<ISSUE NUMBER> instead.
  • Update CHANGES.md with noteworthy changes.
  • If this contribution is large, please file an Apache Individual Contributor License Agreement.

See the Contributor Guide for more tips on how to make review process smoother.

To check the build health, please visit https://github.com/apache/beam/blob/master/.test-infra/BUILD_STATUS.md

GitHub Actions Tests Status (on master branch)

Build python source distribution and wheels
Python tests
Java tests
Go tests

See CI.md for more information about GitHub Actions CI or the workflows README to see a list of phrases to trigger workflows.


github-actions bot commented Mar 4, 2025

Checks are failing. Will not request review until checks are succeeding. If you'd like to override that behavior, comment assign set of reviewers

@sjvanrossum

Run Spotless PreCommit


github-actions bot commented Mar 5, 2025

Assigning reviewers. If you would like to opt out of this review, comment assign to next reviewer:

R: @kennknowles for label java.
R: @Abacn for label build.
R: @Abacn for label io.
R: @johnjcasey for label kafka.

Available commands:

  • stop reviewer notifications - opt out of the automated review tooling
  • remind me after tests pass - tag the comment author after tests pass
  • waiting on author - shift the attention set back to the author (any comment or push by the author will return the attention set to the reviewers)

The PR bot will only process comments in the main thread (not review comments).


@kennknowles kennknowles left a comment


Happy for the improvement. I may be misunderstanding what is strictly necessary to make your changes work as intended but TL;DR the inheritance all seems extraneous - one of them seems like inlining is equivalent and clearer, while the other seems like it is more clearly expressed as a field.


sjvanrossum commented Mar 11, 2025

@kennknowles I've updated the experiment setup in the benchmark. I've reduced concurrent readers to 1 because, as far as I know, only the ProcessElement thread reads and writes the accumulator, while a GetSize thread occasionally reads it concurrently in the SDF implementation; the unbounded source implementation is similar or entirely single-threaded. I've also split the tests into concurrent reads and writes versus isolated reads and writes, because those concurrent reads only happen every so often and only occasionally overlap with writes (observed in #32921 rarely triggering ConcurrentModificationException). With multiple concurrent readers I had observed a 10-20% improvement with layout padding, but the improvement is now barely noticeable, if at all (on my workstation).

Benchmark                                                                               Mode  Cnt   Score   Error  Units
KafkaIOUtilsBenchmark.ReadAndWriteAtomic                                                avgt   15  12.948 ± 1.999  ns/op
KafkaIOUtilsBenchmark.ReadAndWriteAtomic:atomicReadWhileWriting                         avgt   15   4.798 ± 0.200  ns/op
KafkaIOUtilsBenchmark.ReadAndWriteAtomic:atomicWriteWhileReading                        avgt   15  21.098 ± 3.811  ns/op
KafkaIOUtilsBenchmark.ReadAndWritePaddedAtomic                                          avgt   15  13.676 ± 1.509  ns/op
KafkaIOUtilsBenchmark.ReadAndWritePaddedAtomic:paddedAtomicReadWhileWriting             avgt   15   4.598 ± 0.064  ns/op
KafkaIOUtilsBenchmark.ReadAndWritePaddedAtomic:paddedAtomicWriteWhileReading            avgt   15  22.755 ± 3.075  ns/op
KafkaIOUtilsBenchmark.ReadAndWritePlain                                                 avgt   15  15.909 ± 1.780  ns/op
KafkaIOUtilsBenchmark.ReadAndWritePlain:plainReadWhileWriting                           avgt   15   4.058 ± 0.160  ns/op
KafkaIOUtilsBenchmark.ReadAndWritePlain:plainWriteWhileReading                          avgt   15  27.760 ± 3.589  ns/op
KafkaIOUtilsBenchmark.ReadAndWriteSynchronizedPlain                                     avgt   15  95.190 ± 1.845  ns/op
KafkaIOUtilsBenchmark.ReadAndWriteSynchronizedPlain:synchronizedPlainReadWhileWriting   avgt   15  98.039 ± 3.973  ns/op
KafkaIOUtilsBenchmark.ReadAndWriteSynchronizedPlain:synchronizedPlainWriteWhileReading  avgt   15  92.341 ± 2.358  ns/op
KafkaIOUtilsBenchmark.ReadAndWriteVolatile                                              avgt   15  26.432 ± 4.415  ns/op
KafkaIOUtilsBenchmark.ReadAndWriteVolatile:volatileReadWhileWriting                     avgt   15   3.879 ± 0.056  ns/op
KafkaIOUtilsBenchmark.ReadAndWriteVolatile:volatileWriteWhileReading                    avgt   15  48.984 ± 8.849  ns/op
KafkaIOUtilsBenchmark.ReadAtomic                                                        avgt   15   2.185 ± 0.007  ns/op
KafkaIOUtilsBenchmark.ReadPaddedAtomic                                                  avgt   15   2.193 ± 0.010  ns/op
KafkaIOUtilsBenchmark.ReadPlain                                                         avgt   15   2.206 ± 0.011  ns/op
KafkaIOUtilsBenchmark.ReadSynchronizedPlain                                             avgt   15   7.975 ± 0.571  ns/op
KafkaIOUtilsBenchmark.ReadVolatile                                                      avgt   15   2.199 ± 0.011  ns/op
KafkaIOUtilsBenchmark.WriteAtomic                                                       avgt   15   8.183 ± 0.003  ns/op
KafkaIOUtilsBenchmark.WritePaddedAtomic                                                 avgt   15   8.183 ± 0.003  ns/op
KafkaIOUtilsBenchmark.WritePlain                                                        avgt   15   9.592 ± 0.746  ns/op
KafkaIOUtilsBenchmark.WriteSynchronizedPlain                                            avgt   15  11.145 ± 1.714  ns/op
KafkaIOUtilsBenchmark.WriteVolatile                                                     avgt   15  11.967 ± 0.003  ns/op

I agree that it's not worth the hassle to maintain unless there's a significant upside so I've removed the layout padding from MovingAvg based on the results above.

@sjvanrossum sjvanrossum force-pushed the kafkaio-sdf-backlog-estimation branch from 0ba68c2 to e90f8fc on March 14, 2025 12:54
@sjvanrossum sjvanrossum requested a review from kennknowles March 14, 2025 15:51
continue;
}

long offsetGap = offset - expected; // could be > 0 when Kafka log compaction is enabled.
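For illustration (a hypothetical standalone sketch, not the surrounding Beam code): with log compaction enabled, consumed offsets may skip values, so a record's offset can exceed the expected previous + 1, and the sum of these gaps is what the removed ratio was derived from.

```java
class OffsetGapExample {
  // Sums the gaps between consecutive consumed offsets; the total is > 0
  // when log compaction has removed records from the polled range.
  static long totalOffsetGap(long[] consumedOffsets) {
    long expected = consumedOffsets[0];
    long gap = 0;
    for (long offset : consumedOffsets) {
      gap += offset - expected; // 0 for contiguous records
      expected = offset + 1;
    }
    return gap;
  }

  public static void main(String[] args) {
    // offsets 102-104 were removed by compaction
    System.out.println(totalOffsetGap(new long[] {100, 101, 105, 106})); // prints 3
  }
}
```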

I admit I don't know this well enough to know why we tracked this or why it can be removed, even with your description in the PR. I trust your experiments, and I don't see this being a data integrity issue, though. I'd love to be educated at some point.

@kennknowles kennknowles merged commit e71fdce into apache:master Mar 18, 2025
23 checks passed