Flink: Dynamic Iceberg Sink: Optimise RowData evolution #13340
pvary merged 6 commits into apache:main
Conversation
mxm
left a comment
Thanks for improving the performance on the conversion write path @aiborodin! It looks like this PR contains two separate changes:
- Adding caching to the conversion write path
- Refactoring RowDataEvolver to dynamically instantiate converter classes (quasi code generation)
I wonder if we can do (1) as a first step. RowDataEvolver has been static so far, and I understand that it needs to become an object in order to add the cache, but perhaps we could start with a central RowDataEvolver instance holding a cache keyed by source and target schema. I'm not sure the code generation yields much of a performance gain, and I would like to minimize the number of objects created.
Force-pushed 913c0c6 to 0a6af3a
According to the profile in my previous comment (#13340 (comment)), schema caching alone would not be sufficient; we also need to cache field accessors and converters to minimise the CPU overhead. The object overhead is minimal because each converter only stores field accessors and conversion lambdas. The cache overhead is minimal because it is an identity cache, and the same schema objects are already cached in TableMetadataCache.
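The identity-cache idea above can be sketched in isolation. The class below is a hypothetical, stdlib-only illustration (none of these names are the actual Iceberg API): converters are cached under a key that compares the source and target schema objects by reference (`==`), which is cheap and safe precisely because the schema instances themselves are already interned upstream, as TableMetadataCache does in this PR.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.function.BiFunction;

// Hypothetical sketch of an identity-keyed converter cache.
// Schema references are compared with ==, not equals(), so lookups
// avoid structural schema comparison entirely.
final class ConverterCache<S, T, C> {

  // Composite key using reference identity for both schema objects.
  private static final class IdentityPair {
    final Object source;
    final Object target;

    IdentityPair(Object source, Object target) {
      this.source = source;
      this.target = target;
    }

    @Override
    public boolean equals(Object o) {
      return o instanceof IdentityPair
          && ((IdentityPair) o).source == this.source
          && ((IdentityPair) o).target == this.target;
    }

    @Override
    public int hashCode() {
      return 31 * System.identityHashCode(source) + System.identityHashCode(target);
    }
  }

  private final Map<IdentityPair, C> cache = new HashMap<>();
  private final BiFunction<S, T, C> factory;

  ConverterCache(BiFunction<S, T, C> factory) {
    this.factory = factory;
  }

  // Builds the converter once per distinct (source, target) schema pair.
  C get(S source, T target) {
    return cache.computeIfAbsent(
        new IdentityPair(source, target), key -> factory.apply(source, target));
  }
}
```

This is why the cache overhead stays minimal: a hit costs two identity hash codes and two reference comparisons, with no traversal of the schema trees.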
Force-pushed 0a6af3a to 5c63747
mxm
left a comment
Thanks for explaining the rationale behind the change. This is an excellent contribution!
Force-pushed c918919 to 8e45f21
pvary
left a comment
LGTM +1
A few small changes, and we are ready.
Force-pushed eeb0687 to a888dc3
RowDataEvolver recomputes Flink RowType and field getters for every input record that needs to match a destination Iceberg table schema. Cache field getters and column converters to optimise RowData conversion.
TableMetadataCache already contains an identity cache to store schema comparison results. Let's move the row data converter cache into SchemaInfo and make it configurable.
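The performance gain described above comes from moving per-record work to per-schema work. The sketch below is illustrative only (it uses plain `Object[]` rows as a stand-in for Flink's RowData, and the class and field names are invented): the accessor and converter lambda for each column are built once, when a schema pair is first seen, and then reused for every record.

```java
import java.util.List;
import java.util.function.Function;

// Hypothetical sketch: precompute one field accessor and one column
// converter per position, instead of recomputing them for every record.
// In the actual PR this role is played by Flink's RowData.FieldGetter.
final class RowConverter {
  private final Function<Object[], Object>[] fieldGetters;
  private final Function<Object, Object>[] columnConverters;

  @SuppressWarnings("unchecked")
  RowConverter(int arity, List<Function<Object, Object>> converters) {
    this.fieldGetters = new Function[arity];
    this.columnConverters = new Function[arity];
    for (int i = 0; i < arity; i++) {
      final int pos = i;
      // Built once per schema pair, not once per record.
      fieldGetters[i] = row -> row[pos];
      columnConverters[i] = converters.get(i);
    }
  }

  // Hot path: only array indexing and lambda invocations remain.
  Object[] convert(Object[] row) {
    Object[] out = new Object[fieldGetters.length];
    for (int i = 0; i < out.length; i++) {
      out[i] = columnConverters[i].apply(fieldGetters[i].apply(row));
    }
    return out;
  }
}
```

Because each converter instance only holds these two small arrays, the per-object memory overhead stays low, which matches the argument made earlier in the thread.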
Force-pushed 83e1150 to 2339e78
Nice last commits 😂
Merged to main. @aiborodin: Could you please create a backport PR to port these changes to Flink 1.20 and 1.19? Also, if you need to change anything beyond cleanly applying the change, please highlight it so it is easier to review. Thanks for all of your work on this! Happy to have you as a contributor!