[CORE] Defer Protobuf serialization of SplitInfos in GlutenPartitions by kevinwilfong · Pull Request #10662 · apache/gluten

kevinwilfong · 2025-09-09T23:14:07Z

What changes are proposed in this pull request?

Today, each GlutenPartition object contains an array of byte arrays: the Protobuf-serialized ReadRel.read_type objects built from its SplitInfos. The GlutenPartitions are Java serialized and sent to the Executors responsible for their respective Tasks. From reading the code, it appears we Protobuf serialize the SplitInfos so that they can easily be passed across the JNI boundary.

We see that the serialized SplitInfos can consume a significant amount of memory in the Driver. As objects, SplitInfos can share references to the same underlying objects; once serialized, they share nothing, so every byte array carries its own copy of that state and their total size in memory explodes.
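To illustrate the size effect described above, here is a minimal sketch. It is not Gluten code: `Split` is a hypothetical stand-in for a SplitInfo, and plain Java serialization is used in place of Protobuf to keep the example dependency-free. It compares the total size of per-split byte arrays, each of which embeds its own copy of the shared state, with a single Java serialization stream, which writes a shared object once and refers back to it afterwards.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;

class SharedStateSizeDemo {
    // Hypothetical stand-in for a SplitInfo: one per-split field plus state shared by every split.
    static final class Split implements Serializable {
        private static final long serialVersionUID = 1L;
        final String path;
        final List<String> sharedColumns; // the same List instance is referenced by all splits

        Split(String path, List<String> sharedColumns) {
            this.path = path;
            this.sharedColumns = sharedColumns;
        }
    }

    // Serializes one object graph into its own byte array.
    static byte[] serialize(Object value) {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
            out.writeObject(value);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return bytes.toByteArray();
    }

    // Builds splits that all point at one shared column list.
    static Split[] buildSplits(int count) {
        List<String> columns = new ArrayList<>();
        for (int i = 0; i < 200; i++) {
            columns.add("column_with_a_long_name_" + i);
        }
        Split[] splits = new Split[count];
        for (int i = 0; i < count; i++) {
            splits[i] = new Split("part-" + i + ".parquet", columns);
        }
        return splits;
    }

    // One byte array per split: the shared list is copied into every array.
    static long independentBytes(Split[] splits) {
        long total = 0;
        for (Split split : splits) {
            total += serialize(split).length;
        }
        return total;
    }

    // One stream for the whole array: the shared list is written once.
    static long singleStreamBytes(Split[] splits) {
        return serialize(splits).length;
    }

    public static void main(String[] args) {
        Split[] splits = buildSplits(100);
        System.out.println("independent byte arrays: " + independentBytes(splits));
        System.out.println("single stream:           " + singleStreamBytes(splits));
    }
}
```

The absolute numbers depend on the shape of the real SplitInfo state, so the sketch only shows the direction of the effect, not its magnitude in Gluten.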

If we instead Java serialize the SplitInfo objects like the rest of the GlutenPartition state, and Protobuf serialize them as part of the Task, we can save a significant amount of Driver memory. The cost is a little additional memory in the Task (the size of the SplitInfo objects for a single GlutenPartition, which should be trivial) and a little additional CPU.

On the CPU side: instead of Protobuf serializing in the Driver and Java serializing the array of byte arrays, we Java serialize the array of SplitInfos in the Driver, and the Task pays for Java deserializing the array of SplitInfos and then Protobuf serializing them. Overall, the difference is just the additional cost of Java serializing SplitInfos instead of byte arrays.
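The shape of the change can be sketched as follows. This is an illustration under stated assumptions, not the patch itself: `Split`, `EagerPartition`, `DeferredPartition`, and `toWireBytes()` are hypothetical names, and `toWireBytes()` merely stands in for the Protobuf encoding that is passed across JNI. The Java round trip stands in for shipping a partition from the Driver to an Executor.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

class DeferredSplitEncoding {
    // Hypothetical stand-in for a SplitInfo.
    static final class Split implements Serializable {
        private static final long serialVersionUID = 1L;
        final String path;
        final long start;
        final long length;

        Split(String path, long start, long length) {
            this.path = path;
            this.start = start;
            this.length = length;
        }

        // Stand-in for the Protobuf encoding (an assumption made for illustration only).
        byte[] toWireBytes() {
            return (path + ":" + start + ":" + length).getBytes(StandardCharsets.UTF_8);
        }
    }

    static byte[][] encodeAll(Split[] splits) {
        byte[][] encoded = new byte[splits.length][];
        for (int i = 0; i < splits.length; i++) {
            encoded[i] = splits[i].toWireBytes();
        }
        return encoded;
    }

    // Before: splits are encoded when the partition is created, on the Driver.
    static final class EagerPartition implements Serializable {
        private static final long serialVersionUID = 1L;
        final byte[][] encodedSplits;

        EagerPartition(Split[] splits) {
            this.encodedSplits = encodeAll(splits);
        }
    }

    // After: the partition carries the objects; encoding happens inside the Task.
    static final class DeferredPartition implements Serializable {
        private static final long serialVersionUID = 1L;
        final Split[] splits;

        DeferredPartition(Split[] splits) {
            this.splits = splits;
        }

        byte[][] encodeSplits() {
            return encodeAll(splits);
        }
    }

    // Java serialization round trip, standing in for Driver-to-Executor transfer.
    @SuppressWarnings("unchecked")
    static <T extends Serializable> T javaRoundTrip(T value) {
        try {
            ByteArrayOutputStream bytes = new ByteArrayOutputStream();
            try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
                out.writeObject(value);
            }
            try (ObjectInputStream in =
                    new ObjectInputStream(new ByteArrayInputStream(bytes.toByteArray()))) {
                return (T) in.readObject();
            }
        } catch (IOException | ClassNotFoundException e) {
            throw new IllegalStateException(e);
        }
    }

    // Both paths should hand identical bytes to the native side.
    static boolean bothPathsAgree() {
        Split[] splits = {
            new Split("part-0.parquet", 0L, 1024L),
            new Split("part-1.parquet", 1024L, 2048L)
        };
        EagerPartition eager = javaRoundTrip(new EagerPartition(splits));
        DeferredPartition deferred = javaRoundTrip(new DeferredPartition(splits));
        return Arrays.deepEquals(eager.encodedSplits, deferred.encodeSplits());
    }

    public static void main(String[] args) {
        System.out.println("same bytes on both paths: " + bothPathsAgree());
    }
}
```

In this sketch the only difference between the two paths is where `encodeAll` runs, which matches the cost accounting above: the Task takes on the encoding work, and the Driver holds objects rather than encoded bytes.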

How was this patch tested?

Ran the existing unit tests.

Verified locally that a query with particularly high Driver memory due to serialized SplitInfos saw a significant reduction.

github-actions · 2025-09-09T23:14:38Z

Run Gluten Clickhouse CI on x86

github-actions · 2025-09-10T17:27:01Z

Run Gluten Clickhouse CI on x86

github-actions · 2025-09-10T20:59:53Z

Run Gluten Clickhouse CI on x86

kevinwilfong · 2025-09-10T23:36:47Z

It looks like the tests are failing because of #10671, which is unrelated to this change.

marin-ma

LGTM. Thanks!
Just one question about the fix for this performance issue: does this patch reduce driver memory because Java serialization serializes the same object only once?

kevinwilfong · 2025-09-12T17:00:51Z

@marin-ma Thanks for the review!

> Does this patch reduce driver memory because Java serialization serializes the same object only once?

I suspect the reason is that Spark Java serializes the GlutenPartitions as needed and does not hold the serialized values in memory for a long time. In Gluten, we're currently Protobuf serializing the SplitInfos when we create the GlutenPartitions, and I see a large number of these GlutenPartitions getting held in the Driver's memory while the query is running, so the serialized SplitInfos all exist together at the same time. If Spark is Java serializing the GlutenPartitions only when a Task is ready to execute, and evicts the serialized value from memory as soon as it's been sent to the Executor, with this change we'll only end up with a relatively small number of serialized values present in the Driver's memory at the same time (proportional to the number of Executors).
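As a back-of-the-envelope model of this reasoning, the sketch below compares the peak number of serialized bytes held at once. It assumes, as suspected in the comment above rather than confirmed, that a serialized value is held only while its Task is being launched. All names and the example figures are hypothetical.

```java
class PeakSerializedBytes {
    // Eager: every partition's serialized splits exist for the whole query.
    static long eagerPeak(int partitions, long bytesPerPartition) {
        return (long) partitions * bytesPerPartition;
    }

    // Deferred: only partitions whose Tasks are currently being launched
    // hold a serialized form (assumption taken from the comment above).
    static long deferredPeak(int partitions, int concurrentLaunches, long bytesPerPartition) {
        return (long) Math.min(partitions, concurrentLaunches) * bytesPerPartition;
    }

    public static void main(String[] args) {
        int partitions = 10_000;            // hypothetical partition count
        int concurrentLaunches = 50;        // hypothetical, proportional to the number of Executors
        long bytesPerPartition = 1_048_576; // hypothetical 1 MiB of serialized splits

        System.out.println("eager peak bytes:    " + eagerPeak(partitions, bytesPerPartition));
        System.out.println("deferred peak bytes: "
                + deferredPeak(partitions, concurrentLaunches, bytesPerPartition));
    }
}
```

Under these assumptions the eager peak grows with the number of partitions, while the deferred peak grows only with the number of concurrent launches.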


Yohahaha · 2025-09-23T03:19:39Z

We have another PR, #6572, that aims to reduce driver memory pressure. Posting it here to see whether it can be applied as well.


github-actions bot added the CORE, VELOX, and CLICKHOUSE labels Sep 9, 2025

kevinwilfong force-pushed the serialize_split branch from a1c4d58 to d175bfe September 10, 2025 17:26

[CORE] Defer Protobuf serialization of SplitInfos in GlutenPartitions

2e2ecae

kevinwilfong force-pushed the serialize_split branch from d175bfe to 2e2ecae September 10, 2025 20:58

kevinwilfong marked this pull request as ready for review September 10, 2025 23:36

kevinwilfong requested a review from marin-ma September 10, 2025 23:43

marin-ma approved these changes Sep 12, 2025


marin-ma merged commit 0d170be into apache:main Sep 12, 2025
52 of 56 checks passed

kevinwilfong added a commit to kevinwilfong/incubator-gluten that referenced this pull request Sep 12, 2025

[CORE] Defer Protobuf serialization of SplitInfos in GlutenPartitions (…

a8d0b3b

…apache#10662)

kevinwilfong mentioned this pull request Sep 17, 2025

[VL] Support mapping columns by position index for ORC and Parquet files #10697

Merged

wForget pushed a commit to wForget/gluten that referenced this pull request Sep 23, 2025

[CORE] Defer Protobuf serialization of SplitInfos in GlutenPartitions (…

bdff506

…apache#10662) (cherry picked from commit 0d170be)

wForget mentioned this pull request Sep 23, 2025

[DNM] Test #10786

Closed

wForget pushed a commit to wForget/gluten that referenced this pull request Oct 15, 2025

[CORE] Defer Protobuf serialization of SplitInfos in GlutenPartitions (…

7836608

…apache#10662) (cherry picked from commit 0d170be)
