[GLUTEN-11330][VL] Make PartialProject support array and map with null values #11331
Conversation
jinchengchenghh left a comment:
Thanks for your enhancement!
Review comment on this hunk:

    import org.apache.spark.sql.vectorized.ColumnVector;

    /**
     * Because `get` method in `ColumnarArray` don't check whether the data to get is null and arrow
Is this a Spark shortcoming or a deliberate design? What is Spark's intended usage for ColumnarArray with null values?
I think it is a Spark shortcoming. If there are null values, ColumnarArray will still return a value (possibly a default or a previously set value), because calling get on ColumnarArray eventually calls getXXX on the underlying ColumnVector, and getXXX does not check for null either.
Could you also raise an issue in Spark?
OK, I will. Thanks.
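The failure mode discussed above can be sketched in plain Java. The classes below are hypothetical stand-ins for Spark's ColumnVector/ColumnarArray (not the real Spark classes): a getter that skips the null check surfaces a stale buffer value, while a handleNull-style getter checks isNullAt first, which is the behavior this PR adds.

```java
// Minimal stand-in for a columnar int vector with a validity (null) mask.
// Illustrative only; not the actual Spark or Gluten classes.
final class FakeIntVector {
    private final int[] data;
    private final boolean[] nulls;

    FakeIntVector(int[] data, boolean[] nulls) {
        this.data = data;
        this.nulls = nulls;
    }

    boolean isNullAt(int i) { return nulls[i]; }

    // Like ColumnVector.getInt as described above: does NOT consult the
    // null mask, so a null slot yields whatever the buffer happens to hold.
    int getInt(int i) { return data[i]; }
}

public class NullCheckDemo {
    // Mirrors a get without a null check: returns a stale value for null slots.
    static Object getNoNullCheck(FakeIntVector v, int i) {
        return v.getInt(i);
    }

    // Mirrors the handleNull = true behavior: check isNullAt before reading.
    static Object getWithNullCheck(FakeIntVector v, int i) {
        if (v.isNullAt(i)) {
            return null;
        }
        return v.getInt(i);
    }

    public static void main(String[] args) {
        // Slot 1 is null; its backing buffer still holds a stale value 99.
        FakeIntVector v = new FakeIntVector(new int[]{7, 99, 3},
                                            new boolean[]{false, true, false});
        System.out.println(getNoNullCheck(v, 1));   // prints 99 (stale value)
        System.out.println(getWithNullCheck(v, 1)); // prints null
    }
}
```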
Co-authored-by: jiangtian <JT2677636391@outlook.com>
What changes are proposed in this pull request?
This PR introduces a new class named ArrowColumnarArray. Its implementation is copied from Spark 4.0, except that the handleNull parameter is set to true when we call SpecializedGettersReader.read in get. This means that when accessing an array element we first check whether it is null, so we avoid throwing an exception when accessing a null value of an array.

Besides, this PR introduces another new class named ArrowColumnarMap. It defines two fields of type ArrowColumnarArray to represent the keys and the values separately. With this class, we can likewise avoid throwing an exception when accessing a null value of a map.

How was this patch tested?
Unit tests.
Related issue: #11330
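The key/value layout described for ArrowColumnarMap can be sketched as follows. The names (NullSafeIntArray, ColumnarMapSketch) are illustrative assumptions, not the actual Gluten classes: a map view backed by two parallel arrays, where each array's getter checks the null mask so a null value comes back as null instead of garbage.

```java
// Null-safe array view: get() consults the validity mask before reading,
// analogous to the handleNull = true behavior described in the PR.
final class NullSafeIntArray {
    private final int[] data;
    private final boolean[] nulls;

    NullSafeIntArray(int[] data, boolean[] nulls) {
        this.data = data;
        this.nulls = nulls;
    }

    // Returns null for a null slot instead of a stale buffer value.
    Integer get(int i) { return nulls[i] ? null : data[i]; }

    int length() { return data.length; }
}

// Sketch of a columnar map backed by two parallel arrays (keys, values),
// mirroring the two-ArrowColumnarArray design described above.
public class ColumnarMapSketch {
    private final NullSafeIntArray keys;
    private final NullSafeIntArray values;

    ColumnarMapSketch(NullSafeIntArray keys, NullSafeIntArray values) {
        this.keys = keys;
        this.values = values;
    }

    // Linear lookup over the parallel arrays; a null value no longer
    // throws or surfaces garbage, it is returned as null.
    Integer lookup(int key) {
        for (int i = 0; i < keys.length(); i++) {
            Integer k = keys.get(i);
            if (k != null && k == key) {
                return values.get(i);
            }
        }
        return null;
    }

    public static void main(String[] args) {
        NullSafeIntArray ks = new NullSafeIntArray(new int[]{1, 2},
                                                   new boolean[]{false, false});
        // The value for key 2 is null; its buffer slot still holds 42.
        NullSafeIntArray vs = new NullSafeIntArray(new int[]{10, 42},
                                                   new boolean[]{false, true});
        ColumnarMapSketch m = new ColumnarMapSketch(ks, vs);
        System.out.println(m.lookup(1)); // prints 10
        System.out.println(m.lookup(2)); // prints null
    }
}
```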