[VL] New option to follow vanilla Spark's build side in shuffled hash join #7133

Merged
kecookier merged 2 commits into apache:main from zhztheplayer:wip-build-side
Sep 6, 2024

@zhztheplayer
Member

@zhztheplayer zhztheplayer commented Sep 5, 2024

It has been observed that the new approach of choosing the smaller side of a shuffled hash join as the build side, introduced in #6093, can cause performance regressions compared to earlier versions of Gluten because of inaccurate statistics calculations.
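A toy illustration of this failure mode, not Gluten's actual planner code: when the build side is picked by comparing estimated sizes and the estimates are off, the larger relation can end up as the hash table. The `Side` class, the `smaller_by_estimate` helper, and all sizes below are invented for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Side:
    """One input of a join; every number here is made up for illustration."""
    name: str
    estimated_bytes: int  # what the planner's statistics claim
    actual_bytes: int     # what is really there at run time


def smaller_by_estimate(left: Side, right: Side) -> Side:
    """Pick the build side by estimated size (hypothetical helper)."""
    return left if left.estimated_bytes <= right.estimated_bytes else right


# The left side is badly underestimated, so it looks smaller than it is.
left = Side("left", estimated_bytes=10_000_000, actual_bytes=8_000_000_000)
right = Side("right", estimated_bytes=50_000_000, actual_bytes=40_000_000)

build = smaller_by_estimate(left, right)
print(build.name, build.actual_bytes)  # the much larger side becomes the build side
```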

This PR adds an option, spark.gluten.sql.columnar.shuffledHashJoin.optimizeBuildSide, which can be set to false to keep using vanilla Spark's build side in Gluten's shuffled hash join.
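As a minimal sketch of how the option might be supplied, the snippet below renders it as a `--conf` argument for `spark-submit`. The option key is the one named in this PR; the helper `to_conf_args` is hypothetical and exists only for this example.

```python
# Option key taken from the PR description; everything else is illustrative.
OPTIMIZE_BUILD_SIDE = "spark.gluten.sql.columnar.shuffledHashJoin.optimizeBuildSide"


def to_conf_args(conf: dict[str, str]) -> list[str]:
    """Render a conf mapping as `--conf key=value` arguments for spark-submit."""
    args: list[str] = []
    for key, value in conf.items():
        args += ["--conf", f"{key}={value}"]
    return args


# "false" keeps vanilla Spark's build side, per the PR description.
conf_args = to_conf_args({OPTIMIZE_BUILD_SIDE: "false"})
print(" ".join(conf_args))
```

In a PySpark application the same key/value pair could presumably be passed through `SparkSession.builder.config(OPTIMIZE_BUILD_SIDE, "false")`; whether the option can also be changed on a running session is not stated in this PR.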

@github-actions github-actions Bot added the CORE (works for Gluten Core) and DOCS labels Sep 5, 2024
@github-actions

github-actions Bot commented Sep 5, 2024

Run Gluten Clickhouse CI

@apache apache deleted a comment from github-actions Bot Sep 5, 2024
@apache apache deleted a comment from github-actions Bot Sep 5, 2024
@zhztheplayer
Member Author

cc @kecookier @weiting-chen

@kecookier
Contributor

It looks good to me. Thanks!

@kecookier kecookier merged commit d285b7a into apache:main Sep 6, 2024
dcoliversun pushed a commit to dcoliversun/gluten that referenced this pull request Sep 11, 2024
… join (apache#7133)

* [VL] New option to follow vanilla Spark's build side in shuffled hash join

* fixup
sharkdtu pushed a commit to sharkdtu/gluten that referenced this pull request Nov 11, 2024
… join (apache#7133)

* [VL] New option to follow vanilla Spark's build side in shuffled hash join

* fixup

Labels

CORE (works for Gluten Core), DOCS

2 participants