Skip to content

[GLUTEN-8616] [VL] Add support for Existence Join for broadcast nested loop join#9588

Merged
philo-he merged 2 commits intoapache:mainfrom
ArnavBalyan:arnavb/existence-join
May 19, 2025
Merged

[GLUTEN-8616] [VL] Add support for Existence Join for broadcast nested loop join#9588
philo-he merged 2 commits intoapache:mainfrom
ArnavBalyan:arnavb/existence-join

Conversation

@ArnavBalyan
Copy link
Copy Markdown
Member

@ArnavBalyan ArnavBalyan commented May 10, 2025

What changes were proposed in this pull request?

How was this patch tested?

  • Existing + New UTs

@github-actions github-actions bot added CORE works for Gluten Core VELOX labels May 10, 2025
@github-actions
Copy link
Copy Markdown

#8616

@github-actions
Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

@ArnavBalyan
Copy link
Copy Markdown
Member Author

cc @zhztheplayer @philo-he could you please take a look thanks!

@ArnavBalyan ArnavBalyan force-pushed the arnavb/existence-join branch from 09b402f to b66a40c Compare May 12, 2025 16:30
@github-actions
Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

1 similar comment
@github-actions
Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

@ArnavBalyan ArnavBalyan force-pushed the arnavb/existence-join branch from cb2d355 to 305e011 Compare May 13, 2025 01:03
@github-actions
Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

@github-actions
Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

@ArnavBalyan ArnavBalyan force-pushed the arnavb/existence-join branch from d29f843 to 2479d1f Compare May 15, 2025 12:00
@github-actions
Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

Copy link
Copy Markdown
Member

@philo-he philo-he left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good! Just one comment on test. cc @rui-mo

class GlutenExistenceJoinSuite extends ExistenceJoinSuite with GlutenSQLTestsBaseTrait {}
class GlutenExistenceJoinSuite extends ExistenceJoinSuite with GlutenSQLTestsBaseTrait {

test("test existence join with broadcast nested loop join") {
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we create a new test class under backends-velox/ to include this test? Then, it will be tested on all supported spark versions.

BTW, please add a check to ensure the operator is really offloaded to Gluten/Velox.

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

updated thanks

@ArnavBalyan ArnavBalyan force-pushed the arnavb/existence-join branch from 2479d1f to 07cd172 Compare May 15, 2025 16:43
@github-actions
Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

@ArnavBalyan
Copy link
Copy Markdown
Member Author

cc @philo-he @zhztheplayer gentle reminder thanks!

Copy link
Copy Markdown
Member

@philo-he philo-he left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good! Some minor comments.

}
}

protected def backendSpecificJoinValidation(): Option[ValidationResult] = None
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this new method being used anywhere? If not, please remove it.

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

updated


class GlutenExistenceJoinSuite extends VeloxWholeStageTransformerSuite with SQLTestUtils {

test("test existence join with broadcast nested loop join") {
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nit: remove "test" from test name.

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done thanks

assert(count == 1, s"Expected 1 VeloxBroadcastNestedLoopJoinExecTransformer, but found $count")
}
override protected val resourcePath: String = "N/A"
override protected val fileFormat: String = "N/A"
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nit: move the above overriding val at the class's beginning part.

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

removed

@ArnavBalyan ArnavBalyan force-pushed the arnavb/existence-join branch from 07cd172 to 4dd4db8 Compare May 19, 2025 07:00
@github-actions
Copy link
Copy Markdown

Run Gluten Clickhouse CI on x86

@philo-he philo-he merged commit c46b86a into apache:main May 19, 2025
47 of 48 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLICKHOUSE CORE works for Gluten Core VELOX

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants