[GLUTEN-10107][CH] Decouple Celeborn-related code from CH backend module#10537
[GLUTEN-10107][CH] Decouple Celeborn-related code from CH backend module#10537philo-he merged 6 commits intoapache:mainfrom
Conversation
|
Run Gluten Clickhouse CI on x86 |
|
Run Gluten Clickhouse CI on x86 |
|
I noticed there exist a ci failure but I can't see the details. Can anyone help provide the error log? |
|
Run Gluten Clickhouse CI on x86 |
|
Is there any error message reported from the following? |
Sorry, I missed account info here. |
|
Run Gluten Clickhouse CI on x86 |
1a84125 to
7b9b6ac
Compare
|
Run Gluten Clickhouse CI on x86 |
|
@zhztheplayer Would you like to help review this modification,too? thx a lot |
...-celeborn/src/main/java/org/apache/spark/shuffle/gluten/celeborn/CelebornShuffleManager.java
Outdated
Show resolved
Hide resolved
...rc-celeborn/main/scala/org/apache/spark/shuffle/CHCelebornColumnarBatchSerlizerFactory.scala
Outdated
Show resolved
Hide resolved
|
Run Gluten Clickhouse CI on x86 |
|
Cc: @baibaichen @zzcclp |
...-celeborn/src/main/java/org/apache/spark/shuffle/gluten/celeborn/CelebornShuffleManager.java
Outdated
Show resolved
Hide resolved
...rc-celeborn/main/scala/org/apache/spark/shuffle/CHCelebornColumnarBatchSerlizerFactory.scala
Show resolved
Hide resolved
| // backendName); | ||
| // } | ||
| columnarBatchSerializerFactory = columanrBatchSerilizerFactoryMap.get(backendName); | ||
| if (!columnarBatchSerilizerFactoryMap.containsKey(backendName)) { |
There was a problem hiding this comment.
Not directly related to this PR's target.
I think using a map might be unnecessary. As we know, the implementation for ColumnarBatchSerializerFactory is exclusive, meaning Velox's factory implementation can only be loaded by the Velox backend, and CH's implementation can only be loaded by the CH backend. Then, it seems we can just check whether a concrete factory is loaded. There's no need to store the backend name in the factory or check the factory's backend name to confirm it belongs to the current backend.
This is just a preliminary thought. Am I missing something?
There was a problem hiding this comment.
I think you are right. Just open a issue to track this insightful idea #10577
|
Run Gluten Clickhouse CI on x86 |
|
Run Gluten Clickhouse CI on x86 |
philo-he
left a comment
There was a problem hiding this comment.
Looks good. Will merge this PR today if no other comments from the community.
What changes are proposed in this pull request?
As a follow up of #10201, aims to complete #10107
This MR mainly focus on decouple celeborn config from ch backend.
How was this patch tested?
Existing test.