[GLUTEN-9152][CORE] Avoid unnecessary serialization of hadoop conf#9153
[GLUTEN-9152][CORE] Avoid unnecessary serialization of hadoop conf#9153zml1206 merged 9 commits intoapache:mainfrom
Conversation
|
Run Gluten Clickhouse CI on x86 |
|
Run Gluten Clickhouse CI on x86 |
| rootPaths: Seq[String], | ||
| properties: Map[String, String], | ||
| serializableHadoopConf: Option[SerializableConfiguration] = None): ValidationResult = { | ||
| hadoopConf: Configuration): ValidationResult = { |
There was a problem hiding this comment.
There was a problem hiding this comment.
But from the code logic, I did not find that velox uses the hadoop conf here.
There was a problem hiding this comment.
@wangyum Could you please help in verifying this PR in your ViewFS environment? My ViewFS test setup is currently not functioning properly.
There was a problem hiding this comment.
I have tested this patch on our cluster and it still works.
|
|
||
| val sparkConf: SparkConf = sparkContext.getConf | ||
|
|
||
| val serializableHadoopConf: SerializableConfiguration = new SerializableConfiguration( |
There was a problem hiding this comment.
maybe add @transient lazy could fix the issue
There was a problem hiding this comment.
If serialization is require, lazy is a solution to avoid serialization without enabling viewFS, but if serialization is not needed here, it would be simpler to use sparkContext.hadoopConfiguration directly.
|
Run Gluten Clickhouse CI on x86 |
|
@zml1206 can you help to take a look the failed unit tests? |
It's strange that PR seems to cause Java heap oom, but JDK 17 passes. I haven't found the reason yet. I'm not sure whether it is related to PR or the test itself is unstable. |
|
Run Gluten Clickhouse CI on x86 |
|
Run Gluten Clickhouse CI on x86 |
|
Run Gluten Clickhouse CI on x86 |
|
Run Gluten Clickhouse CI on x86 |
|
Run Gluten Clickhouse CI on x86 |
|
I can't reproduce it locally. I tried many ways, including |
@zml1206, do you mean reverting the changes for |
Yes, revert the changes for |
philo-he
left a comment
There was a problem hiding this comment.
I think we can merge this patch first. If there is some issue found at user side, we can make a follow-up pr.
|
Thanks for everyone's review, if no one else has any comments, I'll merge it later. |
What changes were proposed in this pull request?
Significantly reduce the closure serialization time of
BasicScanExecTransformerandWholeStageTransformer.(Fixes: #9152)
How was this patch tested?