Backend
VL (Velox)
Bug description
For the same number of records, Gluten always writes much more shuffle data than vanilla Spark. Why? From my understanding, Gluten should use less storage, since the data is written in a columnar format.
gluten

vanilla

Spark version
Spark-3.2.x
Spark configurations
No response
System information
gluten version: 1.3.0
Relevant logs