
Reuse previous CometDictionary Java arrays #488

@viirya

Describe the bug

After #474, ColumnReader.loadVector creates a new CometDictionary each time it re-imports the arrays. @andygrove reported (#473 (comment)) that the patch causes an obvious regression when compared with Spark.

The cause is that, although re-importing the dictionary values is fast, CometDictionary has an initialization step that copies the imported dictionary values into an internal Java array, and that copy now runs on every re-import. We need to reuse the Java arrays that the previous CometDictionary already copied.
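The idea can be illustrated with a minimal, self-contained sketch. It is not Comet's actual code: `SketchDictionary`, `SketchReader`, `setImportedValues`, and `copyCount` are hypothetical names used only for illustration, and the sketch assumes the dictionary values do not change between re-imports, so the previously copied array stays valid.

```java
import java.util.Arrays;

public class DictionaryReuseSketch {

    /** Hypothetical stand-in for CometDictionary: copies imported values into a Java array. */
    static final class SketchDictionary {
        static int copyCount = 0;      // counts how often the expensive copy runs
        private int[] importedValues;  // stand-in for the re-imported dictionary values
        private final int[] decoded;   // internal Java array, filled once in the constructor

        SketchDictionary(int[] importedValues) {
            this.importedValues = importedValues;
            // The initialization step described in the issue: copy into a Java array.
            this.decoded = Arrays.copyOf(importedValues, importedValues.length);
            copyCount++;
        }

        /** Swaps in freshly imported values without repeating the copy (assumed API). */
        void setImportedValues(int[] importedValues) {
            this.importedValues = importedValues;
        }

        int decodeToInt(int id) {
            return decoded[id];
        }
    }

    /** Hypothetical stand-in for the reader's per-batch load step. */
    static final class SketchReader {
        private SketchDictionary dictionary;

        void loadVector(int[] importedValues) {
            if (dictionary == null) {
                // First import: pay the copy cost once.
                dictionary = new SketchDictionary(importedValues);
            } else {
                // Later re-imports: keep the previously copied Java array.
                dictionary.setImportedValues(importedValues);
            }
        }

        int decode(int id) {
            return dictionary.decodeToInt(id);
        }
    }

    public static void main(String[] args) {
        SketchReader reader = new SketchReader();
        int[] values = {10, 20, 30};
        for (int batch = 0; batch < 3; batch++) {
            reader.loadVector(values.clone()); // simulate re-importing the same values
        }
        System.out.println("decoded id 1: " + reader.decode(1));
        System.out.println("copies performed: " + SketchDictionary.copyCount);
    }
}
```

In this sketch the copy runs once no matter how many batches are loaded. A real fix would also need to decide when the cached arrays must be invalidated, for example when a new dictionary page is read.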

Steps to reproduce

No response

Expected behavior

No response

Additional context

No response

Labels

bug (Something isn't working)
