Skip to content

[Proposal] Skip writing value lengths in the indexes #3070

@pjain1

Description

@pjain1

Druid uses format described in GenericIndexed class to serialize indexes. While writing index values, Druid writes the length of values in bytes before each value.

However, this may be skipped as end offsets of each serialized value is already written before the values are stored. This can lead to savings of (4 bytes * number of values in the index) for each index in the final segment. Some examples of GenericIndexed index are for writing Dictionary values, corresponding bitmaps etc.

Corresponding PR for discussion - #3069

Metadata

Metadata

Assignees

No one assigned

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions