Sort by rollup key in hadoop indexer #1095

@gianm

Hadoop indexing reducers sort their data by timestamp, which probably helps somewhat, but less than it could in terms of minimizing the amount of data spilled to disk. For data that is not already rolled up, we'd spill less and merge faster if we sorted by the entire rollup key (truncated timestamp + dimensions).
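
To illustrate the idea, here is a toy model in Python. It is not the indexer's actual code: `Row`, `rollup_key`, `spilled_rows`, the hourly granularity, and the "spill when the buffer is full" policy are all assumptions made for the sketch. It aggregates rows into a bounded in-memory buffer keyed by the rollup key and counts how many rows get written out under each sort order.

```python
from collections import namedtuple

# Hypothetical row shape: a timestamp in seconds, a tuple of dimension
# values, and a single metric to be summed during rollup.
Row = namedtuple("Row", ["timestamp", "dims", "count"])

HOUR = 3600  # assumed rollup granularity, in seconds


def rollup_key(row, granularity=HOUR):
    """Truncated timestamp plus dimensions: rows sharing it roll up together."""
    return (row.timestamp - row.timestamp % granularity, row.dims)


def spilled_rows(rows, max_rows_in_memory):
    """Aggregate rows in arrival order into a bounded buffer.

    Returns the total number of aggregated rows written out, counting
    both mid-stream spills and the final flush.
    """
    buffer = {}
    spilled = 0
    for row in rows:
        key = rollup_key(row)
        if key not in buffer and len(buffer) >= max_rows_in_memory:
            spilled += len(buffer)  # buffer is full: write it out, start over
            buffer.clear()
        buffer[key] = buffer.get(key, 0) + row.count
    return spilled + len(buffer)


# Four distinct dimension tuples, each seen three times within one hour.
rows = [Row(ts, ("page%d" % (ts % 4),), 1) for ts in range(12)]

by_timestamp = sorted(rows, key=lambda r: r.timestamp)
by_rollup_key = sorted(rows, key=rollup_key)

print(spilled_rows(by_timestamp, 2))
print(spilled_rows(by_rollup_key, 2))
```

On this input, with room for two rows in memory, timestamp order writes out 12 rows (nothing gets rolled up before spilling) while rollup-key order writes out 4 (one per distinct key), so there is less to spill and less to merge afterwards.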
