Skip to content

DruidInputSource: Sort segments by ID before grouping into splits.#10646

Merged
gianm merged 1 commit intoapache:masterfrom
gianm:druid-split-time-locality
Dec 7, 2020
Merged

DruidInputSource: Sort segments by ID before grouping into splits.#10646
gianm merged 1 commit intoapache:masterfrom
gianm:druid-split-time-locality

Conversation

@gianm
Copy link
Copy Markdown
Contributor

@gianm gianm commented Dec 5, 2020

This is useful because it groups up segments for the same time chunk
into the same splits, which in turn is useful because it minimizes the
number of time chunks that each task will have to deal with.

This is useful because it groups up segments for the same time chunk
into the same splits, which in turn is useful because it minimizes the
number of time chunks that each task will have to deal with.
@gianm gianm force-pushed the druid-split-time-locality branch from 00b55c3 to b32ed89 Compare December 5, 2020 22:02
@jihoonson
Copy link
Copy Markdown
Contributor

LGTM

@gianm gianm merged commit 9acab0b into apache:master Dec 7, 2020
@gianm gianm deleted the druid-split-time-locality branch December 7, 2020 21:48
@jihoonson jihoonson added this to the 0.21.0 milestone Jan 4, 2021
JulianJaffePinterest pushed a commit to JulianJaffePinterest/druid that referenced this pull request Jan 22, 2021
…pache#10646)

This is useful because it groups up segments for the same time chunk
into the same splits, which in turn is useful because it minimizes the
number of time chunks that each task will have to deal with.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants