Skip to content

Determine partitions better#56

Merged
cheddar merged 5 commits intomasterfrom
determine-partitions
Jan 24, 2013
Merged

Determine partitions better#56
cheddar merged 5 commits intomasterfrom
determine-partitions

Conversation

@gianm
Copy link
Copy Markdown
Contributor

@gianm gianm commented Jan 16, 2013

  • Can handle non-rolled-up input (by grouping input rows using an additional MR stage)
  • Can select its own partitioning dimension, if none is supplied
  • Can detect and avoid oversized shards due to bad dimension value distribution
  • Shares input parsing code with IndexGeneratorJob

gianm added 3 commits January 15, 2013 22:30
- Can handle non-rolled-up input (by grouping input rows using an additional MR stage)
- Can select its own partitioning dimension, if none is supplied
- Can detect and avoid oversized shards due to bad dimension value distribution
- Shares input parsing code with IndexGeneratorJob
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It's kinda weird to pull the configuration out for each and every property... Looks like that's how the code was, which was needless. Let's fix that up now and just get the configuration once.

gianm added 2 commits January 21, 2013 14:38
HadoopDruidIndexerConfig:
- Add partitionsSpec (backwards compatible with targetPartitionSize and partitionDimension)
- Add assumeGrouped flag to partitionsSpec

DeterminePartitionsJob:
- Skip group-by job if assumeGrouped is set
- Clean up code a bit
Conflicts:
	indexer/src/main/java/com/metamx/druid/indexer/IndexGeneratorJob.java
cheddar added a commit that referenced this pull request Jan 24, 2013
@cheddar cheddar merged commit ec034dd into master Jan 24, 2013
@cheddar cheddar deleted the determine-partitions branch January 24, 2013 20:57
cheddar pushed a commit to cheddar/druid that referenced this pull request Jul 1, 2015
New UT for classes RetryPolicyFactory and TaskToolbox
jon-wei pushed a commit to jon-wei/druid that referenced this pull request Feb 13, 2020
… (apache#56)

* ANY Aggregator should not skip null values implementation

* add tests

* add more tests

* Update documentation

* add more tests

* address review comments

* optimize StringAnyBufferAggregator

* fix failing tests

* address pr comments
GabrielCWT added a commit to GabrielCWT/druid that referenced this pull request Oct 21, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants