[WIP] Mapside aggregating hadoop indexer #2670
Conversation
Waiting on #2650.
@navis, how much did the performance improve?
Have not read the impl yet, but this is a nice concept IMO. My one request at a high level is that it should either be off by default or have a very conservative default for maxRowsInMemory, to prevent people from hitting OOMEs on upgrade.
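To make the concern concrete, here is a minimal sketch of what such defaults could look like; the class and field names are assumptions for illustration, not the actual patch:

```java
// Illustrative only: names here are hypothetical, not from the patch.
public class MapsideAggConfig
{
  // Off by default, so existing jobs behave exactly as before the upgrade.
  private final boolean enableMapsideAggregation;
  // Deliberately small default so mapper heap usage stays bounded unless
  // the user explicitly opts in to a larger value.
  private final int maxRowsInMemory;

  public MapsideAggConfig(Boolean enableMapsideAggregation, Integer maxRowsInMemory)
  {
    this.enableMapsideAggregation = enableMapsideAggregation != null && enableMapsideAggregation;
    this.maxRowsInMemory = maxRowsInMemory == null ? 5000 : maxRowsInMemory;
  }

  public boolean isEnableMapsideAggregation() { return enableMapsideAggregation; }

  public int getMaxRowsInMemory() { return maxRowsInMemory; }
}
```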
The hadoop combiner allows you to truly merge "all" the rows that are mergeable at the mapper, because it is pretty much like the reducer: hadoop will carefully supply it all the rows that are mergeable together.
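For reference, a combiner is wired in as an ordinary Reducer run on map output; a minimal generic example (SumCombiner is illustrative, not from this patch):

```java
import java.io.IOException;

import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Reducer;

public class CombinerWiring
{
  // A combiner is just a Reducer applied to mapper output before the shuffle.
  public static class SumCombiner extends Reducer<Text, LongWritable, Text, LongWritable>
  {
    @Override
    protected void reduce(Text key, Iterable<LongWritable> values, Context context)
        throws IOException, InterruptedException
    {
      long sum = 0;
      for (LongWritable v : values) {
        sum += v.get(); // every value is deserialized here, then re-serialized below
      }
      context.write(key, new LongWritable(sum));
    }
  }

  public static void configure(Job job)
  {
    job.setCombinerClass(SumCombiner.class);
  }
}
```

This is what gives the guaranteed merge described above: the framework sorts map output by key and hands the combiner every co-located value for a key, but at the cost of serde and sorting on the way through.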
My experience in general is that combiners are good if reducers are a big bottleneck and mappers are not. But if things are more balanced between mappers and reducers, then doing in-heap aggregation on the mappers incurs less mapper overhead than using a combiner (combiners involve serde and sorting costs). So it could be better or worse depending on the workload. It's kind of a point of contention in the hadoop world, though (some projects prefer combiners and some prefer in-heap aggregation…). Some even support both at the same time :) It would certainly be good to see real numbers for this approach vs useCombiner for your workload, @navis. Even better if you could also share numbers for this approach + useCombiner used together vs either one alone.
@gianm say the very first row and the very last row in the dataset are mergeable together; then the only way you will merge them without a combiner is if you hold the IncrementalIndex for that group key until the very last moment. Now multiply that by the total number of distinct group keys the mapper gets, if you truly want to merge everything that could be merged.
@himanshug it's not necessary to merge everything; the idea with in-heap aggregation is that it's just best effort, but that can be good enough for a substantial reduction in data sent to the reducers without incurring the overhead of guaranteed merging at both the mapper and reducer levels. It would be good to confirm for some real-world workloads whether or not this approach works better than a combiner.
It needs to check whether inputRow is an instance of SegmentInputRow.
It seems that combiningAggs should be used instead of aggregators in that case.
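Something along these lines (aggregators and combiningAggs are the fields named above; the exact plumbing in the patch may differ):

```java
// Rows re-read from existing segments (SegmentInputRow) are already partially
// aggregated, so they must go through the combining variants of the aggregators.
final AggregatorFactory[] aggsToUse = (inputRow instanceof SegmentInputRow)
    ? combiningAggs
    : aggregators;
```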
IMO this patch looks really good, especially when the input data is roughly sorted by timestamp, and my machine logs are like that.
It needs more elaboration, but:
How about using bucket intervals to make each IncrementalIndex, to reduce the memory footprint (maybe)? Like the below:
config.getGranularitySpec().bucketInterval(timestamp).get();
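Expanded into a rough sketch (indexPerBucket and makeIndex() are illustrative names, not from the patch; bucketInterval() returns a Guava Optional, per the line above):

```java
// One small IncrementalIndex per segment-granularity bucket, instead of a
// single big index holding every bucket at once.
private final Map<Interval, IncrementalIndex> indexPerBucket = new HashMap<>();

private void addRow(InputRow row) throws IOException
{
  final Optional<Interval> bucket =
      config.getGranularitySpec().bucketInterval(new DateTime(row.getTimestampFromEpoch()));
  if (!bucket.isPresent()) {
    return; // row falls outside the configured intervals; skip (or count it)
  }
  // Each bucket's index stays small and can be flushed independently.
  indexPerBucket.computeIfAbsent(bucket.get(), interval -> makeIndex()).add(row);
}
```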
I think @gianm already explained the intention of this patch well (thanks!). The current serde for inputRow produces a very big, complex binary, and it did not seem possible to use the raw-binary combiner (forgot the exact name of this in hadoop). So to combine rows, hadoop must read the object back from the binary and write it out again, which is big work for both cpu and memory.
This pull request has been marked as stale due to 60 days of inactivity. It will be closed in 1 week if no further activity occurs. If you think that’s incorrect or this pull request requires a review, please simply write any comment. If closed, you can revive the PR at any time and @mention a reviewer or discuss it on the dev@druid.apache.org list. Thank you for your contributions.
This pull request has been closed due to lack of activity. If you think that is incorrect, or the pull request requires review, you can revive the PR at any time. |
The current hadoop indexer pushes rows one by one to the context, but if the timestamps of rows do not vary much (like in an hourly batch), we can aggregate rows in mapper memory first. I know there is a combiner in hadoop, but it's infamously inefficient; even hive didn't use it.
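As a rough illustration of the approach (the Mapper API is real Hadoop; the index handling and the toKey/toValue/makeNewIndex helpers are assumptions, not the actual patch):

```java
import java.io.IOException;

import io.druid.data.input.InputRow;
import io.druid.data.input.Row;
import io.druid.segment.incremental.IncrementalIndex;
import org.apache.hadoop.io.BytesWritable;
import org.apache.hadoop.mapreduce.Mapper;

// Sketch of best-effort mapside aggregation: rows sharing a (truncated)
// timestamp and dimension values merge in an in-heap IncrementalIndex, and
// only partial aggregates are written to the context for the reducers.
public class AggregatingMapper extends Mapper<Object, InputRow, BytesWritable, BytesWritable>
{
  private IncrementalIndex index;
  private int maxRowsInMemory = 5000; // conservative bound against mapper OOMEs

  @Override
  protected void setup(Context context)
  {
    // maxRowsInMemory would come from the tuning config; constant here for brevity.
    index = makeNewIndex();
  }

  @Override
  protected void map(Object key, InputRow row, Context context)
      throws IOException, InterruptedException
  {
    index.add(row); // mergeable rows aggregate in-heap here
    if (index.size() >= maxRowsInMemory) {
      flush(context); // best effort only; the reducers still finish the merge
    }
  }

  @Override
  protected void cleanup(Context context) throws IOException, InterruptedException
  {
    flush(context);
  }

  private void flush(Context context) throws IOException, InterruptedException
  {
    for (Row row : index) {
      context.write(toKey(row), toValue(row));
    }
    index = makeNewIndex(); // start over with a fresh, empty index
  }

  // Hypothetical helpers standing in for the patch's actual serde and index setup.
  private BytesWritable toKey(Row row) { throw new UnsupportedOperationException("sketch"); }
  private BytesWritable toValue(Row row) { throw new UnsupportedOperationException("sketch"); }
  private static IncrementalIndex makeNewIndex() { throw new UnsupportedOperationException("sketch"); }
}
```

On input that is roughly sorted by timestamp (as mentioned above), most mergeable rows land in the same in-heap index before a flush, so this best-effort merge captures much of the reduction a combiner would, without its serde and sort costs.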