
[Improvement] DataSegment intern improvement (reduce 60% memory consume on coordinator)#8165

Closed
pzhdfy wants to merge 3 commits into apache:master from pzhdfy:datasegment_intern

Conversation

@pzhdfy
Contributor

@pzhdfy pzhdfy commented Jul 26, 2019

Description

In our Druid cluster, we have 10 million active segments, and we set a load rule with 2 replicas.
We found that the coordinator consumed 50GB of memory and ran into GC problems.

We dumped the JVM heap and analyzed the memory, and found about 30 million DataSegment objects.
This is because each segment generates 3 DataSegment objects:
one from polling the DB in SQLMetadataSegmentManager,
and the other two from the ZooKeeper announcements (2 replicas) in BatchServerInventoryView.

These three DataSegment objects are usually identical,
so we can use
Interner<DataSegment> DATA_SEGMENT_INTERNER = Interners.newWeakInterner();
to deduplicate them.

1. When polling from the DB or reading from a znode, use DATA_SEGMENT_INTERNER to deduplicate.
2. When polling from the DB, always update loadSpec in the DataSegment; this is useful for deep storage migration.
3. When reading from a znode, skip interning segments from realtime nodes, because segments from realtime nodes are short-lived and have incorrect size, dimensions, and loadSpec.

With this improvement, memory consumption in the coordinator drops to 20GB.
It also helps on the broker, dropping from 35GB to 18GB.
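The deduplication step above can be sketched with a minimal interner. This is a simplified illustration, not the actual patch: a plain ConcurrentHashMap stands in for Guava's Interners.newWeakInterner() (a real weak interner additionally lets segments that are no longer referenced be garbage-collected), and MinimalSegment is a hypothetical stand-in for Druid's DataSegment, keying equality on the segment identifier.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

public class SegmentInternDemo {
    // Hypothetical stand-in for Druid's DataSegment; equality is based on the segment id.
    static final class MinimalSegment {
        final String id;
        final Map<String, Object> loadSpec;

        MinimalSegment(String id, Map<String, Object> loadSpec) {
            this.id = id;
            this.loadSpec = loadSpec;
        }

        @Override
        public boolean equals(Object o) {
            return o instanceof MinimalSegment && ((MinimalSegment) o).id.equals(id);
        }

        @Override
        public int hashCode() {
            return id.hashCode();
        }
    }

    // Strong interner as a simplified stand-in for Guava's Interners.newWeakInterner():
    // equal segments collapse to one canonical instance.
    static final ConcurrentHashMap<MinimalSegment, MinimalSegment> INTERNER = new ConcurrentHashMap<>();

    static MinimalSegment intern(MinimalSegment segment) {
        MinimalSegment existing = INTERNER.putIfAbsent(segment, segment);
        return existing != null ? existing : segment;
    }

    public static void main(String[] args) {
        // The same segment arrives twice: once from the DB poll, once from a ZK announcement.
        MinimalSegment fromDb = new MinimalSegment("seg_2019-07-26", Map.of("type", "hdfs"));
        MinimalSegment fromZk = new MinimalSegment("seg_2019-07-26", Map.of("type", "hdfs"));

        // Two equal-but-distinct objects collapse to a single canonical instance.
        System.out.println(intern(fromDb) == intern(fromZk)); // true
    }
}
```

With one canonical instance per segment, the 3x duplication described above (one DB copy plus one per replica announcement) reduces to a single object per segment, which is where the reported memory savings come from.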

@leventov leventov self-requested a review July 26, 2019 16:24
private static final Interner<String> STRING_INTERNER = Interners.newWeakInterner();
private static final Interner<List<String>> DIMENSIONS_INTERNER = Interners.newWeakInterner();
private static final Interner<List<String>> METRICS_INTERNER = Interners.newWeakInterner();
private static final Interner<DataSegment> DATA_SEGMENT_INTERNER = Interners.newWeakInterner();
Member


This humongous Map with millions of weak references would be a problem for GC in itself: see #6357 for context.

You should adopt the design with BatchServerInventoryView and SQLMetadataSegmentManager probing into each other's memory, similarly to what is explained here: #7395 (comment)

-    private final Map<String, Object> loadSpec;
     private final List<String> dimensions;
     private final List<String> metrics;
+    private volatile Map<String, Object> loadSpec;
Member


Making DataSegment non-immutable was ruled out in this discussion: #7571. Please read it in full. I proposed a solution here: #7571 (comment). Please check if you can implement it in this PR.

{
  DataSegment result = DATA_SEGMENT_INTERNER.intern(dataSegment);
  if (updateLoadSpec) {
    result.dimensions = dataSegment.dimensions;
Member


I think DataSegment shouldn't become mutable. However, it would be nice if you would solve this problem in this PR: #6358.

@drcrallen
Contributor

Some history on interning and data segments for any archeologists out there: #3238
and its downfall #3286

@stale

stale Bot commented Oct 6, 2019

This pull request has been marked as stale due to 60 days of inactivity. It will be closed in 4 weeks if no further activity occurs. If you think that's incorrect or this pull request should instead be reviewed, please simply write any comment. Even if closed, you can still revive the PR at any time or discuss it on the dev@druid.apache.org list. Thank you for your contributions.

@stale stale Bot added the stale label Oct 6, 2019
@stale

stale Bot commented Nov 3, 2019

This pull request/issue has been closed due to lack of activity. If you think that is incorrect, or the pull request requires review, you can revive the PR at any time.
