
Conversation

@eolivelli (Contributor) commented Jul 6, 2022

Motivation

When you use a Shared subscription, the Dispatcher (PersistentDispatcherMultipleConsumers) does out-of-order reads from the ManagedLedger. This happens very frequently in the consumerFlow method, which basically re-reads messages that have not been processed yet.

This happens because consumerFlow calls readMoreEntries(), which in turn finds that there are messages to be re-delivered because they have not yet been acknowledged by any consumer.

This triggers backward seeks in BlobStoreBackedReadHandleImpl. Even if the data is already loaded in memory, you usually have to parse it again, because the seek currently lands at the beginning of a block: the index keeps track of only some entries, not of every entry.
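
To illustrate the effect, here is a minimal, hypothetical sketch (not the actual index or BlobStoreBackedReadHandleImpl code) of a coarse-grained offset index: because only some entries are indexed, a backward seek can only land on the nearest indexed entry before the target, and everything from that offset up to the target must be parsed again.

    import java.util.Map;
    import java.util.TreeMap;

    // Hypothetical model of a coarse-grained offset index: only every
    // 1000th entry id is mapped to a byte offset inside the offloaded block.
    class CoarseOffsetIndex {
        private final TreeMap<Long, Long> indexedOffsets = new TreeMap<>();

        void record(long entryId, long dataOffset) {
            if (entryId % 1000 == 0) {
                indexedOffsets.put(entryId, dataOffset);
            }
        }

        // A backward seek for entryId can only jump to the nearest indexed
        // entry at or before it; all entries after that offset are re-parsed.
        long seekOffsetFor(long entryId) {
            Map.Entry<Long, Long> floor = indexedOffsets.floorEntry(entryId);
            return floor != null ? floor.getValue() : 0L;
        }
    }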

Modifications

Add an in-memory cache of the offset of each entry, so that when a backward seek is needed we seek exactly to the position where the entry can be found.

The trade-off is that building this TreeMap adds some memory cost.
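
To make the idea concrete, this is a minimal sketch of the approach with made-up names, not the exact code of this patch: remember the exact offset of every entry while reading forward, and consult that map before a backward seek, falling back to the block index only on a miss.

    import java.util.TreeMap;

    // Illustrative sketch of the per-entry offset cache (names are invented).
    class EntryOffsets {
        // entryId -> exact byte offset of the entry inside the offloaded object
        private final TreeMap<Long, Long> offsets = new TreeMap<>();

        // Called while reading forward, once the exact position of an entry is known.
        void remember(long entryId, long dataOffset) {
            offsets.put(entryId, dataOffset);
        }

        // Called before a backward seek: returns the exact offset if this entry
        // was seen before, or null so the caller falls back to the block index.
        Long exactOffsetOrNull(long entryId) {
            return offsets.get(entryId);
        }
    }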

Verifying this change

This change added tests.
I did some manual testing with GCP: this patch brings a 2x improvement in throughput of a Shared subscription (with pulsar-perf) with a single consumer.

@dave2wave did more extensive testing with complex scenarios using the OpenMessaging Benchmark and confirmed that this patch brings a big improvement for Shared subscriptions.

  • doc-not-needed

@eolivelli eolivelli added the type/enhancement, area/tieredstorage, and doc-not-needed labels Jul 6, 2022
@eolivelli eolivelli added this to the 2.11.0 milestone Jul 6, 2022
@eolivelli eolivelli requested review from hangc0276 and zymap July 6, 2022 09:01
@eolivelli eolivelli self-assigned this Jul 6, 2022
@dave2wave (Member) commented Jul 6, 2022

We found trouble with consumption of an offloaded backlog using a shared subscription (failover subscriptions perform well). Using the OMB we built a 64 GB backlog of 100-byte messages across 10 topics with 3 partitions each.

  1. Shared subscriptions were consumed at 3,000-10,000 messages per second.
  2. Failover subscriptions were consumed at 1,000,000 messages per second.

With this patch we see a big improvement and shared subscriptions are consumed like this:
[Screenshot: shared subscription consumption rate with the patch applied]

// never over to the last entry again.
if (!seeked) {
    inputStream.seek(index.getIndexEntryForEntry(nextExpectedId).getDataOffset());
    Long knownOffset = entryOffsets.get(nextExpectedId);
Contributor
Maybe we can add a function to package this logic, and have all the index.getIndexEntryForEntry(nextExpectedId).getDataOffset() call sites go through this function.

Other seek places also need to check the cache first.
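
As an illustration, a sketch of the kind of helper being suggested, reusing the identifiers visible in the diff above; the method name and exception signature are assumptions, and the fields entryOffsets and index are the ones already present in the surrounding class.

    // Hypothetical helper: check the per-entry offsets cache first and fall back
    // to the coarse block index, so every seek site goes through a single method.
    private long getOffsetToSeek(long entryId) throws IOException {
        Long knownOffset = entryOffsets.get(entryId); // exact offset, if this entry was seen before
        if (knownOffset != null) {
            return knownOffset;
        }
        // Fallback to the coarse index: this may point at the beginning of a block,
        // forcing the entries before entryId to be parsed again.
        return index.getIndexEntryForEntry(entryId).getDataOffset();
    }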

Contributor Author

Sounds good. In my testing it looked like only this point needs the cache, but it is better to use it everywhere.

@eolivelli eolivelli force-pushed the impl/offloaders-improve-shared branch from 5997cd1 to afea8e9 Compare July 7, 2022 15:26
@eolivelli eolivelli requested a review from hangc0276 July 7, 2022 15:37
// this Cache is accessed only by one thread
private final Cache<Long, Long> entryOffsets = CacheBuilder
        .newBuilder()
        .expireAfterAccess(10, TimeUnit.MINUTES)
Contributor
10 minutes may be too short; maybe 30 minutes to 1 hour? This cache can also be shared with other catch-up readers.

Contributor Author

I have changed it to 30 minutes and made it configurable via a system property.
I don't think we have to document the system property; I made it configurable only in case of problems.
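
For illustration only, the configurability could look roughly like the sketch below; the property name here is made up and is not necessarily the one used in the patch.

    import com.google.common.cache.Cache;
    import com.google.common.cache.CacheBuilder;
    import java.util.concurrent.TimeUnit;

    class OffsetsCacheSketch {
        // Hypothetical system property name; defaults to 30 minutes when not set.
        private static final long OFFSETS_CACHE_TTL_MINUTES =
                Long.getLong("pulsar.tieredstorage.offsetsCacheTtlMinutes", 30);

        private final Cache<Long, Long> entryOffsets = CacheBuilder.newBuilder()
                .expireAfterAccess(OFFSETS_CACHE_TTL_MINUTES, TimeUnit.MINUTES)
                .build();
    }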

@hangc0276 (Contributor) left a comment

Good job!

@hangc0276 (Contributor) commented
ping @zymap @horizonzy please help take a look, thanks.

@eolivelli eolivelli requested a review from hangc0276 July 8, 2022 10:24
@eolivelli eolivelli force-pushed the impl/offloaders-improve-shared branch 2 times, most recently from dba2dba to f0237a7 Compare July 8, 2022 11:55
@eolivelli eolivelli force-pushed the impl/offloaders-improve-shared branch from f0237a7 to 388912d Compare July 11, 2022 12:04
@horizonzy (Member) left a comment

LGTM.

@codelipenghui (Contributor) commented
@eolivelli Please rebase the master branch

@eolivelli (Contributor Author) commented
@Technoboy- @codelipenghui
I have merged with the latest master, but Test Group 1 still fails with:
[Screenshot: CI failure in Test Group 1]

@eolivelli eolivelli merged commit 99bca8b into apache:master Jul 13, 2022
gaozhangmin pushed a commit to gaozhangmin/pulsar that referenced this pull request Jul 14, 2022
wuxuanqicn pushed a commit to wuxuanqicn/pulsar that referenced this pull request Jul 14, 2022
dragonls pushed a commit to dragonls/pulsar that referenced this pull request Oct 21, 2022