ISSUE-2806: OOM as 1 million ledgers per entry log #2891

suiyuzeng · 2021-11-12T03:54:29Z

Motivation

fix issue 2806 OOM as 1 million ledgers per entry log

Changes

1.Add two config entryLogLedgerMapConcurrency and flushEntrySortBufferInitSize, to avoid allocating a large contiguous memory.
2.Reduce memory usage of EntryLogMetadata of entry logs in garbage collector. If the retention time of all the entry is the same, we do not need to extract the entry logs in retention time to load meta to memory. In org.apache.bookkeeper.bookie.GarbageCollectorThread#extractMetaFromEntryLogs, if the entry log in retention time, do not extract the entry log.

Vanlightly · 2021-11-12T11:03:20Z

Regarding garbage collection, retention is not determined by BookKeeper but whoever controls the ledger metadata (Pulsar for example). I don't think using a configured time period is the way to address this. The issue is that we try to store all metadata in memory, but given that each entry log has a ledger map, we can perform a scan process where we incrementally perform GC without all metadata in memory.

suiyuzeng · 2021-11-18T11:45:19Z

Regarding garbage collection, retention is not determined by BookKeeper but whoever controls the ledger metadata (Pulsar for example). I don't think using a configured time period is the way to address this. The issue is that we try to store all metadata in memory, but given that each entry log has a ledger map, we can perform a scan process where we incrementally perform GC without all metadata in memory.

Thanks for your suggestion.
I meet the issue in puslar for iot. There are millions topics which has the retention time. It works well in this scene. But it may be not common way to the problem like this. A scan process is better for this. I will fix the issue in this way.

suiyuzeng · 2021-11-18T12:11:27Z

If the count of ledger is not veray large but the entry log count is very large, the memory of the ledger map will not be very large and it will lead to io read, especially for extractEntryLogMetadataByScanning. The current way is better for this. How about reserving two way and adding a config to choose?
@Vanlightly

Vanlightly · 2021-11-24T16:10:14Z

@suiyuzeng it could be a way forward. I think depending on the result we may be happy with just using the scan method. So at first make it configurable which will allow us to evaluate both methods under similar conditions.

suiyuzeng · 2021-11-30T09:30:21Z

ok

suiyuzeng · 2022-01-10T12:21:45Z

@suiyuzeng it could be a way forward. I think depending on the result we may be happy with just using the scan method. So at first make it configurable which will allow us to evaluate both methods under similar conditions.

@Vanlightly hi, I find #1949 merged recently fix the issue when i develop the scan way. It store the meta into rocksdb. This way reduce the memory and just read the meta from the entrylog for one times. Do we need the scan way any more?

StevenLuMT · 2022-07-29T04:14:08Z

if #1949 fix the issue, please know it @hangc0276 @Vanlightly
@suiyuzeng please close your pr, thanks

ISSUE-2806: OOM as 1 million ledgers per entry log

46a7494

hangc0276 requested review from Vanlightly, eolivelli, hangc0276, hezhangjian, merlimat, nicoloboschi and zymap July 25, 2022 02:12

hangc0276 assigned suiyuzeng Jul 25, 2022

hangc0276 added type/bug area/bookie labels Jul 25, 2022

hangc0276 added this to the 4.16.0 milestone Jul 25, 2022

suiyuzeng closed this Jul 29, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ISSUE-2806: OOM as 1 million ledgers per entry log #2891

ISSUE-2806: OOM as 1 million ledgers per entry log #2891

Uh oh!

suiyuzeng commented Nov 12, 2021

Uh oh!

Vanlightly commented Nov 12, 2021

Uh oh!

suiyuzeng commented Nov 18, 2021 •

edited

Loading

Uh oh!

suiyuzeng commented Nov 18, 2021

Uh oh!

Vanlightly commented Nov 24, 2021

Uh oh!

suiyuzeng commented Nov 30, 2021

Uh oh!

suiyuzeng commented Jan 10, 2022

Uh oh!

StevenLuMT commented Jul 29, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ISSUE-2806: OOM as 1 million ledgers per entry log #2891

ISSUE-2806: OOM as 1 million ledgers per entry log #2891

Uh oh!

Conversation

suiyuzeng commented Nov 12, 2021

Motivation

Changes

Uh oh!

Vanlightly commented Nov 12, 2021

Uh oh!

suiyuzeng commented Nov 18, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

suiyuzeng commented Nov 18, 2021

Uh oh!

Vanlightly commented Nov 24, 2021

Uh oh!

suiyuzeng commented Nov 30, 2021

Uh oh!

suiyuzeng commented Jan 10, 2022

Uh oh!

StevenLuMT commented Jul 29, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

suiyuzeng commented Nov 18, 2021 •

edited

Loading