Fix core dumps triggered by rocksdb compacting when shutdown bk #4706

AnonHxy · 2026-01-20T16:06:21Z

Descriptions of the changes in this PR:

Motivation

Changes

1. Set the compacting flag of entryLocationIndex to true when LedgerStorage shuts down, to stop any subsequent compaction.
2. When LedgerStorage shuts down, wait until any in-progress compaction ends.
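
The two changes can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: `LocationIndexSketch`, its `closed` field, and the placeholder comments standing in for the RocksDB calls are all assumptions made for the example.

```java
import java.util.concurrent.atomic.AtomicBoolean;

// Hypothetical stand-in for EntryLocationIndex; not the real BookKeeper class.
class LocationIndexSketch {
    // true while a compaction is running, and left true forever once closed
    private final AtomicBoolean compacting = new AtomicBoolean(false);
    private volatile boolean closed = false;

    /** Returns true if the compaction ran, false if it was skipped. */
    boolean compact() {
        if (!compacting.compareAndSet(false, true)) {
            // Another compaction is running, or close() has claimed the flag.
            return false;
        }
        try {
            // Placeholder: the real RocksDB compaction would happen here.
            return true;
        } finally {
            compacting.set(false); // release the flag when compaction ends
        }
    }

    void close() {
        // Wait until any in-progress compaction releases the flag, then keep it
        // set so that later compact() calls are skipped.
        while (!compacting.compareAndSet(false, true)) {
            try {
                Thread.sleep(100);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                throw new IllegalStateException("interrupted while waiting", e);
            }
        }
        closed = true; // Placeholder: the underlying DB would be closed here.
    }

    boolean isClosed() {
        return closed;
    }
}
```

Because `close()` never resets the flag, the same `compareAndSet` covers both cases: a compaction that starts late is skipped, and a compaction that started early is waited for.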

zymap · 2026-01-21T02:03:58Z

...r/src/main/java/org/apache/bookkeeper/bookie/storage/ldb/SingleDirectoryDbLedgerStorage.java

+            while (!entryLocationIndex.compareAndSetCompacting(false, true)) {
+                Thread.sleep(100);
+            }


Shouldn't it be done in the close method in the EntryLocationIndex?

Makes sense, @zymap.

StevenLuMT · 2026-01-25T22:47:29Z

...keeper-server/src/main/java/org/apache/bookkeeper/bookie/storage/ldb/EntryLocationIndex.java

    @Override
    public void close() throws IOException {
+        log.info("Closing EntryLocationIndex");
+        while (!compacting.compareAndSet(false, true)) {


This wait could leave the system stuck here indefinitely, or at least for an exceptionally long time.

Should we add a maximum waiting time, or do something else, such as modifying RocksDB operations?

I don't think so. Closing a DB that is in the middle of a compaction may trigger core dumps or other unexpected errors. If closing the DB takes too long, I think we'd better find out why, or use kill -9 if we need to.

The handling approach here is similar to the procedure below:

bookkeeper/bookkeeper-server/src/main/java/org/apache/bookkeeper/bookie/GarbageCollectorThread.java

Lines 770 to 772 in 3a5cf9d

while (!compacting.compareAndSet(false, true)) {
    // Wait till the thread stops compacting
    Thread.sleep(100);
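
For comparison, the maximum waiting time raised in the review question could look roughly like the sketch below. It is not part of this PR; `BoundedWaitSketch`, `claimWithin`, and both timing parameters are hypothetical names chosen for the illustration.

```java
import java.util.concurrent.atomic.AtomicBoolean;

// Hypothetical helper; nothing with this name exists in BookKeeper.
class BoundedWaitSketch {
    /**
     * Tries to claim the flag, polling every pollMillis until timeoutMillis elapses.
     * Returns true if the flag was claimed, false on timeout or interruption.
     */
    static boolean claimWithin(AtomicBoolean compacting, long timeoutMillis, long pollMillis) {
        long deadline = System.nanoTime() + timeoutMillis * 1_000_000L;
        while (!compacting.compareAndSet(false, true)) {
            if (System.nanoTime() - deadline >= 0) {
                return false; // compaction still holds the flag after the timeout
            }
            try {
                Thread.sleep(pollMillis);
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                return false;
            }
        }
        return true;
    }
}
```

A timeout only moves the decision to the caller: when `claimWithin` returns false the compaction is still running, so closing the DB at that point would reintroduce the situation this PR is meant to avoid.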

StevenLuMT · 2026-01-25T22:50:46Z

...keeper-server/src/main/java/org/apache/bookkeeper/bookie/storage/ldb/EntryLocationIndex.java

    public void compact() throws IOException {
        try {
-            isCompacting = true;
+            if (!compacting.compareAndSet(false, true)) {


I have a question:

Is the way isCompacting is set the root cause of this problem?
Or is this simply a cleaner rewrite of the method, done to rule out issues with the isCompacting setting?

Yes, it's the root cause. We should cancel the compaction if the DB has already been closed, or delay closing the DB if a compaction is already in progress. So here we need an atomic variable to serve as a flag for the DB status. @StevenLuMT
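
The "delay closing while a compaction is in progress" half of that answer can be exercised with a small two-thread sketch. Everything here is assumed for the illustration: `CloseOrderingSketch`, the simulated 200 ms of compaction work, and the `compactionDone` marker are not taken from the PR.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.atomic.AtomicBoolean;

// Hypothetical demo class; it only models the flag hand-off, not RocksDB.
class CloseOrderingSketch {
    /** Returns true if the closing side saw the compaction as finished. */
    static boolean closeWaitsForCompaction() {
        AtomicBoolean compacting = new AtomicBoolean(false);
        AtomicBoolean compactionDone = new AtomicBoolean(false);
        CountDownLatch started = new CountDownLatch(1);

        Thread compactor = new Thread(() -> {
            if (compacting.compareAndSet(false, true)) {
                started.countDown();
                try {
                    Thread.sleep(200); // simulated compaction work
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                } finally {
                    compactionDone.set(true);
                    compacting.set(false); // release only after the work is done
                }
            }
        });
        compactor.start();

        try {
            started.await(); // the compaction now holds the flag
            // Closing side: poll until the flag is free, then keep it set.
            while (!compacting.compareAndSet(false, true)) {
                Thread.sleep(10);
            }
            boolean sawDone = compactionDone.get();
            compactor.join();
            return sawDone;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted during demo", e);
        }
    }
}
```

The closing side can only win the `compareAndSet` after the compactor has reset the flag, which happens after `compactionDone` is set, so the close step never overlaps the simulated compaction.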

Fix core dumps triggered by rocksdb compacting when shutdown bk

bc91be9

AnonHxy force-pushed the fix_rocksdb_compact_coredump branch from 560ace4 to bc91be9 Compare January 20, 2026 16:09

checkstyle

3e60a8d

zymap reviewed Jan 21, 2026


AnonHxy added 2 commits January 22, 2026 00:47

address comment

67a6ac2

log

7018ea1

AnonHxy force-pushed the fix_rocksdb_compact_coredump branch from ecaea19 to 7018ea1 Compare January 22, 2026 01:18

optimize and ut

abee9aa

AnonHxy force-pushed the fix_rocksdb_compact_coredump branch from 1556a35 to abee9aa Compare January 22, 2026 05:01

StevenLuMT reviewed Jan 25, 2026
