Fix SST files not being cleaned up in the locations folder #4555

Open · wants to merge 3 commits into master

Conversation


danpi commented Feb 20, 2025

Description of the changes in this PR:

Fix #4554

Motivation

Resolve the issue of SST files not being cleaned up in the locations folder.

Changes

  1. Reuse the cleanup logic of GarbageCollectorThread to trigger entryLocationCompact() when a major compaction occurs.
  2. Add the entryLocationCompaction configuration option to control the above behavior, disabled by default (a sketch of the combined gate follows this list).
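Taken together, the two changes amount to a time-gated trigger inside the garbage-collector loop. A minimal, self-contained sketch of that gate; the field names mirror the diff excerpts later in this thread, but the values and method bodies are illustrative assumptions, not the actual patch:

```java
public class EntryLocationCompactionGate {
    // Field names mirror the diff excerpts below; values are illustrative.
    private boolean enableEntryLocationCompaction = true;         // the PR defaults this to false
    private long entryLocationCompactionIntervalMs = 86_400_000L; // assumption: 1 day
    private long lastEntryLocationCompactionTime = 0L;

    // Called from the GC loop: compact only when enabled and the interval has elapsed.
    void maybeCompactEntryLocations(long curTime) {
        if (enableEntryLocationCompaction
                && curTime - lastEntryLocationCompactionTime > entryLocationCompactionIntervalMs) {
            entryLocationCompact();
            lastEntryLocationCompactionTime = curTime;
        }
    }

    private void entryLocationCompact() {
        // In the actual PR this reuses GarbageCollectorThread's cleanup path to
        // trigger a RocksDB compaction over the locations index; stubbed here.
    }
}
```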

danpi commented Feb 20, 2025

I believe this issue is related to optimization #3653.

When deleteRange replaced the one-by-one deletion method, only a single delete action (a range tombstone) is written to RocksDB. If RocksDB is performing tiered compaction, that tombstone is not guaranteed to be picked up, so the covered SST files remain undeleted.

To resolve this issue, there are two options:

  1. Revert to the previous logic of iteratively deleting each key.
  2. Use compactRange to forcefully clean up levels L0 to L6.

I chose option 2: perform a compactRange synchronously when executing a major compaction (a sketch of the relevant RocksDB calls follows).
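To illustrate the difference, here is a small standalone sketch against the RocksDB Java API (the database path and key bounds are made up for the demo): deleteRange only records a range tombstone, while compactRange forces the affected files to be rewritten so obsolete SSTs can actually be removed.

```java
import org.rocksdb.Options;
import org.rocksdb.RocksDB;
import org.rocksdb.RocksDBException;

public class CompactRangeDemo {
    public static void main(String[] args) throws RocksDBException {
        RocksDB.loadLibrary();
        try (Options opts = new Options().setCreateIfMissing(true);
             RocksDB db = RocksDB.open(opts, "/tmp/locations-demo")) {
            byte[] begin = {0x00};
            byte[] end = {(byte) 0xff};
            // A single range tombstone covering [begin, end); cheap to write,
            // but under tiered compaction it may sit unprocessed for a long time.
            db.deleteRange(begin, end);
            // Forcing a compaction over the same range drops the dead data and
            // lets RocksDB delete the now-obsolete SST files.
            db.compactRange(begin, end);
        }
    }
}
```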

hangc0276 commented Feb 27, 2025


@danpi Great job! I think this is a bug in RocksDB. We can go with option 2. Please check the failed CI, thanks.

danpi commented Mar 6, 2025

@hangc0276 Addressed the feedback above. PTAL, thanks.

@@ -87,6 +87,10 @@ public class GarbageCollectorThread implements Runnable {
long majorCompactionMaxTimeMillis;
long lastMajorCompactionTime;

boolean enableEntryLocationCompaction = false;
Contributor review comment:
Remove this one and use entryLocationCompactionInterval > 0 instead?
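If that suggestion were adopted, the flag would be derived from the interval instead of stored separately; a hypothetical fragment (the config value shown is an assumption):

```java
// Hypothetical: interval <= 0 means "disabled", so no separate boolean is stored.
long entryLocationCompactionInterval = 86_400_000L; // assumed config value, in ms
boolean enableEntryLocationCompaction = entryLocationCompactionInterval > 0;
```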

@@ -489,6 +505,17 @@ public void runWithFlags(boolean force, boolean suspendMajor, boolean suspendMin
minorCompacting.set(false);
}
}
if (enableEntryLocationCompaction && (curTime - lastEntryLocationCompactionTime
Contributor review comment:

We'd better introduce a random factor here. If we rolling-restart the BookKeeper cluster and all the bookies start at the same time, they will all trigger RocksDB compaction simultaneously and impact read latency.

Contributor follow-up:

The random factor can be any value in [0, entryLocationCompactionInterval] (a sketch follows).
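A sketch of what such jitter could look like; ThreadLocalRandom is standard JDK API, while the variable names and the one-day interval are assumptions:

```java
import java.util.concurrent.ThreadLocalRandom;

public class CompactionJitterDemo {
    public static void main(String[] args) {
        long entryLocationCompactionIntervalMs = 86_400_000L; // assumption: 1 day
        // Random offset in [0, interval]; nextLong's bound is exclusive, hence +1.
        long randomCompactionDelayMs =
                ThreadLocalRandom.current().nextLong(entryLocationCompactionIntervalMs + 1);
        // Each bookie computes its own offset, so a fleet restarted together
        // spreads its first entry-location compactions across the interval.
        System.out.println("first compaction delayed by " + randomCompactionDelayMs + " ms");
    }
}
```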

hangc0276 commented:

@danpi Thanks for your contribution. I left more comments; please take a look. Thanks.


danpi commented Mar 10, 2025


@hangc0276 Added randomCompactionDelay to avoid all the bookies triggering compaction simultaneously. PTAL, thanks.

Successfully merging this pull request may close these issues.

[BUG] Large number of SST files in locations not being cleared after upgrade