Do not clear `pendingCompletionTaskGroups` in `clearAllocationInfo` by amaechler · Pull Request #18715 · apache/druid

amaechler · 2025-11-04T16:51:01Z

Description

Fixes a bug where the SeekableStream supervisor autoscaler creates duplicate history entries every minTriggerScaleActionFrequencyMillis (default 10min) during scale-down operations, causing database pollution and preventing scale-down from completing.

Lots of help from Claude.

Problem

When the autoscaler scales down tasks, clearAllocationInfo() prematurely clears pendingCompletionTaskGroups, causing the supervisor to "forget" about tasks transitioning from READING to PUBLISHING state. On the next supervisor cycle, these tasks are rediscovered and re-added to activelyReadingTaskGroups, triggering another scale-down attempt and creating a duplicate history entry. This repeats every minTriggerScaleActionFrequencyMillis (default: 10 minutes). I saw hundreds of duplicate history entries, with entries created at exact 10-minute intervals.

The root cause is that the autoscaler has a built-in safeguard (line 480-496) to skip scale actions when pendingCompletionTaskGroups is non-empty, but this check is ineffective because clearAllocationInfo() clears the map immediately after tasks were moved there.

Solution

Preserve pendingCompletionTaskGroups in clearAllocationInfo(). This allows the autoscaler's existing skip logic to function correctly, preventing duplicate scale attempts until tasks naturally complete (removed by checkPendingCompletionTasks() every supervisor cycle).

Release note

Fixed a bug in the SeekableStream supervisor autoscaler where scale-down operations would create duplicate supervisor history entries. The autoscaler now correctly waits for tasks to complete before attempting subsequent scale operations.

Key changed/added classes in this PR

SeekableStreamSupervisor - Modified clearAllocationInfo() to preserve pendingCompletionTaskGroups

This PR has:

been self-reviewed.
added documentation for new or modified features or behaviors.
a release note entry in the PR description.
added Javadocs for most classes and all non-trivial methods. Linked related entities via Javadoc links.
added or updated version, license, or notice information in licenses.yaml
added comments explaining the "why" and the intent of the code wherever would not be obvious for an unfamiliar reader.
added unit tests or modified existing tests to cover new code paths, ensuring the threshold for code coverage is met.
added integration tests.
been tested in a test Druid cluster.

kfaraz

Thanks for the fix, @amaechler !

I have left a couple of minor suggestions.
We might want to add an embedded test for this race condition. But that need not block this bugfix.

amaechler · 2025-11-06T22:29:17Z

Thanks @kfaraz for taking the time to review! I updated the wording a bit based on your suggestions. I'm not sure about how I could rewrite the test to be more high-level, so I kept the actual test for now.

kfaraz

Thanks for the fix, @amaechler !

kfaraz · 2025-11-07T14:22:24Z

    partitionOffsets.clear();

-    pendingCompletionTaskGroups.clear();
+    // Note: We intentionally do NOT clear pendingCompletionTaskGroups here.


Since the original line of code has already been removed, this comment seems out of place here. It has already been called out in the javadoc anyway.

kfaraz · 2025-11-07T14:23:50Z

I'm not sure about how I could rewrite the test to be more high-level, so I kept the actual test for now.

Fair enough, we can address that and add an embedded test in follow up PRs.

Changes: - Do not clear `pendingCompletionTaskGroups` in `clearAllocationInfo` - Add unit test

Do not clear pendingCompletionTaskGroups in clearAllocationInfo

3a87f9c

github-actions Bot added Area - Streaming Ingestion Area - Ingestion labels Nov 4, 2025

Merge branch 'master' into fix-autoscaler-duplicate-history

0f37af0

kfaraz reviewed Nov 6, 2025

View reviewed changes

Better descriptions

aec7de9

amaechler added 2 commits November 7, 2025 04:13

Whitespace

a4d06dc

Merge branch 'master' into fix-autoscaler-duplicate-history

0dd6ede

kfaraz approved these changes Nov 7, 2025

View reviewed changes

Remove comment

88c11ae

kfaraz merged commit aec5c4a into apache:master Nov 8, 2025
127 of 130 checks passed

santosh-d3vpl3x pushed a commit to santosh-d3vpl3x/druid that referenced this pull request Dec 13, 2025

Fix duplicate actions in auto-scaler history (apache#18715)

2959cde

Changes: - Do not clear `pendingCompletionTaskGroups` in `clearAllocationInfo` - Add unit test

kgyrtkirk added this to the 36.0.0 milestone Jan 19, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Do not clear `pendingCompletionTaskGroups` in `clearAllocationInfo`#18715

Do not clear `pendingCompletionTaskGroups` in `clearAllocationInfo`#18715
kfaraz merged 6 commits intoapache:masterfrom
amaechler:fix-autoscaler-duplicate-history

amaechler commented Nov 4, 2025

Uh oh!

kfaraz left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

amaechler commented Nov 6, 2025

Uh oh!

kfaraz left a comment

Uh oh!

kfaraz Nov 7, 2025

Uh oh!

kfaraz commented Nov 7, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

amaechler commented Nov 4, 2025

Description

Problem

Solution

Release note

Key changed/added classes in this PR

Uh oh!

kfaraz left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

amaechler commented Nov 6, 2025

Uh oh!

kfaraz left a comment

Choose a reason for hiding this comment

Uh oh!

kfaraz Nov 7, 2025

Choose a reason for hiding this comment

Uh oh!

kfaraz commented Nov 7, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants