Restrict segment metadata kill query till maxInterval from last kill task time #17770

chetanpatidar26 · 2025-03-03T05:30:11Z

Description

In The current setup in killUnusedSegments, the segmentsMetadataManager.getUnusedSegmentIntervals queries for unused segments from the last segment kill time to the durationToRetain. This results in a expensive SQL query in some cases, where huge amount of segments are created in less time. Although we have a config druid.coordinator.kill.maxInterval which is set to 30 days by default, we are not using it. This config can be used to restrict the query to scan for only 30 days of data.

Example -
At confluent we create segments with granularity of 5 minutes, our current coordinator kill task is working on deleting segments from April 2022 and we have set durationToRetain=720 days. So everytime findIntervalToKill method runs it queries for segment data of nearly 350 days, this resulted in huge spike in handoff time.

We updated the durationToRetain=1000 days thus it resulted in scanning data of 50 days and handoff time was in control again.

Proposed fix

In findIntervalToKill method, restricting the maxEndTime to the maximum Interval from the time last segment was killed.

case 1

When a datasource is created and the first segment from it is killed we will have minStartTime as null because it is initialised as the end time of last segment killed, in this scenario the maxEndTime will be durationToRetain.

case 2

After the first segment is killed from this datasource the minStartTime will be initialised, thus now the maxEndTime will be
minimum(durationToRetain, minStartTime + maxInterval)

Release note

This PR has:

…time

server/src/main/java/org/apache/druid/server/coordinator/duty/KillUnusedSegments.java

kfaraz

Thanks for the changes, @chetanpatidar26 !
The logic makes sense to me. I have left some suggestions.

server/src/main/java/org/apache/druid/server/coordinator/duty/KillUnusedSegments.java

server/src/test/java/org/apache/druid/server/coordinator/duty/KillUnusedSegmentsTest.java

kfaraz

Thanks for the contribution, @chetanpatidar26 !

chetanpatidar26 · 2025-03-04T08:00:16Z

Thanks for the review @kfaraz !
Thank you @rbankar7 for helping in fixing the issue.

Restrict segment metadata kill query till maxInterval from last kill …

00bf784

…time

chetanpatidar26 changed the title ~~Restrict segment metadata kill query till maxInterval from last kill …~~ Restrict segment metadata kill query till maxInterval from last kill task time Mar 3, 2025

fix variables

2ee9a84

chetanpatidar26 marked this pull request as ready for review March 3, 2025 06:55

kfaraz reviewed Mar 3, 2025

View reviewed changes

server/src/main/java/org/apache/druid/server/coordinator/duty/KillUnusedSegments.java Outdated Show resolved Hide resolved

fix variables

3a3a847

chetanpatidar26 requested a review from kfaraz March 3, 2025 13:20

kfaraz reviewed Mar 3, 2025

View reviewed changes

addressed comments

91a8921

chetanpatidar26 requested a review from kfaraz March 4, 2025 04:18

kfaraz approved these changes Mar 4, 2025

View reviewed changes

kfaraz merged commit cac8b9d into apache:master Mar 4, 2025
75 checks passed

This was referenced Mar 5, 2025

[OBSDATA-8872]Add maxInterval to kill config and make kill tasks efficient confluentinc/druid#309

Open

[OBSDATA-8872]Add maxInterval to kill config confluentinc/druid#308

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Restrict segment metadata kill query till maxInterval from last kill task time #17770

Restrict segment metadata kill query till maxInterval from last kill task time #17770

chetanpatidar26 commented Mar 3, 2025 •

edited

Loading

kfaraz left a comment

kfaraz left a comment

chetanpatidar26 commented Mar 4, 2025

Restrict segment metadata kill query till maxInterval from last kill task time #17770

Restrict segment metadata kill query till maxInterval from last kill task time #17770

Conversation

chetanpatidar26 commented Mar 3, 2025 • edited Loading

Description

Proposed fix

case 1

case 2

Release note

kfaraz left a comment

Choose a reason for hiding this comment

kfaraz left a comment

Choose a reason for hiding this comment

chetanpatidar26 commented Mar 4, 2025

chetanpatidar26 commented Mar 3, 2025 •

edited

Loading