Create a goroutine worker pool to send data from distributors to ingesters. #6406

alanprot · 2024-12-07T00:15:45Z

What this PR does:
Implement an option to use a goroutine worker pool to handle requests from the distributor to ingesters, rather than spawning a new goroutine for each request.

This PR was inspired on: grpc/grpc-go#3204

Looking at the distributor flame graph, we could see that lots of cpu was being used on the runtime.newstack call:

This problem increases with the RemoteWrite TPS and the number of ingesters in the cluster (as, for request and ingesters, we create a new go routine).

With this PR enabled, we could see a reduction on 20% of CPU on some cortex cluster with hundreds of ingesters.

We could not see any runtime.newstack on the cpu flamegraph after the option was enabled.

Which issue(s) this PR fixes:
Fixes #

Checklist

Tests updated
Documentation added
CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

alanprot · 2024-12-07T00:19:05Z

pkg/ring/batch.go

 	for _, i := range instances {
-		go func(i instance) {
+		e.Submit(func() {
 			err := callback(i.desc, i.indexes)
 			tracker.record(i, err)
 			wg.Done()
-		}(i)
+		})


The closure var should be fine now:

https://tip.golang.org/doc/go1.22

Previously, the variables declared by a “for” loop were created once and updated by each iteration. In Go 1.22, each iteration of the loop creates new variables, to avoid accidental sharing bugs. The transition support tooling described in the proposal continues to work in the same way it did in Go 1.21.

yeya24

Great work.
Is it worth a changelog? And experimental feature?

pkg/distributor/distributor.go

Signed-off-by: alanprot <[email protected]>

alanprot · 2024-12-10T00:23:21Z

Great work. Is it worth a changelog? And experimental feature?

I forgot about changelog. :P

Yeah.. lemme mark as experimental.

Signed-off-by: alanprot <[email protected]>

pull-request-size bot added the size/L label Dec 7, 2024

alanprot commented Dec 7, 2024

View reviewed changes

alanprot force-pushed the distributor-worker-pool branch from bb63874 to 2a0e956 Compare December 7, 2024 00:20

alanprot marked this pull request as ready for review December 7, 2024 00:22

dosubot bot added component/distributor type/performance labels Dec 7, 2024

alanprot force-pushed the distributor-worker-pool branch 2 times, most recently from ea066bd to 3c5ff63 Compare December 7, 2024 00:35

yeya24 approved these changes Dec 9, 2024

View reviewed changes

pkg/distributor/distributor.go Outdated Show resolved Hide resolved

alanprot added 2 commits December 9, 2024 16:22

Creating a worker pool to be used on distributors

45db5fd

Signed-off-by: alanprot <[email protected]>

metric + test

4bbfa0c

Signed-off-by: alanprot <[email protected]>

alanprot force-pushed the distributor-worker-pool branch from 3c5ff63 to 4bbfa0c Compare December 10, 2024 00:22

Changelog

ff626a5

Signed-off-by: alanprot <[email protected]>

CharlieTLe approved these changes Dec 10, 2024

View reviewed changes

dosubot bot added the lgtm This PR has been approved by a maintainer label Dec 10, 2024

alanprot merged commit 5d2dac0 into cortexproject:master Dec 10, 2024
16 checks passed

alanprot deleted the distributor-worker-pool branch December 10, 2024 01:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create a goroutine worker pool to send data from distributors to ingesters. #6406

Create a goroutine worker pool to send data from distributors to ingesters. #6406

alanprot commented Dec 7, 2024 •

edited

Loading

alanprot Dec 7, 2024

yeya24 left a comment

alanprot commented Dec 10, 2024

Create a goroutine worker pool to send data from distributors to ingesters. #6406

Create a goroutine worker pool to send data from distributors to ingesters. #6406

Conversation

alanprot commented Dec 7, 2024 • edited Loading

alanprot Dec 7, 2024

Choose a reason for hiding this comment

yeya24 left a comment

Choose a reason for hiding this comment

alanprot commented Dec 10, 2024

alanprot commented Dec 7, 2024 •

edited

Loading