Weight-class configuration for LeastWeighted target allocation strategy #4704

Open: suprjinx wants to merge 24 commits into open-telemetry:main from suprjinx:target-allocation-weight-classes

Conversation

@suprjinx (Contributor) commented Feb 16, 2026

Description:

This PR implements weight-based target allocation for the least-weighted strategy, so that heavier targets are spread across collectors rather than piling up on one collector. In the absence of weight information, the strategy falls back to the existing NumTargets balancing.

How to assign weights to targets

Add the opentelemetry.io/target-allocation-weight annotation to the target pod with an integer value from 1 to 100:

metadata:
  annotations:
    opentelemetry.io/target-allocation-weight: "10"

For pods you don't control (e.g. kube-state-metrics, nginx-ingress), you can patch the deployment's pod template:

kubectl patch deployment nginx-ingress -p '{"spec":{"template":{"metadata":{"annotations":{"opentelemetry.io/target-allocation-weight":"10"}}}}}'

Kubernetes SD automatically surfaces this as the meta label __meta_kubernetes_pod_annotation_opentelemetry_io_target_allocation_weight, which the allocator reads directly.

Weight resolution

  1. Pod annotation value (integer 1-100) — if present and valid
  2. Default: 1

Values outside the 1-100 range, non-numeric values, and missing annotations all fall back to the default weight of 1.
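The fallback rules above can be sketched as a small Go helper. This is illustrative only -- the constant and function names here are assumptions, not the operator's actual identifiers:

```go
package main

import (
	"fmt"
	"strconv"
)

// weightLabel is the meta label Kubernetes SD derives from the
// opentelemetry.io/target-allocation-weight pod annotation.
const weightLabel = "__meta_kubernetes_pod_annotation_opentelemetry_io_target_allocation_weight"

const defaultWeight = 1

// resolveWeight applies the resolution rules: a valid integer in [1, 100]
// wins; anything else (missing, non-numeric, out of range) defaults to 1.
func resolveWeight(labels map[string]string) int {
	raw, ok := labels[weightLabel]
	if !ok {
		return defaultWeight
	}
	w, err := strconv.Atoi(raw)
	if err != nil || w < 1 || w > 100 {
		return defaultWeight
	}
	return w
}

func main() {
	fmt.Println(resolveWeight(map[string]string{weightLabel: "10"}))  // 10
	fmt.Println(resolveWeight(map[string]string{weightLabel: "999"})) // 1 (out of range)
	fmt.Println(resolveWeight(map[string]string{}))                   // 1 (missing)
}
```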

How it works

The least-weighted strategy tracks WeightedLoad (sum of weights) per collector instead of just target count. When assigning a new target, it picks the collector with the lowest WeightedLoad. This ensures heavy targets get spread across collectors.
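A minimal sketch of that selection logic, under the assumption stated above (the `collector` type and `assign` function are illustrative, not the operator's actual code):

```go
package main

import "fmt"

// collector tracks the running sum of target weights assigned to it,
// mirroring the WeightedLoad idea described in this PR.
type collector struct {
	name         string
	weightedLoad int
}

// assign picks the collector with the lowest WeightedLoad and adds the
// target's weight to it, so heavy targets spread across collectors.
func assign(cols []*collector, weight int) *collector {
	best := cols[0]
	for _, c := range cols[1:] {
		if c.weightedLoad < best.weightedLoad {
			best = c
		}
	}
	best.weightedLoad += weight
	return best
}

func main() {
	cols := []*collector{{name: "col-0"}, {name: "col-1"}}
	// One heavy target (weight 10) followed by four light ones (weight 1):
	// the heavy target lands on col-0, and the light ones go to col-1
	// until the loads even out.
	for _, w := range []int{10, 1, 1, 1, 1} {
		c := assign(cols, w)
		fmt.Printf("weight %d -> %s\n", w, c.name)
	}
	fmt.Printf("loads: %s=%d %s=%d\n",
		cols[0].name, cols[0].weightedLoad, cols[1].name, cols[1].weightedLoad)
}
```

With plain target counting, the heavy target's collector would keep receiving new targets in round-robin fashion; weighting it at 10 keeps light targets away until the other collectors catch up.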

Link to tracking Issue(s):

Testing:
Unit tests covering: weighted load balancing, unlabeled targets defaulting to weight 1, invalid annotations (non-numeric, zero, negative, >100), and balanced distribution of heavy targets.

Documentation:
Needed

PR Assisted by Claude Code

tItem.JobName,
tItem.TargetURL,
tItem.Labels,
builder.Labels(),
suprjinx (Contributor Author):

this is a significant change -- the filtered labels were not persisted to the item previously

Contributor:

I understand the goal of this PR, but this change could indeed be decently dangerous. I have to spend some time thinking through the consequences of this. cc @swiatekm

Contributor:

There's no way we can do this, it's a major breaking change. My original thought was that we were going to use a Pod annotation, because we get those straight from K8s service discovery, and don't need relabeling.

suprjinx (Contributor Author):

> There's no way we can do this, it's a major breaking change. My original thought was that we were going to use a Pod annotation, because we get those straight from K8s service discovery, and don't need relabeling.

Thanks @swiatekm @jaronoff97 -- I will remove the relabeling. I can use a pod annotation, but I think it will also require a new config option for the TA (in order to supply a weight config for Pods we don't control).

suprjinx (Contributor Author):

Ready for another look, with TA CR changes for WeightOverrides

Signed-off-by: Geoff Wilson <geoff@gr-oss.io>

github-actions bot commented Feb 17, 2026

E2E Test Results

 34 files ±0   247 suites ±0   3h 0m 45s ⏱️ +43m 44s
 95 tests ±0    94 ✅ −1   0 💤 ±0   1 ❌ +1
251 runs  ±0   250 ✅ −1   0 💤 ±0   1 ❌ +1

For more details on these failures, see this check.

Results for commit d13f23b. ± Comparison against base commit ebb6ab9.

♻️ This comment has been updated with latest results.

defaults. Also now uses "__target_allocation_weight".

Signed-off-by: Geoff Wilson <geoff@gr-oss.io>
Signed-off-by: Geoff Wilson <geoff@gr-oss.io>
@suprjinx suprjinx marked this pull request as ready for review February 17, 2026 18:26
@suprjinx suprjinx requested a review from a team as a code owner February 17, 2026 18:26
@swiatekm (Contributor):

The changes in this PR look broadly correct, but do more than I'm comfortable with without further discussion about the API and how this feature should work in the long term. Originally, the problem we wanted solved was that jobs of a particular class should be evenly spread amongst collectors. The simple solution I envisioned for this was to set the class as a Pod annotation or label and let the strategy use that information, the same way it currently does with the job name.

Here, we're now adding CR-level overrides and tracking weight as if it's a meaningful number. I'm not completely against doing that, but it requires a more careful design, especially if we're going to make it configurable at the CR level.

@suprjinx do you think we can cut this down to a more basic version while still keeping it useful for you?

@suprjinx (Contributor Author):

> The changes in this PR look broadly correct, but do more than I'm comfortable with without further discussion about the API and how this feature should work in the long term. Originally, the problem we wanted solved was that jobs of a particular class should be evenly spread amongst collectors. The simple solution I envisioned for this was to set the class as a Pod annotation or label and let the strategy use that information, the same way it currently does with the job name.
>
> Here, we're now adding CR-level overrides and tracking weight as if it's a meaningful number. I'm not completely against doing that, but it requires a more careful design, especially if we're going to make it configurable at the CR level.
>
> @suprjinx do you think we can cut this down to a more basic version while still keeping it useful for you?

@swiatekm so you'd suggest consuming an annotation, configured in podSpec, but no ability to configure weight for kube-state-metrics etc? I think the latter is where we see the greatest imbalance in load on collectors.

@swiatekm (Contributor):

> > The changes in this PR look broadly correct, but do more than I'm comfortable with without further discussion about the API and how this feature should work in the long term. Originally, the problem we wanted solved was that jobs of a particular class should be evenly spread amongst collectors. The simple solution I envisioned for this was to set the class as a Pod annotation or label and let the strategy use that information, the same way it currently does with the job name.
> >
> > Here, we're now adding CR-level overrides and tracking weight as if it's a meaningful number. I'm not completely against doing that, but it requires a more careful design, especially if we're going to make it configurable at the CR level.
> >
> > @suprjinx do you think we can cut this down to a more basic version while still keeping it useful for you?
>
> @swiatekm so you'd suggest consuming an annotation, configured in podSpec, but no ability to configure weight for kube-state-metrics etc? I think the latter is where we see the greatest imbalance in load on collectors.

If what you're actually trying to do is balance based on a statically assigned numeric load, so whichever collector gets kube-state-metrics doesn't get a lot of other targets, then I'm not sure we want to do that. It's a fairly specific use case and something of an anti-pattern for Prometheus in general - you should try to avoid having massive targets. I don't think an unreasonable suggestion would be to either run a separate collector instance just for kube-state-metrics or shard it, either as a StatefulSet or Daemonset. Is that not an option for you?

just expect a weight int in Pod annotation

Signed-off-by: Geoff Wilson <geoff@gr-oss.io>
@suprjinx (Contributor Author):

> > > The changes in this PR look broadly correct, but do more than I'm comfortable with without further discussion about the API and how this feature should work in the long term. Originally, the problem we wanted solved was that jobs of a particular class should be evenly spread amongst collectors. The simple solution I envisioned for this was to set the class as a Pod annotation or label and let the strategy use that information, the same way it currently does with the job name.
> > >
> > > Here, we're now adding CR-level overrides and tracking weight as if it's a meaningful number. I'm not completely against doing that, but it requires a more careful design, especially if we're going to make it configurable at the CR level.
> > >
> > > @suprjinx do you think we can cut this down to a more basic version while still keeping it useful for you?
> >
> > @swiatekm so you'd suggest consuming an annotation, configured in podSpec, but no ability to configure weight for kube-state-metrics etc? I think the latter is where we see the greatest imbalance in load on collectors.
>
> If what you're actually trying to do is balance based on a statically assigned numeric load, so whichever collector gets kube-state-metrics doesn't get a lot of other targets, then I'm not sure we want to do that. It's a fairly specific use case and something of an anti-pattern for Prometheus in general - you should try to avoid having massive targets. I don't think an unreasonable suggestion would be to either run a separate collector instance just for kube-state-metrics or shard it, either as a StatefulSet or Daemonset. Is that not an option for you?

@swiatekm I removed all the API changes so this is now just an optional annotation. Also, now expecting a numeric weight (1-100) rather than a class name.

suprjinx added 2 commits March 5, 2026 12:53
…uprjinx/opentelemetry-operator into target-allocation-weight-classes
Signed-off-by: Geoff Wilson <geoff@gr-oss.io>
@swiatekm swiatekm added the discuss-at-sig This issue or PR should be discussed at the next SIG meeting label Mar 11, 2026