fix: reorder histogram samples in multiprocess prometheus output by saivedant169 · Pull Request #5570 · bentoml/BentoML

saivedant169 · 2026-03-14T21:12:44Z

What this PR does

BentoML's /metrics endpoint produces histogram metrics in the wrong sample order when running in multiprocess mode. The _sum and _count lines appear before _bucket entries, which violates the Prometheus exposition text format and breaks spec-compliant parsers like fluent-bit's prometheus_scrape.

Before (broken):

# TYPE http_request_duration_seconds histogram
http_request_duration_seconds_sum{...} 0.05
http_request_duration_seconds_count{...} 1
http_request_duration_seconds_bucket{le="0.005",...} 0
http_request_duration_seconds_bucket{le="0.01",...} 0
http_request_duration_seconds_bucket{le="+Inf",...} 1

After (correct):

# TYPE http_request_duration_seconds histogram
http_request_duration_seconds_bucket{le="0.005",...} 0
http_request_duration_seconds_bucket{le="0.01",...} 0
http_request_duration_seconds_bucket{le="+Inf",...} 1
http_request_duration_seconds_count{...} 1
http_request_duration_seconds_sum{...} 0.05

Root cause

prometheus_client's MultiProcessCollector._accumulate_metrics() processes _sum/_count samples before _bucket entries (buckets go through a separate accumulation pass), and Python's dict insertion order makes them appear first in the output. Single-process mode doesn't have this issue because the underlying metric objects maintain correct sample order.

The fix

After MultiProcessCollector collects metrics, sort histogram samples by:

Non-le labels (to preserve label-set grouping)
Suffix order: _bucket → _count → _sum
le value (ascending) within buckets

This only applies in multiprocess mode since the issue doesn't affect single-process collection.

Coordination

Commented on #5386 here with root cause analysis.

MultiProcessCollector inserts _sum/_count samples before _bucket entries due to dict insertion order in its accumulation logic. This violates the Prometheus exposition text format spec and breaks parsers like fluent-bit's prometheus_scrape. Sort histogram samples after collection so _bucket entries (ascending le) come before _count and _sum, grouped by label set. Fixes bentoml#5386

frostming · 2026-03-25T00:27:26Z

Thank you

saivedant169 requested a review from a team as a code owner March 14, 2026 21:12

saivedant169 requested review from jianshen92 and removed request for a team March 14, 2026 21:12

frostming approved these changes Mar 24, 2026

View reviewed changes

frostming merged commit 0772581 into bentoml:main Mar 25, 2026
49 of 51 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: reorder histogram samples in multiprocess prometheus output#5570

fix: reorder histogram samples in multiprocess prometheus output#5570
frostming merged 1 commit into
bentoml:mainfrom
saivedant169:fix/prometheus-histogram-ordering

saivedant169 commented Mar 14, 2026

Uh oh!

Uh oh!

frostming commented Mar 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

saivedant169 commented Mar 14, 2026

What this PR does

Root cause

The fix

Coordination

Uh oh!

Uh oh!

frostming commented Mar 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants