ROX-33370: Error budget exhaustion alert is fired for a small downtime #335

Merged
kovayur merged 1 commit into master from yury/ROX-33370-fix-error-budget-exhaustion-alerts on Mar 10, 2026

Conversation

Contributor

kovayur commented Mar 9, 2026

Problem: The error budget exhaustion alert fires for a small downtime once the budget is already exhausted.

Changes:

  • For the error budget exhaustion alerts, replace the min_over_time(central:sli:availability[1h]) condition with central:slo:availability:burnrate1h >= 15, so that the alerts tolerate short downtimes caused by pod restarts (for example, during cluster or operator upgrades).
  • For the error burn rate alert, remove the min_over_time(central:sli:availability[1h]) condition. It is essentially equivalent to central:slo:availability:burnrate1h > 0, which always holds when the alert's other condition (burnrate1h > 30) holds, so it is redundant.
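
To illustrate why a burn-rate threshold tolerates a short restart while a min_over_time check does not, here is a rough Python sketch. It is not taken from the PR: the 99.5% SLO target, the one-sample-per-minute 0/1 availability series, and the `< 1` comparison for the old condition are all assumptions made for the example. Only the thresholds 15 and 30 come from the description above.

```python
# Illustrative only: SLO_TARGET, the sample layout, and the "< 1" comparison
# are assumptions, not values taken from the PR.

SLO_TARGET = 0.995  # hypothetical availability target


def burn_rate(samples, slo_target=SLO_TARGET):
    """Burn rate over a window: observed error ratio / allowed error ratio."""
    error_ratio = 1.0 - sum(samples) / len(samples)
    return error_ratio / (1.0 - slo_target)


def min_over_time(samples):
    """Rough stand-in for PromQL min_over_time over the same window."""
    return min(samples)


# One sample per minute for 1h; a pod restart makes 2 samples unavailable.
short_restart = [1.0] * 58 + [0.0] * 2

old_condition = min_over_time(short_restart) < 1  # hypothetical old check
new_condition = burn_rate(short_restart) >= 15    # threshold from the PR

print(old_condition)  # True: any dip in the window trips the old check
print(new_condition)  # False: a 2-minute dip burns well below 15x

# A longer outage: burn rate above 30 already implies the window had a dip,
# so an additional min_over_time check adds nothing.
long_outage = [1.0] * 50 + [0.0] * 10
redundant = burn_rate(long_outage) > 30 and min_over_time(long_outage) < 1
```

Under these assumed numbers, a 2-minute restart yields a burn rate of about 6.7, so the new condition stays quiet, whereas the old check trips on any sample below 1. The real targets and series shapes depend on the recording rules in the repository.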

kovayur requested a review from a team as a code owner on March 9, 2026 17:28
kovayur requested review from ebensh and johannes94 and removed request for a team and ebensh on March 9, 2026 17:28
kovayur changed the title from "ROX-33370: Error budget exhaustion alert is fired for a small downtime after the budget is already exhausted" to "ROX-33370: Error budget exhaustion alert is fired for a small downtime" on Mar 9, 2026
kovayur merged commit 4648a91 into master on Mar 10, 2026
1 check passed
kovayur deleted the yury/ROX-33370-fix-error-budget-exhaustion-alerts branch on March 10, 2026 11:04
2 participants