Skip to content

fix(monitors): aggregate service status across active monitors#3

Open
mark-mckee wants to merge 1 commit into
lanbugs:mainfrom
mark-mckee:upstream/aggregation-clean
Open

fix(monitors): aggregate service status across active monitors#3
mark-mckee wants to merge 1 commit into
lanbugs:mainfrom
mark-mckee:upstream/aggregation-clean

Conversation

@mark-mckee
Copy link
Copy Markdown

When multiple monitors are attached to the same Service, _apply_service_status currently overwrites whatever the last check-in produced. A recovery in one monitor masks an active outage on another.

This adds _compute_service_status which computes the worst (per _STATUS_PRIORITY) across all active monitors on the service, treating the just-completed monitor's candidate as that monitor's contribution (its last_status may not be persisted yet).

Service status is updated only when the aggregated state actually changes, so single-monitor services are unaffected. Maintenance mode and admin/API overrides also untouched.

When multiple monitors are attached to the same Service,
_apply_service_status currently overwrites whatever the last check-in
produced. A recovery in one monitor masks an active outage on another.

This adds _compute_service_status which computes the worst (per
_STATUS_PRIORITY) across all active monitors on the service, treating
the just-completed monitor's candidate as that monitor's contribution
(its last_status may not be persisted yet).

Service status is updated only when the aggregated state actually
changes, so single-monitor services are unaffected. Maintenance mode
and admin/API overrides also untouched.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant