feat(eval): composable quality gates with auto-remediation triggers #334

@christso

Status Update (2026-04-05)

This issue remains valid as a core evaluation/CLI capability.

The earlier Studio expansion is no longer the default plan. Quality enforcement today is primarily handled by AgentV eval thresholds + GitHub Actions CI, so the dashboard-management portion of this issue should be treated as optional follow-up work, not part of the critical path.

Revised Scope

In scope

  • severity: error | warning | info on evaluator configs
  • non-blocking warnings / informational outcomes in result data
  • optional remediation hooks if they are clearly useful in CLI/automation workflows
  • reusable gate patterns via code graders / scripts / packages
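
As a rough illustration of the severity semantics listed above, here is a minimal sketch of how failed gates might be split into blocking and non-blocking outcomes. It is not AgentV's implementation: `Severity`, `GateResult`, `summarize`, and the result-dictionary keys are hypothetical names chosen for this example, and defaulting to `error` is an assumption.

```python
from dataclasses import dataclass
from enum import Enum


class Severity(str, Enum):
    """Hypothetical severity levels mirroring error | warning | info."""
    ERROR = "error"
    WARNING = "warning"
    INFO = "info"


@dataclass
class GateResult:
    """Hypothetical outcome of one evaluator with a pass threshold."""
    name: str
    score: float
    threshold: float
    # Assumption: a gate blocks unless its severity is explicitly lowered.
    severity: Severity = Severity.ERROR

    @property
    def passed(self) -> bool:
        return self.score >= self.threshold


def summarize(results: list[GateResult]) -> dict:
    """Group failed gates by severity; only error-severity failures block."""
    failed = [r for r in results if not r.passed]
    errors = [r.name for r in failed if r.severity is Severity.ERROR]
    warnings = [r.name for r in failed if r.severity is Severity.WARNING]
    infos = [r.name for r in failed if r.severity is Severity.INFO]
    # Warnings and info stay visible in the result data but do not flip "ok".
    return {"ok": not errors, "errors": errors, "warnings": warnings, "info": infos}
```

In this sketch a failing `warning` or `info` gate is reported in the summary while `ok` stays true, and only a failing `error` gate marks the run as blocked.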

Deprioritized

  • Dashboard-driven gate CRUD
  • Visual threshold editor in Studio
  • Alert routing / remediation center in Studio
  • Gate compliance dashboards in Studio

Notes

If Studio ever needs a UI for this later, it should come after the core primitives exist and only if CI + CLI workflows prove insufficient.

Acceptance Signals

  • severity: error | warning | info works on evaluator configs
  • warnings are visible in output/result data without failing the eval
  • errors continue to block the eval as today
  • any remediation hook design is usable from CLI/automation flows
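
One way the last signal could look from an automation script is sketched below, purely as an assumption about a possible design. `process_failures`, the `(gate, severity)` tuple shape, and the hook mapping are hypothetical; the issue does not define a hook interface.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical shapes: a failure is (gate name, severity string),
# and a hook is any callable that receives the failed gate's name.
Failure = Tuple[str, str]
Hook = Callable[[str], None]


def process_failures(failures: List[Failure], hooks: Dict[str, Hook]) -> int:
    """Run the hook registered for each failed gate and return a CLI-style exit code.

    Returns 1 if any failure has severity "error", otherwise 0, so warnings
    and info can trigger remediation without failing the run.
    """
    exit_code = 0
    for gate, severity in failures:
        hook = hooks.get(gate)
        if hook is not None:
            hook(gate)  # e.g. open a ticket or queue a re-run
        if severity == "error":
            exit_code = 1
    return exit_code
```

A CI step could pass the returned value to `sys.exit`, which keeps the hook logic usable from plain scripts without any dashboard.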

Metadata

  • Assignees: No one assigned
  • Labels: enhancement (New feature or request)
  • Type: No type
  • Project status: Done
  • Milestone: No milestone
  • Relationships: None yet
  • Development: No branches or pull requests