Skip to content

Workflow Health Dashboard - 2026-03-09 #20171

@github-actions

Description

@github-actions

Overview

Metric Value
Total executable workflows 166 (stable)
Compiled with lock files 166/166 (100% ✅)
Outdated lock files 0 ✅ (13 with 0s diffs are checkout artifacts, same false-positive as Mar 8)
Healthy ~153 (92%)
Critical/Failing (P1) 6 workflows
Overall health score 72/100 (↓4 from 76 — P2 failure spike today)

⚠️ DEGRADED — Lockdown failures week 5+ + OpenAI restriction day 12. Previous dashboard #20036 expired at 07:29Z today.


Critical Issues 🚨

P1: Lockdown Token Missing (4 workflows, ongoing week 5+)

All 4 workflows require GH_AW_GITHUB_TOKEN which is not provisioned. All fix paths closed (#17414, #17807 both CLOSED "not_planned"). No current fix path — manual admin intervention required.

P1: AI Moderator — Day 12 OpenAI Restriction

P1: Smoke Codex — Day 12 OpenAI Restriction


New Failures (P2) 📋

8 new auto-generated failure issues (Mar 8–9)
Issue Workflow Pattern
#20158 Agent Container Smoke Test ⚠️ No safe outputs generated
#20156 Duplicate Code Detector Standard failure
#20154 Multi-Device Docs Tester ⚠️ No safe outputs generated
#20153 GPL Dependency Cleaner (gpclean) Standard failure
#20152 Agent Persona Explorer Pre-agent failure
#20142 Smoke Update Cross-Repo PR Pre-agent failure
#20102 Security Alert Burndown Pre-agent failure
#20046 Daily Code Metrics ⚠️ Repo-memory push fail

Also: #20037 (Workflow Health Manager itself) — repo-memory push fail on previous run.

Notable patterns:

  • 2 workflows showing "No Safe Outputs Generated" — possible safe-output infrastructure issue
  • 2 workflows with repo-memory push failures — memory size limit enforcement issue
  • Several pre-agent failures — investigate if these are recurring or one-off

Issue Tracking Summary

Workflow Status Tracking Issue
Issue Monster ❌ Failing Auto-generates its own issues
PR Triage Agent ❌ Failing None (expired)
Daily Issues Report ❌ Failing None (expired)
Org Health Report ❌ Failing None (never had one)
AI Moderator ❌ Partial #20113 ✅ OPEN
Smoke Codex ❌ Failing #19514 ✅ OPEN (exp Mar 11)

Compilation Health ✅

All 166 workflows have .lock.yml files. The 13 detected "outdated" lock files all show 0-second timestamp differences — confirmed filesystem checkout artifacts (same false-positive pattern as seen Mar 8).


Healthy Workflows ✅

Key healthy workflows

Systemic Issues

Lockdown Token (GH_AW_GITHUB_TOKEN) — Week 5+

  • Pattern: Chronic failure, all fix paths declined
  • Recommendation: Accept as known failure or escalate to admin

OpenAI Cybersecurity Restriction — Day 12

Repo-Memory Push Failures — NEW

  • Affected: Workflow Health Manager, Daily Code Metrics (at minimum)
  • Pattern: push_repo_memory validation tool appears to include .git directory objects in size calculation, making the configured 10KB limit unachievable
  • Impact: Memory not persisted between runs; reduces coordination between meta-orchestrators

Health Trends

Date Score Key Change
2026-03-01 73/100 Metrics Collector regression
2026-03-03 76/100 Metrics Collector recovered
2026-03-07 74/100 False positive: 12 "outdated" locks
2026-03-08 76/100 Corrected false positive; all locks current
2026-03-09 72/100 P2 spike: 8 new failure issues

Recommendations

High Priority

  1. Lockdown workflows ([P1] Lockdown mode failing: GH_AW_GITHUB_TOKEN not configured — 5 workflows affected #17414, [q] fix(workflows): remove explicit lockdown:true to stop recurring failures #17807 closed) — requires admin escalation; 4 workflows failing indefinitely
  2. OpenAI restriction — Smoke Codex/AI Moderator Day 12; escalate model switch in [aw] Smoke Codex failed (pre-agent) #19514
  3. Repo-memory push validation bugpush_repo_memory counts .git objects, making limit unachievable; needs fix

Medium Priority

  1. Investigate "No Safe Outputs Generated" pattern (Agent Container, Multi-Device Docs)
  2. Monitor new P2 failures — if recurring, create dedicated tracking issues

Actions Taken This Run

  • ✅ Verified 166/166 workflows compiled
  • ✅ Confirmed 0 real outdated lock files (13 false positives)
  • ✅ Confirmed P1 status: all 6 workflows still failing
  • ✅ Identified 8 new P2 failures (auto-tracked by issue-monster)
  • ✅ Identified repo-memory push validation issue
  • ✅ Created this dashboard (replacing Workflow Health Dashboard - 2026-03-08 #20036 which expired 07:29Z)

References:

  • expires on Mar 10, 2026, 7:33 AM UTC

Generated by Workflow Health Manager - Meta-Orchestrator ·

  • expires on Mar 10, 2026, 7:44 AM UTC

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions