📊 Workflow Health Dashboard - 2026-01-04

# Workflow Health Dashboard - 2026-01-04

## Overview

- **Total Workflows**: 128
- **Healthy**: Unable to determine (no metrics data) 🔴
- **Warning**: 10 (7.8%) - Outdated lock files ⚠️
- **Critical**: 2 issues identified 🚨
- **Inactive**: Unknown (no metrics data)

## Critical Issues 🚨

### Issue 1: Metrics Collection System Down
- **Status**: No execution metrics available
- **Error**: `/tmp/gh-aw/repo-memory-default/memory/default/metrics/latest.json` does not exist
- **Impact**: Cannot monitor workflow health, success rates, or failure patterns
- **Root Cause**: The `metrics-collector.md` workflow has an outdated lock file
- **Action**: Issue created for enabling metrics collection
- **Priority**: P0
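
The fallback to structural-only analysis described in this issue can be sketched as a guarded load of the metrics file. This is a minimal illustration, not the Workflow Health Manager's actual code: the function name `load_latest_metrics` is hypothetical, and the assumption that `latest.json` holds a JSON object at the top level is not confirmed by this report.

```python
import json
from pathlib import Path
from typing import Optional


def load_latest_metrics(path: Path) -> Optional[dict]:
    """Return the parsed metrics object, or None if it is missing or unusable.

    Assumption: the metrics file contains a single JSON object.
    """
    try:
        data = json.loads(path.read_text(encoding="utf-8"))
    except (FileNotFoundError, json.JSONDecodeError):
        # Missing or corrupt file: the caller should fall back to structural checks.
        return None
    return data if isinstance(data, dict) else None
```

A caller would treat a `None` result as "no execution metrics available" and report only structural health, as this dashboard does.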

### Issue 2: 10 Workflows with Outdated Lock Files (7.8%)
- **Status**: Source `.md` files modified after `.lock.yml` compilation
- **Impact**: Runtime behavior may not match source code
- **Affected Workflows**:
  - `smoke-copilot-playwright.md`
  - `go-fan.md`
  - `stale-repo-identifier.md`
  - `duplicate-code-detector.md`
  - `copilot-pr-nlp-analysis.md`
  - `smoke-srt.md`
  - `github-mcp-structural-analysis.md`
  - `metrics-collector.md` ⚠️ Critical
  - `incident-response.md`
  - `layout-spec-maintainer.md`
- **Action**: Issue created for recompilation
- **Priority**: P0
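
The staleness condition above (source `.md` modified after its `.lock.yml` was compiled) can be approximated by comparing file modification times. This is a sketch under stated assumptions: the function name `find_outdated_lock_files` is hypothetical, the report does not say how staleness is actually detected, and filesystem mtimes are often reset by a fresh `git clone`, so a real check may need commit timestamps or content hashes instead.

```python
from pathlib import Path


def find_outdated_lock_files(workflow_dir: Path) -> list[str]:
    """Return names of .md sources that are newer than their .lock.yml file."""
    outdated = []
    for source in sorted(workflow_dir.glob("*.md")):
        # e.g. go-fan.md -> go-fan.lock.yml
        lock = source.with_name(source.stem + ".lock.yml")
        if not lock.exists():
            # A missing lock file is a coverage gap, not staleness; skip it here.
            continue
        if source.stat().st_mtime > lock.stat().st_mtime:
            outdated.append(source.name)
    return outdated
```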

## Structural Health Analysis ✅

Since execution metrics are unavailable, this assessment focuses on structural health:

### Compilation Coverage
- ✅ **100% coverage**: All 128 workflows have `.lock.yml` files
- ⚠️ **7.8% outdated**: 10 workflows need recompilation

### Engine Distribution
| Engine | Count | Percentage | Status |
|--------|-------|------------|--------|
| Copilot | 69 | 53.9% | ✅ Healthy diversity |
| Claude | 25 | 19.5% | ✅ Good alternative |
| Codex | 7 | 5.5% | ✅ Specialized use |
| Other | 27 | 21.1% | ⚠️ Needs classification |

**Analysis**: Spreading workflows across several engines reduces reliance on any single one, although Copilot still runs more than half of them (53.9%). Copilot as the primary engine is appropriate for GitHub integration.
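
The percentages in the engine table follow directly from the counts and the 128-workflow total. The helper below is a small sketch for reproducing them; the name `share_table` is illustrative and not part of any existing tooling.

```python
def share_table(counts: dict[str, int]) -> dict[str, float]:
    """Convert raw counts into percentage shares, rounded to one decimal."""
    total = sum(counts.values())
    return {name: round(100 * n / total, 1) for name, n in counts.items()}


# Counts taken from the Engine Distribution table.
engines = {"Copilot": 69, "Claude": 25, "Codex": 7, "Other": 27}
shares = share_table(engines)
```

The same helper gives the compilation health figure: 118 up-to-date workflows against 10 outdated ones yields 92.2% and 7.8%.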

### Workflow Categories
| Category | Count | Notes |
|----------|-------|-------|
| Campaign Workflows | 2 | Campaign orchestration |
| Smoke Tests | 10 | Testing infrastructure |
| Daily Scheduled | 17 | Regular maintenance |
| Weekly Scheduled | 1 | Long-term analysis |
| Hourly Scheduled | 1 | High-frequency monitoring |
| Event-Triggered | ~97 | Majority of workflows |

**Analysis**: Event-triggered workflows dominate (~97 of 128). The 19 scheduled workflows are spread across hourly, daily, and weekly cadences, which reduces resource contention.

### Tool Usage
| Tool | Workflows | Coverage | Status |
|------|-----------|----------|--------|
| GitHub MCP | 94 | 73% | ✅ Excellent adoption |
| Playwright | 11 | 9% | ✅ Appropriate for UI testing |
| Fetch | 8 | 6% | ✅ Web content retrieval |

**Analysis**: Heavy GitHub MCP usage is expected and healthy for repository operations.

## Systemic Patterns

### Positive Indicators ✅
1. **Complete compilation coverage**: All workflows have lock files
2. **Strong naming conventions**: Clear categorization (daily-*, smoke-*, etc.)
3. **Engine diversity**: Multiple engines prevent vendor lock-in
4. **Standardized tooling**: Widespread GitHub MCP adoption
5. **No orphaned lock files**: Clean 1:1 mapping between source and compiled files

### Areas of Concern ⚠️
1. **Meta-monitoring gap**: The metrics collector's own lock file is outdated
2. **No execution visibility**: Runtime health cannot be assessed
3. **Missing metrics infrastructure**: At least 7 days of data are needed before trends can be reported
4. **Safe outputs visibility**: Frontmatter declarations appear missing

## Data Limitations 🔴

**Current Analysis Limited By**:
- ❌ No workflow execution metrics
- ❌ No failure rate data
- ❌ No runtime performance data
- ❌ No error pattern analysis
- ❌ Cannot calculate MTBF
- ❌ Cannot identify failing workflows

**Reason**: The Metrics Collector workflow is outdated, so metrics storage has not been populated.

**Impact**: This assessment can only evaluate structural health (compilation, configuration, categorization). Runtime health monitoring requires metrics data.

## Recommendations

### Immediate Actions (P0)
1. ✅ **Recompile outdated workflows** - Issue created
2. ✅ **Enable metrics collection** - Issue created
3. ⏳ **Verify metrics collection** - Pending workflow fix
4. ⏳ **Wait for baseline data** - Need 7 days of metrics

### High Priority (P1)
1. **Establish monitoring alerts** - Set up notifications for workflow failures
2. **Document workflow dependencies** - Map inter-workflow relationships
3. **Verify safe outputs usage** - Deep dive into workflow bodies

### Medium Priority (P2)
1. **Analyze execution patterns** - Once metrics available
2. **Optimize scheduling** - Prevent resource contention
3. **Review smoke test coverage** - Ensure critical paths tested

### Low Priority (P3)
1. **Standardize frontmatter** - Consistent metadata across workflows
2. **Add workflow descriptions** - Improve discoverability
3. **Document engine selection** - Guidelines for choosing engines

## Actions Taken This Run

- ✅ Scanned 128 executable workflows
- ✅ Verified 100% compilation coverage
- ✅ Identified 10 outdated lock files
- ✅ Created 2 P0 issues for critical problems
- ✅ Saved analysis to shared repo memory
- ✅ Created coordination alerts for other meta-orchestrators

## Trends

- **Overall health score**: Unable to calculate (no metrics data)
- **Compilation health**: 92.2% (118/128 up-to-date)
- **New failures this week**: Unknown (no metrics)
- **Fixed issues this week**: Unknown (no metrics)
- **Average success rate**: Unknown (no metrics)

## Next Steps

1. ⏳ Monitor recompilation issue resolution
2. ⏳ Monitor metrics collection enablement
3. ⏳ Wait 7 days for metrics baseline
4. 🔄 Re-run comprehensive health analysis with execution data
5. 🔄 Establish ongoing monitoring and alerting

## Success Metrics Target

Once metrics are available, track:
- Overall workflow health score > 80/100
- Workflow success rate > 90%
- Mean time between failures (MTBF) > 7 days
- Outdated lock files < 5%
- Failed workflows detected within 24 hours
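
One way to check these targets, once data exists, is a small evaluator that reports `unknown` for any metric that has not been collected yet. This is a sketch only: `HealthSnapshot`, its field names, and `evaluate_targets` are hypothetical, and "within 24 hours" is interpreted here as a detection delay of at most 24 hours. The thresholds come from the list above.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class HealthSnapshot:
    """Hypothetical container; None means the metric is not yet available."""
    health_score: Optional[float] = None       # 0-100
    success_rate_pct: Optional[float] = None   # percent
    mtbf_days: Optional[float] = None          # mean time between failures
    outdated_lock_pct: Optional[float] = None  # percent
    detection_hours: Optional[float] = None    # time to detect a failed workflow


def evaluate_targets(s: HealthSnapshot) -> dict[str, str]:
    """Map each target to 'pass', 'fail', or 'unknown'."""
    checks = {
        "health_score": (s.health_score, lambda v: v > 80),
        "success_rate_pct": (s.success_rate_pct, lambda v: v > 90),
        "mtbf_days": (s.mtbf_days, lambda v: v > 7),
        "outdated_lock_pct": (s.outdated_lock_pct, lambda v: v < 5),
        "detection_hours": (s.detection_hours, lambda v: v <= 24),
    }
    return {
        name: "unknown" if value is None else ("pass" if ok(value) else "fail")
        for name, (value, ok) in checks.items()
    }


# Only the lock-file percentage is known in this run (7.8%).
current = HealthSnapshot(outdated_lock_pct=7.8)
status = evaluate_targets(current)
```

With the data from this run, only the lock-file target can be evaluated, and it fails because 7.8% exceeds the 5% threshold.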

---

> **Last updated**: 2026-01-04T02:59:53Z  
> **Next check**: After metrics collection enabled (7 days minimum for baseline)  
> **Dashboard maintained by**: Workflow Health Manager  
> **Shared memory**: `/tmp/gh-aw/repo-memory/default/workflow-health-latest.md`





> AI generated by [Workflow Health Manager - Meta-Orchestrator](https://github.com/githubnext/gh-aw/actions/runs/20686646526)
