CI Hardening — 19 Opportunities Identified
Flight audited all 15 workflows on bradygaster/squad and identified 19 hardening opportunities.
Full analysis: docs/proposals/ci-hardening-opportunities.md
Priority: P0
Quick Wins (2-3 hours, high impact)
- Retry logic on npm install — eliminates ~20-30% of transient CI failures
- Job timeout tuning — prevents 6-hour hangs on stuck jobs
- npm cache optimization — 30-40% speedup on install steps
- Conditional docs quality checks — skip on code-only PRs (10-15% faster)
- Publish secret validation — fail fast on misconfiguration
Short-term (5-6 hours, medium ROI)
- Lockfile stability in preflight
- Consolidate duplicate steps (create composite action)
- Ralph cron review
- API rate limit monitoring
- Registry health checks
Backlog (strategic, higher complexity)
- Cross-platform CI (Windows/macOS/Linux matrix)
- Security scanning (SAST + npm audit)
- Performance regression detection
- E2E CLI integration tests
- Flaky test quarantine system
Routing
- Booster owns implementation (CI/CD Engineer)
- FIDO monitors for flaky test patterns (Quality Owner)
- Flight approves + triages Phase 3 against v1.0 roadmap
Files Needing Changes
| File |
Changes |
Effort |
squad-ci.yml |
Add npm cache, retry, timeout, conditional Playwright |
30 min |
squad-npm-publish.yml |
Add retry, secret validation |
20 min |
squad-insider-publish.yml |
Add retry in matrix |
15 min |
ci-rerun.yml |
Add npm cache |
15 min |
CI Hardening — 19 Opportunities Identified
Flight audited all 15 workflows on bradygaster/squad and identified 19 hardening opportunities.
Full analysis:
docs/proposals/ci-hardening-opportunities.mdPriority: P0
Quick Wins (2-3 hours, high impact)
Short-term (5-6 hours, medium ROI)
Backlog (strategic, higher complexity)
Routing
Files Needing Changes
squad-ci.ymlsquad-npm-publish.ymlsquad-insider-publish.ymlci-rerun.yml