Skip to content

CI Hardening: 19 opportunities identified (5 quick wins) #121

@diberry

Description

@diberry

CI Hardening — 19 Opportunities Identified

Flight audited all 15 workflows on bradygaster/squad and identified 19 hardening opportunities.

Full analysis: docs/proposals/ci-hardening-opportunities.md

Priority: P0

Quick Wins (2-3 hours, high impact)

  1. Retry logic on npm install — eliminates ~20-30% of transient CI failures
  2. Job timeout tuning — prevents 6-hour hangs on stuck jobs
  3. npm cache optimization — 30-40% speedup on install steps
  4. Conditional docs quality checks — skip on code-only PRs (10-15% faster)
  5. Publish secret validation — fail fast on misconfiguration

Short-term (5-6 hours, medium ROI)

  • Lockfile stability in preflight
  • Consolidate duplicate steps (create composite action)
  • Ralph cron review
  • API rate limit monitoring
  • Registry health checks

Backlog (strategic, higher complexity)

  • Cross-platform CI (Windows/macOS/Linux matrix)
  • Security scanning (SAST + npm audit)
  • Performance regression detection
  • E2E CLI integration tests
  • Flaky test quarantine system

Routing

  • Booster owns implementation (CI/CD Engineer)
  • FIDO monitors for flaky test patterns (Quality Owner)
  • Flight approves + triages Phase 3 against v1.0 roadmap

Files Needing Changes

File Changes Effort
squad-ci.yml Add npm cache, retry, timeout, conditional Playwright 30 min
squad-npm-publish.yml Add retry, secret validation 20 min
squad-insider-publish.yml Add retry in matrix 15 min
ci-rerun.yml Add npm cache 15 min

Metadata

Metadata

Assignees

No one assigned

    Labels

    go:needs-researchNeeds investigationsquadSquad triage inbox — Lead will assign to a membersquad:archiveResolved by upstream or no longer applicablesquad:fidoAssigned to FIDO (Quality Owner)

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions