Executive Summary
The agentic smoke test suite (engines: Gemini, Crush, OpenCode, Copilot) has failed on every run for at least 5 consecutive days (Apr 25–30, 2026). Each batch generates 4 auto-expiring failure issues, one per engine, that are closed as not_planned without the root causes being fixed, so the failures recur silently with no systemic remediation.
This issue consolidates the 6-hour failure window analysis (2026-04-29T19:19–2026-04-30T01:19 UTC) and tracks root-cause fixes.
Failure Clusters
| Engine | Workflow | Failure Type | Recurrence | Current Issue |
| --- | --- | --- | --- | --- |
| gemini | smoke-gemini | API key invalid (HTTP 400, INVALID_ARGUMENT) | Every run, 5+ days | #29211 |
| crush | smoke-crush | EROFS read-only filesystem install failure | Every run, 5+ days | #29207 |
| opencode | smoke-opencode | No safe outputs generated (agent never calls noop) | Every run, 5+ days | #29208 |
| copilot | smoke-copilot | Exit code 1 (generic, varies) | Recurring | #29206 |
Evidence: Audit findings for representative runs
Smoke Gemini (§25138536109):
Smoke Crush (§25138536143):
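Smoke OpenCode (§25138536100):
- Run concludes success: runner finishes without error
- Agent never calls noop, so no safe outputs are generated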
Smoke Copilot (§25138536142):
- Error: Process completed with exit code 1
- Varies in root cause between runs (previously: auth missing, now: generic exit)
Existing Issue Correlation
Proposed Fix Roadmap
P0 — Blocks all smoke runs immediately
- Fix Gemini API key (credential rotation/setup) → sub-issue #aw_gemfix
- Fix Crush installation EROFS (workflow or package config) → sub-issue #aw_crfix
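A pre-flight probe could make the EROFS condition visible before the install step runs. The following is a minimal sketch, not the workflow's actual logic: the function name `check_writable` and its return labels are hypothetical, and the real install path used by smoke-crush is not stated in this issue.

```python
import errno
import os
import tempfile


def check_writable(directory: str) -> str:
    """Classify a directory as 'ok', 'missing', 'read-only', or 'error:<NAME>'.

    Hypothetical helper: the labels are illustrative, not part of any tool.
    """
    if not os.path.isdir(directory):
        return "missing"
    try:
        # Creating (and auto-deleting) a temp file is the most direct probe.
        with tempfile.TemporaryFile(dir=directory):
            pass
    except OSError as exc:
        if exc.errno == errno.EROFS:
            # Same condition the smoke-crush install step reports.
            return "read-only"
        return "error:" + errno.errorcode.get(exc.errno, str(exc.errno))
    return "ok"
```

Running a probe like this against the intended install prefix at the start of the job would separate "the filesystem is mounted read-only" from "the package itself is broken", which the two proposed fix directions (workflow or package config) depend on.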
P1 — Workflow quality
- Ensure OpenCode smoke prompt calls noop when no action taken
P2 — Systemic
- Consider deduplicating auto-generated per-engine failure issues into a single recurring tracker to avoid issue churn
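One way to picture the deduplication idea is to key each failure on a stable tuple and collect run IDs under it, so that repeated failures update one tracker entry instead of opening new issues. This is a sketch under assumptions: the `Failure` record, its fields, and the choice of key are hypothetical and do not reflect how the Failure Investigator currently stores results.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Failure:
    # Hypothetical record shape; field names are illustrative only.
    engine: str      # e.g. "gemini"
    workflow: str    # e.g. "smoke-gemini"
    signature: str   # short, stable description of the failure type
    run_id: str


def group_failures(failures):
    """Group run IDs by (engine, workflow, signature).

    Each key would correspond to one recurring tracker entry.
    """
    groups = defaultdict(list)
    for failure in failures:
        key = (failure.engine, failure.workflow, failure.signature)
        groups[key].append(failure.run_id)
    return dict(groups)
```

The key deliberately excludes the run ID and timestamp, so a failure that recurs with the same signature lands in the same group. A generic signature such as "exit code 1" would group unrelated root causes together, so the signature would need to be more specific for engines like Copilot where the cause varies.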
Sub-Issues Created
- #aw_gemfix — Fix invalid Gemini API key for smoke-gemini
- #aw_crfix — Fix smoke-crush EROFS installation failure
References:
Generated by [aw] Failure Investigator (6h) · ● 327.3K
Update — Apr 30 2026 (run §25153298474)
New P0 Finding: MCP Tool Schema Incompatibility Blocking Smoke Claude
A new and previously untracked failure was identified in the 6-hour window (01:34–07:35 UTC):
Smoke Claude run §25149789871 (on PR branch copilot/fix-gemini-api-key-error) fails immediately with:
tools.57.custom.input_schema: input_schema does not support oneOf, allOf, or anyOf at the top level
The Anthropic API (/v1/messages) rejects the tool manifest on the first request — 0 tokens consumed. A separate Smoke Claude run (§25148772845) failed with all MCP servers dropping connection at 0s uptime (tracked in #29255).
Sub-issue created: #aw_mcpsch — fix MCP tool schema incompatibility
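The rejection above can in principle be caught before any API request is made by scanning the tool manifest locally. The sketch below assumes the manifest is a list of dicts with `name` and `input_schema` keys; that shape, and the function name `find_incompatible_tools`, are assumptions for illustration rather than the actual structure used by the smoke workflow.

```python
# Keywords the error message says are not supported at the top level.
FORBIDDEN_TOP_LEVEL = ("oneOf", "allOf", "anyOf")


def find_incompatible_tools(tools):
    """Return (index, name, offending_keys) for each tool whose
    input_schema has a forbidden keyword at its top level.

    Only the top level is inspected; nested uses are not flagged.
    """
    problems = []
    for index, tool in enumerate(tools):
        schema = tool.get("input_schema") or {}
        bad = [key for key in FORBIDDEN_TOP_LEVEL if key in schema]
        if bad:
            problems.append((index, tool.get("name", "<unnamed>"), bad))
    return problems
```

The returned index corresponds to the position reported in the error path (for example `tools.57`), which should make it easier to identify which MCP server contributed the offending tool.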
Status Improvements
- ✅ Smoke Copilot (§25149789919): succeeded on PR trigger (29 turns, 1M tokens) — no longer consistently failing
- ✅ Smoke OpenCode (§25149789880): concluded success on PR trigger
Persistent Failures (unchanged)
Generated by [aw] Failure Investigator (6h) · ● 484K