Skip to content

[aw-failures] Smoke suite: 4 engines failing on every run for 5+ days — no root-cause fix deployed #29231

@github-actions

Description

@github-actions

Executive Summary

The agentic smoke test suite (engines: Gemini, Crush, OpenCode, Copilot) has been failing on every single run for at least 5 consecutive days (Apr 25–30, 2026). Each batch generates 4 auto-expiring failure issues that get closed as not_planned without fixing root causes, creating a silent feedback loop with no systemic remediation.

This issue consolidates the 6-hour failure window analysis (2026-04-29T19:19–2026-04-30T01:19 UTC) and tracks root-cause fixes.

Failure Clusters

Engine Workflow Failure Type Recurrence Current Issue
gemini smoke-gemini API key invalid (HTTP 400, INVALID_ARGUMENT) Every run, 5+ days #29211
crush smoke-crush EROFS read-only filesystem install failure Every run, 5+ days #29207
opencode smoke-opencode No safe outputs generated (agent never calls noop) Every run, 5+ days #29208
copilot smoke-copilot Exit code 1 (generic, varies) Recurring #29206
Evidence: Audit findings for representative runs

Smoke Gemini (§25138536109):

Smoke Crush (§25138536143):

Smoke OpenCode (§25138536100):

Smoke Copilot (§25138536142):

  • Error: Process completed with exit code 1
  • Varies in root cause between runs (previously: auth missing, now: generic exit)

Existing Issue Correlation

Proposed Fix Roadmap

P0 — Blocks all smoke runs immediately

  • Fix Gemini API key (credential rotation/setup) → sub-issue #aw_gemfix
  • Fix Crush installation EROFS (workflow or package config) → sub-issue #aw_crfix

P1 — Workflow quality

  • Ensure OpenCode smoke prompt calls noop when no action taken

P2 — Systemic

  • Consider deduplicating auto-generated per-engine failure issues into a single recurring tracker to avoid issue churn

Sub-Issues Created

  • #aw_gemfix — Fix invalid Gemini API key for smoke-gemini
  • #aw_crfix — Fix smoke-crush EROFS installation failure

References:

Generated by [aw] Failure Investigator (6h) · ● 327.3K ·

  • expires on May 7, 2026, 1:30 AM UTC


Update — Apr 30 2026 (run §25153298474)

New P0 Finding: MCP Tool Schema Incompatibility Blocking Smoke Claude

A new and previously untracked failure was identified in the 6-hour window (01:34–07:35 UTC):

Smoke Claude run §25149789871 (on PR branch copilot/fix-gemini-api-key-error) fails immediately with:

tools.57.custom.input_schema: input_schema does not support oneOf, allOf, or anyOf at the top level

The Anthropic API (/v1/messages) rejects the tool manifest on the first request — 0 tokens consumed. A separate Smoke Claude run (§25148772845) failed with all MCP servers dropping connection at 0s uptime (tracked in #29255).

Sub-issue created: #aw_mcpsch — fix MCP tool schema incompatibility

Status Improvements

  • Smoke Copilot (§25149789919): succeeded on PR trigger (29 turns, 1M tokens) — no longer consistently failing
  • Smoke OpenCode (§25149789880): concluded success on PR trigger

Persistent Failures (unchanged)

Generated by [aw] Failure Investigator (6h) · ● 484K ·

Metadata

Metadata

Assignees

No one assigned

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions