diff --git a/CLAUDE.md b/CLAUDE.md
new file mode 100644
index 0000000..8651bfa
--- /dev/null
+++ b/CLAUDE.md
@@ -0,0 +1,59 @@
+# CLAUDE.md
+
+This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.
+
+## Project
+
+OpenCode CLI Enforcer — an OpenCode plugin that orchestrates Claude, Gemini, and Codex CLIs with resilience (circuit breakers, retry with backoff, automatic fallback). Published to npm as `opencode-cli-enforcer`.
+
+## Commands
+
+```bash
+bun install              # Install dependencies
+bun test                 # Run all tests
+bun test --watch         # Run tests in watch mode
+bun test tests/retry.test.ts  # Run a single test file
+bun run typecheck        # Type-check without emitting
+bun run build            # Build (no bundle)
+```
+
+Runtime is **Bun** (>=1.3.5), not Node. Tests use Bun's built-in test runner (`bun:test`). There is no linter or formatter configured.
+
+## Architecture
+
+The plugin exports four tools (`cli_exec`, `cli_status`, `cli_list`, `cli_route`) via the OpenCode plugin interface.
+
+**Request flow through `cli_exec`:**
+
+```
+index.ts (plugin entry, tool definitions, hooks)
+  → resilience.ts (global time budget, retry + circuit breaker + fallback)
+    → circuit-breaker.ts (per-CLI isolation: 3 failures OR 5 timeouts → open)
+    → retry.ts (exponential backoff with jitter, abort-aware sleep)
+    → executor.ts (execa wrapper: structured results, Windows .cmd handling, PATH augmentation)
+      → cli-defs.ts (arg builders + dynamic --max-turns for Claude)
+      → detection.ts (CLI availability via which/where, 5-min cache)
+      → safe-env.ts (allowlisted env vars only, no API keys)
+      → redact.ts (strips API keys from output)
+      → error-classifier.ts (transient/rate_limit/permanent/crash)
+```
+
+**Key state in `index.ts`:** three `Map`s — `breakers` (circuit breaker per CLI), `cliAvailability` (detection results), `usageStats` (call counts/timing). CLI detection runs non-blocking at startup via `Promise.allSettled`.
+
+**Global time budget** (`resilience.ts`): a single timeout budget shared across all retries AND fallbacks, preventing timeout multiplication. Process timeouts skip retries and go straight to fallback.
+
+**Circuit breaker** has separate thresholds: opens after 3 failures OR 5 timeouts (slow ≠ broken), cooldown 60s. **Retry**: max 2 retries, 1s base delay, 10s max, 0.3 jitter factor.
+
+**Role-based routing** (`cli-defs.ts`): 6 agent roles (manager, coordinator, developer, researcher, reviewer, architect) map to optimal CLI providers via `cli_route`.
+
+## Cross-Platform
+
+- `platform.ts` exports `PLATFORM` and `IS_WINDOWS`
+- Binary detection uses `which` (Unix) / `where` (Windows) with 5-minute cache
+- Windows: `.cmd/.bat` shim handling via `cmd /c`, PATH augmentation (npm, scoop, cargo, pnpm)
+- Large prompts (>30KB) delivered via stdin to avoid OS arg-length limits
+- CI runs on ubuntu, windows, and macos
+
+## Release
+
+The release workflow (`.github/workflows/release.yml`) requires a production environment approval gate, publishes to npm with provenance attestation using Node 22, and creates a GitHub release with a git tag.
diff --git a/README.md b/README.md
index 1e47ef0..4238e6e 100644
--- a/README.md
+++ b/README.md
@@ -1,21 +1,92 @@
 <p align="center">
-  <strong>opencode-cli-enforcer</strong><br>
-  <em>Resilient multi-LLM CLI orchestration for OpenCode</em>
+  <img src="docs/assets/logo.svg" alt="opencode-cli-enforcer" width="120" />
+</p>
+
+<h1 align="center">opencode-cli-enforcer</h1>
+
+<p align="center">
+  <strong>Resilient multi-LLM CLI orchestration for OpenCode</strong><br>
+  <em>Execute Claude, Gemini &amp; Codex with circuit breakers, smart retry, automatic fallback, and role-based routing.</em>
 </p>
 
 <p align="center">
   <a href="https://github.com/lleontor705/opencode-cli-enforcer/actions/workflows/ci.yml"><img src="https://github.com/lleontor705/opencode-cli-enforcer/actions/workflows/ci.yml/badge.svg" alt="CI" /></a>
-  <a href="https://www.npmjs.com/package/opencode-cli-enforcer"><img src="https://img.shields.io/npm/v/opencode-cli-enforcer" alt="npm" /></a>
+  <a href="https://www.npmjs.com/package/opencode-cli-enforcer"><img src="https://img.shields.io/npm/v/opencode-cli-enforcer?color=cb3837" alt="npm" /></a>
   <a href="https://github.com/lleontor705/opencode-cli-enforcer/blob/master/LICENSE"><img src="https://img.shields.io/badge/license-MIT-blue.svg" alt="License" /></a>
+  <img src="https://img.shields.io/badge/platform-windows%20%7C%20macos%20%7C%20linux-brightgreen" alt="Platform" />
+  <img src="https://img.shields.io/badge/runtime-Bun%20%E2%89%A51.3.5-f472b6" alt="Bun" />
+</p>
+
+---
+
+## Why opencode-cli-enforcer?
+
+Running AI CLIs in production is fragile. Processes timeout, rate limits hit, binaries disappear. Calling three different CLIs means three different failure modes, arg formats, and platform quirks.
+
+**opencode-cli-enforcer** wraps all of that into a single, resilient plugin:
+
+<table>
+<tr>
+<td width="50%">
+
+**Without this plugin**
+- Manual subprocess management
+- No retry on transient failures
+- One CLI down = entire workflow blocked
+- OS-specific arg handling per CLI
+- Secrets leak into error logs
+- No visibility into CLI health
+
+</td>
+<td width="50%">
+
+**With this plugin**
+- 4 tools, zero boilerplate
+- Exponential backoff + jitter retry
+- Automatic fallback chain across providers
+- Cross-platform (Windows `.cmd` shims, PATH augmentation)
+- Secret redaction on all output
+- Real-time health dashboard
+
+</td>
+</tr>
+</table>
+
+---
+
+## Architecture
+
+<p align="center">
+  <img src="docs/assets/architecture.svg" alt="Architecture diagram" width="780" />
 </p>
 
+```
+cli_exec(prompt)
+  |
+  v
++----------------------------------------------------------+
+|                  Resilience Engine                        |
+|                                                          |
+|  Global Time Budget (shared across ALL attempts)         |
+|  +---------+     +---------+     +---------+             |
+|  | Claude  | --> | Gemini  | --> | Codex   |  fallback   |
+|  +---------+     +---------+     +---------+  chain      |
+|       |               |               |                  |
+|       v               v               v                  |
+|  [Circuit Breaker] -----> [Retry w/ Backoff] --> [execa] |
+|  3 failures = open        max 2 retries          10MB    |
+|  5 timeouts = open        1s-10s + jitter        buffer  |
+|  60s cooldown             abort-aware sleep               |
++----------------------------------------------------------+
+```
+
 ---
 
-Execute Claude, Gemini, and Codex CLIs with automatic OS detection, circuit breaker pattern, retry with exponential backoff, and provider fallback. Cross-platform (Windows/macOS/Linux).
+## Quick Start
 
-## Install
+### 1. Install as OpenCode plugin (recommended)
 
-### OpenCode plugin (recommended)
+Add to your OpenCode configuration:
 
 ```json
 {
@@ -23,77 +94,478 @@ Execute Claude, Gemini, and Codex CLIs with automatic OS detection, circuit brea
 }
 ```
 
-### npm
+### 2. Install via npm / bun
 
 ```bash
 bun add opencode-cli-enforcer
+# or
+npm install opencode-cli-enforcer
 ```
 
-## Tools
+### 3. Prerequisites
+
+You need at least **one** CLI installed and authenticated:
+
+| CLI | Install | Auth |
+|-----|---------|------|
+| [Claude Code](https://docs.anthropic.com/en/docs/claude-code) | `npm i -g @anthropic-ai/claude-code` | `claude login` |
+| [Gemini CLI](https://github.com/google-gemini/gemini-cli) | `npm i -g @anthropic-ai/gemini-cli` | `gcloud auth login` |
+| [Codex CLI](https://github.com/openai/codex) | `npm i -g @openai/codex` | `codex auth` |
+
+---
+
+## Tools Reference
+
+### `cli_exec` — Execute with full resilience
+
+The primary tool. Sends a prompt to a CLI with automatic retry, circuit breaker protection, and fallback.
 
-### `cli_exec` — Execute a CLI with full resilience
+```typescript
+cli_exec({
+  cli: "claude",
+  prompt: "Explain the observer pattern with a TypeScript example",
+  mode: "generate",           // "generate" | "analyze"
+  timeout_seconds: 300,       // Global budget: 10-1800s
+  allow_fallback: true        // Try gemini/codex on failure
+})
+```
 
 | Parameter | Type | Default | Description |
 |-----------|------|---------|-------------|
-| `cli` | `"claude" \| "gemini" \| "codex"` | required | Primary CLI |
-| `prompt` | `string` | required | Prompt to send |
-| `mode` | `"generate" \| "analyze"` | `"generate"` | `analyze` enables file reads (Claude) |
-| `timeout_seconds` | `number` | `720` | Max seconds (10-1800) |
-| `allow_fallback` | `boolean` | `true` | Try alternatives on failure |
+| `cli` | `"claude" \| "gemini" \| "codex"` | *required* | Primary CLI provider |
+| `prompt` | `string` | *required* | Prompt to send (max 100KB) |
+| `mode` | `"generate" \| "analyze"` | `"generate"` | `analyze` enables file reads (Claude only) |
+| `timeout_seconds` | `number` | `720` | Global timeout budget in seconds |
+| `allow_fallback` | `boolean` | `true` | Auto-fallback to alternative providers |
+
+**Response:**
+
+```jsonc
+{
+  "success": true,
+  "cli": "claude",
+  "stdout": "The Observer pattern is a behavioral design pattern...",
+  "stderr": "",
+  "duration_ms": 4523,
+  "timed_out": false,
+  "used_fallback": false,
+  "fallback_chain": ["claude"],
+  "error": null,
+  "error_class": null,           // "transient" | "rate_limit" | "permanent" | "crash"
+  "circuit_state": "closed",     // "closed" | "open" | "half-open"
+  "attempt": 1,
+  "max_attempts": 3
+}
+```
+
+---
+
+### `cli_status` — Health dashboard
+
+Returns real-time health for all providers: installation status, circuit breaker state, and usage statistics.
 
-### `cli_status` — Health check dashboard
+```typescript
+cli_status({})
+```
+
+**Response:**
+
+```jsonc
+{
+  "platform": "windows",
+  "detection_complete": true,
+  "retry_config": { "max_retries": 2, "base_delay_ms": 1000, "max_delay_ms": 10000 },
+  "breaker_config": { "failure_threshold": 3, "timeout_threshold": 5, "cooldown_seconds": 60 },
+  "providers": [
+    {
+      "name": "claude",
+      "installed": true,
+      "path": "/usr/local/bin/claude",
+      "version": "1.0.16",
+      "circuit_breaker": {
+        "state": "closed",
+        "consecutive_failures": 0,
+        "consecutive_timeouts": 0,
+        "total_executions": 12,
+        "total_failures": 1,
+        "total_timeouts": 0
+      },
+      "usage": {
+        "total_calls": 12,
+        "success_rate": "92%",
+        "avg_duration_ms": 3400
+      },
+      "fallback_order": ["gemini", "codex"]
+    }
+    // ... gemini, codex
+  ]
+}
+```
+
+---
+
+### `cli_list` — List installed providers
+
+Quick check of which CLIs are available on the system.
+
+```typescript
+cli_list({})
+```
+
+**Response:**
+
+```jsonc
+{
+  "installed_count": 2,
+  "providers": [
+    { "provider": "claude", "path": "/usr/local/bin/claude", "version": "1.0.16", "strengths": ["reasoning", "code-analysis", "debugging", "architecture", "planning"] },
+    { "provider": "gemini", "path": "/usr/local/bin/gemini", "version": "0.1.8", "strengths": ["research", "trends", "knowledge", "large-context", "web-search"] }
+  ]
+}
+```
+
+---
+
+### `cli_route` — Role-based routing
 
-Returns platform info, detection status, circuit breaker states, and usage stats for all providers.
+Recommends the best CLI for a task based on agent role. Considers both provider strengths and real-time availability.
+
+```typescript
+cli_route({
+  role: "developer",
+  task_description: "Refactor the auth module to use JWT"
+})
+```
+
+| Parameter | Type | Description |
+|-----------|------|-------------|
+| `role` | `"manager" \| "coordinator" \| "developer" \| "researcher" \| "reviewer" \| "architect"` | Agent role |
+| `task_description` | `string?` | Optional context |
+
+**Routing table:**
+
+<p align="center">
+  <img src="docs/assets/routing.svg" alt="Role routing table" width="620" />
+</p>
+
+| Role | Primary CLI | Reasoning |
+|------|------------|-----------|
+| **Manager** | Gemini | Research, trends, large-context analysis |
+| **Coordinator** | Claude | Reasoning, planning, decision-making |
+| **Developer** | Codex | Code generation, refactoring, full-auto |
+| **Researcher** | Gemini | Knowledge synthesis, web search |
+| **Reviewer** | Claude | Code analysis, debugging, quality |
+| **Architect** | Claude | System design, architecture planning |
+
+**Response:**
+
+```jsonc
+{
+  "role": "developer",
+  "task_description": "Refactor the auth module to use JWT",
+  "recommended_cli": "codex",
+  "reasoning": "Role \"developer\" maps to codex (code-generation, edits, refactoring, full-auto).",
+  "fallback_chain": ["codex", "claude", "gemini"],
+  "availability": { "codex": true, "claude": true, "gemini": false }
+}
+```
+
+---
 
 ## Resilience Pipeline
 
+<p align="center">
+  <img src="docs/assets/resilience-pipeline.svg" alt="Resilience pipeline" width="780" />
+</p>
+
+### Global Time Budget
+
+Unlike per-attempt timeouts, the **global time budget** is shared across ALL retries and ALL fallback providers. This prevents timeout multiplication:
+
 ```
-Request --> Circuit Breaker --> Retry (3x, exp backoff) --> Execute (execa)
-               |                        |                        |
-               v                        v                        v
-          If open:                 If exhausted:            On failure:
-          skip to                  try next CLI             record + retry
-          fallback                 in chain
+Traditional:  3 providers x 3 attempts x 300s timeout = 2700s worst case
+This plugin:  300s total budget across everything       =  300s worst case
 ```
 
-**Circuit Breaker States:**
+Each attempt receives the **remaining** seconds, not the full budget. When the budget runs out, execution stops immediately.
 
-| State | Behavior |
-|-------|----------|
-| closed | Normal — requests pass through |
-| open | Blocked — 3+ failures, 60s cooldown |
-| half-open | Probe — 1 request to test recovery |
+### Circuit Breaker
 
-**Fallback Order:** `claude --> gemini --> codex`
+Per-CLI failure isolation with **separate thresholds** for failures and timeouts (because slow ≠ broken):
 
-## Supported CLIs
+<p align="center">
+  <img src="docs/assets/circuit-breaker.svg" alt="Circuit breaker states" width="620" />
+</p>
+
+| State | Behavior | Transition |
+|-------|----------|------------|
+| **Closed** | Normal operation, requests pass through | 3 failures OR 5 timeouts &rarr; Open |
+| **Open** | All requests blocked, provider is skipped | After 60s cooldown &rarr; Half-Open |
+| **Half-Open** | One probe request allowed | Success &rarr; Closed / Failure &rarr; Open |
+
+### Retry with Exponential Backoff
+
+```
+Attempt 0:  immediate
+Attempt 1:  ~1s  + jitter (+-30%)
+Attempt 2:  ~2s  + jitter (+-30%)
+            capped at 10s max
+```
+
+- **Transient errors** (network, socket): standard retry
+- **Rate limits** (429, quota): retry with 3x longer delay
+- **Process timeouts**: skip retries entirely, move to next provider
+- **Permanent errors** (auth, 401/403): skip retries, move to fallback
+- **Crash** (SIGKILL, ENOENT): skip retries, move to fallback
+
+### Error Classification
+
+```
+Error arrives
+  |
+  +-- exitCode 137 / SIGKILL / ENOENT ---------> CRASH      (no retry)
+  +-- 429 / "rate limit" / "quota" -------------> RATE_LIMIT (retry, 3x delay)
+  +-- 401 / 403 / "auth" / "not found" --------> PERMANENT  (no retry)
+  +-- everything else --------------------------> TRANSIENT  (retry)
+```
+
+### Fallback Chain
+
+When a provider fails, the next one in the chain takes over automatically:
+
+```
+Claude ---[fail]---> Gemini ---[fail]---> Codex
+Gemini ---[fail]---> Claude ---[fail]---> Codex
+Codex  ---[fail]---> Claude ---[fail]---> Gemini
+```
+
+---
+
+## Cross-Platform Support
+
+<table>
+<tr>
+<th>Feature</th>
+<th>Windows</th>
+<th>macOS / Linux</th>
+</tr>
+<tr>
+<td>Binary detection</td>
+<td><code>where</code></td>
+<td><code>which</code></td>
+</tr>
+<tr>
+<td><code>.cmd/.bat</code> shims</td>
+<td>Auto-wrapped with <code>cmd /c</code></td>
+<td>N/A</td>
+</tr>
+<tr>
+<td>PATH augmentation</td>
+<td>npm, scoop, cargo, pnpm dirs</td>
+<td>Standard PATH</td>
+</tr>
+<tr>
+<td>Large prompts (&gt;30KB)</td>
+<td colspan="2" align="center">Delivered via <code>stdin</code> to avoid OS arg-length limits</td>
+</tr>
+<tr>
+<td>Environment</td>
+<td colspan="2" align="center">Allowlisted vars only (no secrets leak to subprocesses)</td>
+</tr>
+</table>
+
+### Detection Caching
+
+CLI availability is cached for **5 minutes** to avoid repeated filesystem lookups. The cache covers:
+- Binary path resolution
+- Version detection
+- Both positive and negative results
+
+---
+
+## Security
+
+| Protection | Description |
+|-----------|-------------|
+| **Secret redaction** | API keys (`sk-*`, `key-*`, `AIza*`, `ant-api*`) and Bearer tokens stripped from all output |
+| **Environment filtering** | Only system essentials + proxy vars passed to subprocesses. No API keys — CLIs handle their own auth. |
+| **Input isolation** | Large prompts (>30KB) delivered via stdin, not shell args |
+| **No shell interpolation** | All CLI execution via `execa` (no `shell: true`) |
+
+---
+
+## Examples
+
+### Basic: Ask Claude to review code
+
+```typescript
+const result = await cli_exec({
+  cli: "claude",
+  prompt: "Review this function for bugs:\n\nfunction add(a, b) { return a - b }",
+  mode: "analyze",
+  timeout_seconds: 120
+})
+// result.stdout → "Bug found: the function is named `add` but performs subtraction..."
+```
+
+### Fallback: Primary CLI is down
+
+```typescript
+// Claude's circuit breaker is open (too many recent failures)
+const result = await cli_exec({
+  cli: "claude",
+  prompt: "Generate a REST API for user management",
+  allow_fallback: true
+})
+// result.cli → "gemini" (automatic fallback)
+// result.used_fallback → true
+```
+
+### Role routing: Pick the right tool for the job
+
+```typescript
+// For a developer task, route to Codex (best at code generation)
+const recommendation = await cli_route({
+  role: "developer",
+  task_description: "Implement pagination for the /users endpoint"
+})
+// recommendation.recommended_cli → "codex"
+
+// Then execute with the recommended CLI
+const result = await cli_exec({
+  cli: recommendation.recommended_cli,
+  prompt: "Implement pagination for the /users endpoint using cursor-based pagination"
+})
+```
 
-| CLI | Best For |
-|-----|----------|
-| Claude | Reasoning, code analysis, debugging, architecture |
-| Gemini | Research, broad knowledge, large context |
-| Codex | Code generation, edits, refactoring |
+### Monitor health across providers
 
-## Prerequisites
+```typescript
+const status = await cli_status({})
 
-- [Bun](https://bun.sh) runtime
-- At least one CLI: [Claude Code](https://docs.anthropic.com/en/docs/claude-code), [Gemini CLI](https://github.com/google-gemini/gemini-cli), or [Codex CLI](https://github.com/openai/codex)
+for (const provider of status.providers) {
+  console.log(`${provider.name}: ${provider.circuit_breaker.state} | ${provider.usage.success_rate}`)
+}
+// claude: closed | 95%
+// gemini: closed | 88%
+// codex: open   | 60%  <-- circuit breaker tripped
+```
+
+### Large prompt via stdin
+
+```typescript
+const largeCodebase = readFileSync("src/index.ts", "utf-8") // 45KB file
+const result = await cli_exec({
+  cli: "claude",
+  prompt: `Analyze this codebase for security vulnerabilities:\n\n${largeCodebase}`,
+  mode: "analyze",
+  timeout_seconds: 600
+})
+// Prompt >30KB → automatically delivered via stdin (no OS arg-length issues)
+```
+
+---
+
+## Hooks
+
+The plugin injects two hooks into the OpenCode lifecycle:
+
+### `experimental.chat.system.transform`
+
+Automatically injects CLI availability into the system prompt of every agent (except `orchestrator` and `task_decomposer`), so the LLM knows which tools are available and their current health.
+
+### `tool.execute.after`
+
+Tracks when agents invoke CLIs directly via `bash` instead of using `cli_exec`, incrementing usage counters for observability.
+
+---
+
+## Provider Strengths
+
+<p align="center">
+  <img src="docs/assets/providers.svg" alt="Provider strengths" width="700" />
+</p>
+
+| Provider | Binary | Strengths |
+|----------|--------|-----------|
+| **Claude** | `claude` | Reasoning, code analysis, debugging, architecture, planning |
+| **Gemini** | `gemini` | Research, trends, knowledge, large-context, web search |
+| **Codex** | `codex` | Code generation, edits, refactoring, full-auto mode |
+
+---
+
+## Configuration Reference
+
+### Circuit Breaker Defaults
+
+| Parameter | Value | Description |
+|-----------|-------|-------------|
+| `failureThreshold` | `3` | Consecutive failures before opening |
+| `timeoutThreshold` | `5` | Consecutive timeouts before opening (slow ≠ broken) |
+| `cooldownMs` | `60000` | Milliseconds before open &rarr; half-open |
+| `halfOpenSuccessThreshold` | `1` | Successes in half-open to close |
+
+### Retry Defaults
+
+| Parameter | Value | Description |
+|-----------|-------|-------------|
+| `maxRetries` | `2` | Maximum retry attempts per provider |
+| `baseDelayMs` | `1000` | Base delay for exponential backoff |
+| `maxDelayMs` | `10000` | Maximum delay cap |
+| `jitterFactor` | `0.3` | Random jitter range (+-30%) |
+
+### Executor Defaults
+
+| Parameter | Value | Description |
+|-----------|-------|-------------|
+| `STDIN_THRESHOLD` | `30000` | Characters before switching to stdin delivery |
+| `MAX_BUFFER` | `10MB` | Maximum stdout/stderr buffer |
+
+---
 
 ## Development
 
 ```bash
-bun install
-bun test
+bun install              # Install dependencies
+bun test                 # Run all tests (85 tests)
+bun test --watch         # Watch mode
+bun test tests/retry.test.ts  # Run a single test file
+bun run typecheck        # Type-check without emitting
+bun run build            # Build
+```
+
+### Project Structure
+
+```
+src/
+  index.ts             Plugin entry, 4 tools, 2 hooks
+  resilience.ts        Global time budget, retry + breaker + fallback
+  circuit-breaker.ts   Per-CLI state machine (failures + timeouts)
+  executor.ts          execa wrapper, Windows handling, PATH augmentation
+  cli-defs.ts          Provider configs, arg builders, role routing
+  detection.ts         CLI auto-detection with 5-min cache
+  retry.ts             Exponential backoff, abort-aware sleep
+  error-classifier.ts  Error categorization for retry decisions
+  safe-env.ts          Environment variable allowlist
+  redact.ts            Secret redaction
+  platform.ts          OS detection
+
+tests/
+  8 test files, 85 tests covering all modules
 ```
 
+---
+
 ## Contributing
 
 1. Fork the repo
 2. Create a feature branch from `develop`: `git checkout -b feat/my-feature develop`
 3. Make your changes and add tests
-4. Run `bun test`
+4. Run `bun test` (all 85 must pass)
 5. Open a PR to `develop`
 
+---
+
 ## License
 
-MIT
+[MIT](LICENSE) &copy; [lleontor705](https://github.com/lleontor705)
diff --git a/bun.lock b/bun.lock
index 13ea524..57c149d 100644
--- a/bun.lock
+++ b/bun.lock
@@ -6,8 +6,8 @@
       "name": "opencode-cli-enforcer",
       "dependencies": {
         "@opencode-ai/plugin": "^1.2.26",
-        "cockatiel": "^3.2.1",
         "execa": "^9.6.1",
+        "zod": "^3.23.0",
       },
       "devDependencies": {
         "@types/bun": "latest",
@@ -30,8 +30,6 @@
 
     "bun-types": ["bun-types@1.3.11", "", { "dependencies": { "@types/node": "*" } }, "sha512-1KGPpoxQWl9f6wcZh57LvrPIInQMn2TQ7jsgxqpRzg+l0QPOFvJVH7HmvHo/AiPgwXy+/Thf6Ov3EdVn1vOabg=="],
 
-    "cockatiel": ["cockatiel@3.2.1", "", {}, "sha512-gfrHV6ZPkquExvMh9IOkKsBzNDk6sDuZ6DdBGUBkvFnTCqCxzpuq48RySgP0AnaqQkw2zynOFj9yly6T1Q2G5Q=="],
-
     "cross-spawn": ["cross-spawn@7.0.6", "", { "dependencies": { "path-key": "^3.1.0", "shebang-command": "^2.0.0", "which": "^2.0.1" } }, "sha512-uV2QOWP2nWzsy2aMp8aRibhi9dlzF5Hgh5SHaB9OiTGEyDTiJJyx0uy51QXdyWbtAHNua4XJzUKca3OzKUd3vA=="],
 
     "execa": ["execa@9.6.1", "", { "dependencies": { "@sindresorhus/merge-streams": "^4.0.0", "cross-spawn": "^7.0.6", "figures": "^6.1.0", "get-stream": "^9.0.0", "human-signals": "^8.0.1", "is-plain-obj": "^4.1.0", "is-stream": "^4.0.1", "npm-run-path": "^6.0.0", "pretty-ms": "^9.2.0", "signal-exit": "^4.1.0", "strip-final-newline": "^4.0.0", "yoctocolors": "^2.1.1" } }, "sha512-9Be3ZoN4LmYR90tUoVu2te2BsbzHfhJyfEiAVfz7N5/zv+jduIfLrV2xdQXOHbaD6KgpGdO9PRPM1Y4Q9QkPkA=="],
@@ -76,7 +74,9 @@
 
     "yoctocolors": ["yoctocolors@2.1.2", "", {}, "sha512-CzhO+pFNo8ajLM2d2IW/R93ipy99LWjtwblvC1RsoSUMZgyLbYFr221TnSNT7GjGdYui6P459mw9JH/g/zW2ug=="],
 
-    "zod": ["zod@4.1.8", "", {}, "sha512-5R1P+WwQqmmMIEACyzSvo4JXHY5WiAFHRMg+zBZKgKS+Q1viRa0C1hmUKtHltoIFKtIdki3pRxkmpP74jnNYHQ=="],
+    "zod": ["zod@3.25.76", "", {}, "sha512-gzUt/qt81nXsFGKIFcC3YnfEAx5NkunCfnDlvuBSSFS02bcXu4Lmea0AFIUwbLWxWPx3d9p8S5QoaujKcNQxcQ=="],
+
+    "@opencode-ai/plugin/zod": ["zod@4.1.8", "", {}, "sha512-5R1P+WwQqmmMIEACyzSvo4JXHY5WiAFHRMg+zBZKgKS+Q1viRa0C1hmUKtHltoIFKtIdki3pRxkmpP74jnNYHQ=="],
 
     "npm-run-path/path-key": ["path-key@4.0.0", "", {}, "sha512-haREypq7xkM7ErfgIyA0z+Bj4AGKlMSdlQE2jvJo6huWD1EdkKYV+G/T4nq0YEF2vgTT8kqMFKo1uHn950r4SQ=="],
   }
diff --git a/docs/assets/architecture.svg b/docs/assets/architecture.svg
new file mode 100644
index 0000000..0570a24
--- /dev/null
+++ b/docs/assets/architecture.svg
@@ -0,0 +1,124 @@
+<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 780 420" width="780" height="420">
+  <style>
+    text { font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', sans-serif; }
+    .title { font-size: 14px; font-weight: 700; fill: #1e1b4b; }
+    .label { font-size: 11px; fill: #4b5563; }
+    .label-sm { font-size: 10px; fill: #6b7280; }
+    .label-white { font-size: 11px; fill: #ffffff; font-weight: 600; }
+    .label-bold { font-size: 12px; fill: #1e1b4b; font-weight: 600; }
+  </style>
+  <defs>
+    <marker id="arrow" markerWidth="8" markerHeight="6" refX="8" refY="3" orient="auto">
+      <path d="M0,0 L8,3 L0,6" fill="#6366f1"/>
+    </marker>
+    <marker id="arrow-gray" markerWidth="8" markerHeight="6" refX="8" refY="3" orient="auto">
+      <path d="M0,0 L8,3 L0,6" fill="#9ca3af"/>
+    </marker>
+  </defs>
+
+  <!-- Background -->
+  <rect width="780" height="420" rx="12" fill="#fafafe"/>
+
+  <!-- Header -->
+  <text x="390" y="30" text-anchor="middle" class="title" font-size="16">OpenCode CLI Enforcer — Architecture</text>
+
+  <!-- OpenCode Agent box -->
+  <rect x="30" y="55" width="160" height="50" rx="8" fill="#f0fdf4" stroke="#22c55e" stroke-width="1.5"/>
+  <text x="110" y="75" text-anchor="middle" class="label-bold">OpenCode Agent</text>
+  <text x="110" y="90" text-anchor="middle" class="label-sm">system prompt + hooks</text>
+
+  <!-- Arrow down -->
+  <line x1="110" y1="105" x2="110" y2="135" stroke="#6366f1" stroke-width="2" marker-end="url(#arrow)"/>
+
+  <!-- Plugin box -->
+  <rect x="30" y="140" width="160" height="50" rx="8" fill="#eef2ff" stroke="#6366f1" stroke-width="1.5"/>
+  <text x="110" y="160" text-anchor="middle" class="label-bold">Plugin Entry</text>
+  <text x="110" y="175" text-anchor="middle" class="label-sm">index.ts — 4 tools, 2 hooks</text>
+
+  <!-- Arrow right to tools -->
+  <line x1="190" y1="165" x2="240" y2="165" stroke="#6366f1" stroke-width="2" marker-end="url(#arrow)"/>
+
+  <!-- Tools column -->
+  <rect x="250" y="60" width="140" height="38" rx="6" fill="#6366f1"/>
+  <text x="320" y="84" text-anchor="middle" class="label-white">cli_exec</text>
+
+  <rect x="250" y="108" width="140" height="38" rx="6" fill="#7c3aed"/>
+  <text x="320" y="132" text-anchor="middle" class="label-white">cli_status</text>
+
+  <rect x="250" y="156" width="140" height="38" rx="6" fill="#8b5cf6"/>
+  <text x="320" y="180" text-anchor="middle" class="label-white">cli_list</text>
+
+  <rect x="250" y="204" width="140" height="38" rx="6" fill="#a78bfa"/>
+  <text x="320" y="228" text-anchor="middle" class="label-white">cli_route</text>
+
+  <!-- Arrow from cli_exec to resilience -->
+  <line x1="390" y1="79" x2="440" y2="79" stroke="#6366f1" stroke-width="2" marker-end="url(#arrow)"/>
+
+  <!-- Resilience Engine box -->
+  <rect x="450" y="50" width="300" height="360" rx="10" fill="#fefce8" stroke="#eab308" stroke-width="1.5" stroke-dasharray="4"/>
+  <text x="600" y="72" text-anchor="middle" class="title">Resilience Engine</text>
+
+  <!-- Global Budget -->
+  <rect x="470" y="82" width="260" height="32" rx="6" fill="#fef3c7" stroke="#f59e0b" stroke-width="1"/>
+  <text x="600" y="103" text-anchor="middle" class="label-bold" fill="#92400e">Global Time Budget (shared)</text>
+
+  <!-- Provider boxes -->
+  <rect x="480" y="130" width="75" height="45" rx="6" fill="#dbeafe" stroke="#3b82f6" stroke-width="1.2"/>
+  <text x="517" y="148" text-anchor="middle" class="label-sm" fill="#1e40af" font-weight="600">Claude</text>
+  <text x="517" y="164" text-anchor="middle" class="label-sm">reasoning</text>
+
+  <rect x="563" y="130" width="75" height="45" rx="6" fill="#dcfce7" stroke="#22c55e" stroke-width="1.2"/>
+  <text x="600" y="148" text-anchor="middle" class="label-sm" fill="#166534" font-weight="600">Gemini</text>
+  <text x="600" y="164" text-anchor="middle" class="label-sm">research</text>
+
+  <rect x="646" y="130" width="75" height="45" rx="6" fill="#fce7f3" stroke="#ec4899" stroke-width="1.2"/>
+  <text x="683" y="148" text-anchor="middle" class="label-sm" fill="#9d174d" font-weight="600">Codex</text>
+  <text x="683" y="164" text-anchor="middle" class="label-sm">code-gen</text>
+
+  <!-- Fallback arrows -->
+  <line x1="555" y1="152" x2="563" y2="152" stroke="#9ca3af" stroke-width="1.5" marker-end="url(#arrow-gray)"/>
+  <line x1="638" y1="152" x2="646" y2="152" stroke="#9ca3af" stroke-width="1.5" marker-end="url(#arrow-gray)"/>
+
+  <!-- Arrow down from providers -->
+  <line x1="600" y1="175" x2="600" y2="200" stroke="#6366f1" stroke-width="2" marker-end="url(#arrow)"/>
+
+  <!-- Circuit Breaker -->
+  <rect x="480" y="205" width="240" height="45" rx="6" fill="#fee2e2" stroke="#ef4444" stroke-width="1"/>
+  <text x="600" y="223" text-anchor="middle" class="label-bold" fill="#991b1b">Circuit Breaker</text>
+  <text x="600" y="240" text-anchor="middle" class="label-sm">3 failures OR 5 timeouts = open | 60s cooldown</text>
+
+  <!-- Arrow down -->
+  <line x1="600" y1="250" x2="600" y2="268" stroke="#6366f1" stroke-width="2" marker-end="url(#arrow)"/>
+
+  <!-- Retry -->
+  <rect x="480" y="273" width="240" height="45" rx="6" fill="#fef9c3" stroke="#eab308" stroke-width="1"/>
+  <text x="600" y="291" text-anchor="middle" class="label-bold" fill="#854d0e">Retry + Backoff</text>
+  <text x="600" y="308" text-anchor="middle" class="label-sm">max 2 retries | 1s-10s + jitter | abort-aware</text>
+
+  <!-- Arrow down -->
+  <line x1="600" y1="318" x2="600" y2="336" stroke="#6366f1" stroke-width="2" marker-end="url(#arrow)"/>
+
+  <!-- Executor -->
+  <rect x="480" y="341" width="240" height="45" rx="6" fill="#e0e7ff" stroke="#6366f1" stroke-width="1"/>
+  <text x="600" y="359" text-anchor="middle" class="label-bold" fill="#3730a3">Executor (execa)</text>
+  <text x="600" y="376" text-anchor="middle" class="label-sm">stdin >30KB | Windows .cmd | 10MB buffer</text>
+
+  <!-- Side modules -->
+  <rect x="30" y="240" width="130" height="32" rx="6" fill="#f3e8ff" stroke="#a855f7" stroke-width="1"/>
+  <text x="95" y="261" text-anchor="middle" class="label-sm" fill="#7e22ce">error-classifier.ts</text>
+
+  <rect x="30" y="282" width="130" height="32" rx="6" fill="#f3e8ff" stroke="#a855f7" stroke-width="1"/>
+  <text x="95" y="303" text-anchor="middle" class="label-sm" fill="#7e22ce">detection.ts (cache)</text>
+
+  <rect x="30" y="324" width="130" height="32" rx="6" fill="#f3e8ff" stroke="#a855f7" stroke-width="1"/>
+  <text x="95" y="345" text-anchor="middle" class="label-sm" fill="#7e22ce">safe-env.ts</text>
+
+  <rect x="30" y="366" width="130" height="32" rx="6" fill="#f3e8ff" stroke="#a855f7" stroke-width="1"/>
+  <text x="95" y="387" text-anchor="middle" class="label-sm" fill="#7e22ce">redact.ts</text>
+
+  <!-- Connecting lines from side modules -->
+  <line x1="160" y1="256" x2="450" y2="256" stroke="#a855f7" stroke-width="1" stroke-dasharray="3" opacity="0.5"/>
+  <line x1="160" y1="298" x2="450" y2="298" stroke="#a855f7" stroke-width="1" stroke-dasharray="3" opacity="0.5"/>
+  <line x1="160" y1="340" x2="450" y2="363" stroke="#a855f7" stroke-width="1" stroke-dasharray="3" opacity="0.5"/>
+  <line x1="160" y1="382" x2="450" y2="370" stroke="#a855f7" stroke-width="1" stroke-dasharray="3" opacity="0.5"/>
+</svg>
diff --git a/docs/assets/circuit-breaker.svg b/docs/assets/circuit-breaker.svg
new file mode 100644
index 0000000..bf79955
--- /dev/null
+++ b/docs/assets/circuit-breaker.svg
@@ -0,0 +1,63 @@
+<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 620 280" width="620" height="280">
+  <style>
+    text { font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', sans-serif; }
+    .state { font-size: 14px; font-weight: 700; fill: #ffffff; }
+    .desc { font-size: 10px; fill: #ffffff; opacity: 0.9; }
+    .trans { font-size: 10px; fill: #4b5563; }
+    .title { font-size: 15px; font-weight: 700; fill: #1e1b4b; }
+  </style>
+  <defs>
+    <marker id="arr" markerWidth="8" markerHeight="6" refX="8" refY="3" orient="auto">
+      <path d="M0,0 L8,3 L0,6" fill="#6366f1"/>
+    </marker>
+    <marker id="arr-red" markerWidth="8" markerHeight="6" refX="8" refY="3" orient="auto">
+      <path d="M0,0 L8,3 L0,6" fill="#ef4444"/>
+    </marker>
+    <marker id="arr-green" markerWidth="8" markerHeight="6" refX="8" refY="3" orient="auto">
+      <path d="M0,0 L8,3 L0,6" fill="#22c55e"/>
+    </marker>
+  </defs>
+
+  <rect width="620" height="280" rx="12" fill="#fafafe"/>
+  <text x="310" y="30" text-anchor="middle" class="title">Circuit Breaker State Machine</text>
+
+  <!-- CLOSED state -->
+  <rect x="40" y="80" width="150" height="80" rx="40" fill="#22c55e"/>
+  <text x="115" y="115" text-anchor="middle" class="state">CLOSED</text>
+  <text x="115" y="133" text-anchor="middle" class="desc">Requests pass through</text>
+
+  <!-- OPEN state -->
+  <rect x="240" y="80" width="150" height="80" rx="40" fill="#ef4444"/>
+  <text x="315" y="115" text-anchor="middle" class="state">OPEN</text>
+  <text x="315" y="133" text-anchor="middle" class="desc">All requests blocked</text>
+
+  <!-- HALF-OPEN state -->
+  <rect x="440" y="80" width="150" height="80" rx="40" fill="#f59e0b"/>
+  <text x="515" y="115" text-anchor="middle" class="state">HALF-OPEN</text>
+  <text x="515" y="133" text-anchor="middle" class="desc">One probe allowed</text>
+
+  <!-- CLOSED → OPEN -->
+  <path d="M190 105 Q215 70 240 105" fill="none" stroke="#ef4444" stroke-width="2" marker-end="url(#arr-red)"/>
+  <text x="215" y="65" text-anchor="middle" class="trans">3 failures</text>
+  <text x="215" y="77" text-anchor="middle" class="trans">OR 5 timeouts</text>
+
+  <!-- OPEN → HALF-OPEN -->
+  <path d="M390 105 Q415 70 440 105" fill="none" stroke="#6366f1" stroke-width="2" marker-end="url(#arr)"/>
+  <text x="415" y="65" text-anchor="middle" class="trans">60s cooldown</text>
+
+  <!-- HALF-OPEN → CLOSED (success) -->
+  <path d="M440 150 C380 220 200 220 120 160" fill="none" stroke="#22c55e" stroke-width="2" marker-end="url(#arr-green)"/>
+  <text x="280" y="230" text-anchor="middle" class="trans" fill="#22c55e" font-weight="600">1 success</text>
+
+  <!-- HALF-OPEN → OPEN (failure) -->
+  <path d="M440 140 Q415 190 390 140" fill="none" stroke="#ef4444" stroke-width="2" marker-end="url(#arr-red)"/>
+  <text x="415" y="198" text-anchor="middle" class="trans" fill="#ef4444">failure / timeout</text>
+
+  <!-- Legend -->
+  <rect x="40" y="252" width="12" height="12" rx="2" fill="#22c55e"/>
+  <text x="58" y="263" class="trans">Normal operation</text>
+  <rect x="180" y="252" width="12" height="12" rx="2" fill="#ef4444"/>
+  <text x="198" y="263" class="trans">Provider skipped</text>
+  <rect x="320" y="252" width="12" height="12" rx="2" fill="#f59e0b"/>
+  <text x="338" y="263" class="trans">Testing recovery</text>
+</svg>
diff --git a/docs/assets/logo.svg b/docs/assets/logo.svg
new file mode 100644
index 0000000..b3ab13b
--- /dev/null
+++ b/docs/assets/logo.svg
@@ -0,0 +1,31 @@
+<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 120 120" width="120" height="120">
+  <defs>
+    <linearGradient id="bg" x1="0%" y1="0%" x2="100%" y2="100%">
+      <stop offset="0%" style="stop-color:#6366f1"/>
+      <stop offset="100%" style="stop-color:#8b5cf6"/>
+    </linearGradient>
+    <linearGradient id="shield" x1="0%" y1="0%" x2="100%" y2="100%">
+      <stop offset="0%" style="stop-color:#ffffff;stop-opacity:0.95"/>
+      <stop offset="100%" style="stop-color:#e0e7ff;stop-opacity:0.9"/>
+    </linearGradient>
+  </defs>
+  <rect width="120" height="120" rx="24" fill="url(#bg)"/>
+  <!-- Shield shape -->
+  <path d="M60 22 L88 36 L88 62 Q88 82 60 98 Q32 82 32 62 L32 36 Z" fill="url(#shield)" opacity="0.95"/>
+  <!-- Circuit lines -->
+  <line x1="48" y1="48" x2="72" y2="48" stroke="#6366f1" stroke-width="3" stroke-linecap="round"/>
+  <line x1="48" y1="60" x2="72" y2="60" stroke="#8b5cf6" stroke-width="3" stroke-linecap="round"/>
+  <line x1="48" y1="72" x2="72" y2="72" stroke="#a78bfa" stroke-width="3" stroke-linecap="round"/>
+  <!-- Connection dots -->
+  <circle cx="44" cy="48" r="3.5" fill="#6366f1"/>
+  <circle cx="76" cy="48" r="3.5" fill="#6366f1"/>
+  <circle cx="44" cy="60" r="3.5" fill="#8b5cf6"/>
+  <circle cx="76" cy="60" r="3.5" fill="#8b5cf6"/>
+  <circle cx="44" cy="72" r="3.5" fill="#a78bfa"/>
+  <circle cx="76" cy="72" r="3.5" fill="#a78bfa"/>
+  <!-- Vertical connectors -->
+  <line x1="44" y1="48" x2="44" y2="72" stroke="#6366f1" stroke-width="2" opacity="0.4"/>
+  <line x1="76" y1="48" x2="76" y2="72" stroke="#6366f1" stroke-width="2" opacity="0.4"/>
+  <!-- Checkmark -->
+  <path d="M54 58 L59 64 L70 50" fill="none" stroke="#22c55e" stroke-width="3.5" stroke-linecap="round" stroke-linejoin="round"/>
+</svg>
diff --git a/docs/assets/providers.svg b/docs/assets/providers.svg
new file mode 100644
index 0000000..dcce5c9
--- /dev/null
+++ b/docs/assets/providers.svg
@@ -0,0 +1,76 @@
+<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 700 260" width="700" height="260">
+  <style>
+    text { font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', sans-serif; }
+    .name { font-size: 16px; font-weight: 700; fill: #ffffff; }
+    .binary { font-size: 11px; fill: #ffffff; opacity: 0.8; font-family: monospace; }
+    .tag { font-size: 10px; fill: #ffffff; font-weight: 500; }
+    .title { font-size: 15px; font-weight: 700; fill: #1e1b4b; }
+    .fallback { font-size: 9px; fill: #6b7280; }
+  </style>
+
+  <rect width="700" height="260" rx="12" fill="#fafafe"/>
+  <text x="350" y="28" text-anchor="middle" class="title">Supported CLI Providers</text>
+
+  <!-- Claude -->
+  <rect x="20" y="50" width="210" height="190" rx="12" fill="#3b82f6"/>
+  <text x="125" y="82" text-anchor="middle" class="name">Claude</text>
+  <text x="125" y="100" text-anchor="middle" class="binary">$ claude</text>
+
+  <rect x="35" y="115" width="82" height="22" rx="11" fill="rgba(255,255,255,0.2)"/>
+  <text x="76" y="130" text-anchor="middle" class="tag">reasoning</text>
+
+  <rect x="123" y="115" width="92" height="22" rx="11" fill="rgba(255,255,255,0.2)"/>
+  <text x="169" y="130" text-anchor="middle" class="tag">code-analysis</text>
+
+  <rect x="35" y="143" width="82" height="22" rx="11" fill="rgba(255,255,255,0.2)"/>
+  <text x="76" y="158" text-anchor="middle" class="tag">debugging</text>
+
+  <rect x="123" y="143" width="92" height="22" rx="11" fill="rgba(255,255,255,0.2)"/>
+  <text x="169" y="158" text-anchor="middle" class="tag">architecture</text>
+
+  <rect x="35" y="171" width="72" height="22" rx="11" fill="rgba(255,255,255,0.2)"/>
+  <text x="71" y="186" text-anchor="middle" class="tag">planning</text>
+
+  <text x="125" y="215" text-anchor="middle" class="fallback">fallback: gemini &rarr; codex</text>
+
+  <!-- Gemini -->
+  <rect x="245" y="50" width="210" height="190" rx="12" fill="#22c55e"/>
+  <text x="350" y="82" text-anchor="middle" class="name">Gemini</text>
+  <text x="350" y="100" text-anchor="middle" class="binary">$ gemini</text>
+
+  <rect x="260" y="115" width="76" height="22" rx="11" fill="rgba(255,255,255,0.2)"/>
+  <text x="298" y="130" text-anchor="middle" class="tag">research</text>
+
+  <rect x="342" y="115" width="98" height="22" rx="11" fill="rgba(255,255,255,0.2)"/>
+  <text x="391" y="130" text-anchor="middle" class="tag">large-context</text>
+
+  <rect x="260" y="143" width="64" height="22" rx="11" fill="rgba(255,255,255,0.2)"/>
+  <text x="292" y="158" text-anchor="middle" class="tag">trends</text>
+
+  <rect x="330" y="143" width="86" height="22" rx="11" fill="rgba(255,255,255,0.2)"/>
+  <text x="373" y="158" text-anchor="middle" class="tag">knowledge</text>
+
+  <rect x="260" y="171" width="86" height="22" rx="11" fill="rgba(255,255,255,0.2)"/>
+  <text x="303" y="186" text-anchor="middle" class="tag">web-search</text>
+
+  <text x="350" y="215" text-anchor="middle" class="fallback">fallback: claude &rarr; codex</text>
+
+  <!-- Codex -->
+  <rect x="470" y="50" width="210" height="190" rx="12" fill="#ec4899"/>
+  <text x="575" y="82" text-anchor="middle" class="name">Codex</text>
+  <text x="575" y="100" text-anchor="middle" class="binary">$ codex</text>
+
+  <rect x="485" y="115" width="104" height="22" rx="11" fill="rgba(255,255,255,0.2)"/>
+  <text x="537" y="130" text-anchor="middle" class="tag">code-generation</text>
+
+  <rect x="595" y="115" width="70" height="22" rx="11" fill="rgba(255,255,255,0.2)"/>
+  <text x="630" y="130" text-anchor="middle" class="tag">full-auto</text>
+
+  <rect x="485" y="143" width="56" height="22" rx="11" fill="rgba(255,255,255,0.2)"/>
+  <text x="513" y="158" text-anchor="middle" class="tag">edits</text>
+
+  <rect x="547" y="143" width="86" height="22" rx="11" fill="rgba(255,255,255,0.2)"/>
+  <text x="590" y="158" text-anchor="middle" class="tag">refactoring</text>
+
+  <text x="575" y="215" text-anchor="middle" class="fallback">fallback: claude &rarr; gemini</text>
+</svg>
diff --git a/docs/assets/resilience-pipeline.svg b/docs/assets/resilience-pipeline.svg
new file mode 100644
index 0000000..4cb87e4
--- /dev/null
+++ b/docs/assets/resilience-pipeline.svg
@@ -0,0 +1,131 @@
+<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 780 380" width="780" height="380">
+  <style>
+    text { font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', sans-serif; }
+    .title { font-size: 15px; font-weight: 700; fill: #1e1b4b; }
+    .label { font-size: 11px; fill: #4b5563; }
+    .label-sm { font-size: 10px; fill: #6b7280; }
+    .label-w { font-size: 11px; fill: #ffffff; font-weight: 600; }
+    .step-num { font-size: 20px; font-weight: 700; fill: #ffffff; }
+  </style>
+  <defs>
+    <marker id="a1" markerWidth="8" markerHeight="6" refX="8" refY="3" orient="auto">
+      <path d="M0,0 L8,3 L0,6" fill="#6366f1"/>
+    </marker>
+    <marker id="a2" markerWidth="8" markerHeight="6" refX="8" refY="3" orient="auto">
+      <path d="M0,0 L8,3 L0,6" fill="#ef4444"/>
+    </marker>
+  </defs>
+
+  <rect width="780" height="380" rx="12" fill="#fafafe"/>
+  <text x="390" y="30" text-anchor="middle" class="title">Resilience Pipeline — Request Flow</text>
+
+  <!-- Step 1: Request -->
+  <circle cx="60" cy="90" r="24" fill="#6366f1"/>
+  <text x="60" y="97" text-anchor="middle" class="step-num">1</text>
+  <text x="60" y="132" text-anchor="middle" class="label" font-weight="600">Request</text>
+  <text x="60" y="146" text-anchor="middle" class="label-sm">cli_exec()</text>
+
+  <line x1="84" y1="90" x2="130" y2="90" stroke="#6366f1" stroke-width="2" marker-end="url(#a1)"/>
+
+  <!-- Step 2: Budget Check -->
+  <rect x="140" y="65" width="120" height="50" rx="8" fill="#fef3c7" stroke="#f59e0b" stroke-width="1.5"/>
+  <circle cx="155" cy="78" r="10" fill="#f59e0b"/>
+  <text x="155" y="82" text-anchor="middle" class="step-num" font-size="12">2</text>
+  <text x="210" y="86" text-anchor="middle" class="label" font-weight="600">Budget OK?</text>
+  <text x="210" y="102" text-anchor="middle" class="label-sm">remaining &gt; 0</text>
+
+  <line x1="260" y1="90" x2="300" y2="90" stroke="#6366f1" stroke-width="2" marker-end="url(#a1)"/>
+
+  <!-- Step 3: Circuit Breaker -->
+  <rect x="310" y="65" width="130" height="50" rx="8" fill="#fee2e2" stroke="#ef4444" stroke-width="1.5"/>
+  <circle cx="325" cy="78" r="10" fill="#ef4444"/>
+  <text x="325" y="82" text-anchor="middle" class="step-num" font-size="12">3</text>
+  <text x="385" y="86" text-anchor="middle" class="label" font-weight="600">Breaker Closed?</text>
+  <text x="385" y="102" text-anchor="middle" class="label-sm">check state</text>
+
+  <line x1="440" y1="90" x2="480" y2="90" stroke="#6366f1" stroke-width="2" marker-end="url(#a1)"/>
+
+  <!-- Step 4: Execute -->
+  <rect x="490" y="65" width="120" height="50" rx="8" fill="#e0e7ff" stroke="#6366f1" stroke-width="1.5"/>
+  <circle cx="505" cy="78" r="10" fill="#6366f1"/>
+  <text x="505" y="82" text-anchor="middle" class="step-num" font-size="12">4</text>
+  <text x="560" y="86" text-anchor="middle" class="label" font-weight="600">Execute CLI</text>
+  <text x="560" y="102" text-anchor="middle" class="label-sm">execa + timeout</text>
+
+  <line x1="610" y1="90" x2="650" y2="90" stroke="#6366f1" stroke-width="2" marker-end="url(#a1)"/>
+
+  <!-- Step 5: Success -->
+  <rect x="660" y="65" width="100" height="50" rx="25" fill="#22c55e"/>
+  <text x="710" y="95" text-anchor="middle" class="label-w">Success!</text>
+
+  <!-- Failure paths -->
+
+  <!-- Budget exhausted -->
+  <line x1="200" y1="115" x2="200" y2="175" stroke="#ef4444" stroke-width="1.5" marker-end="url(#a2)"/>
+  <rect x="140" y="180" width="120" height="36" rx="6" fill="#fef2f2" stroke="#ef4444" stroke-width="1"/>
+  <text x="200" y="203" text-anchor="middle" class="label-sm" fill="#ef4444" font-weight="600">All providers failed</text>
+
+  <!-- Breaker open → skip to fallback -->
+  <line x1="375" y1="115" x2="375" y2="175" stroke="#ef4444" stroke-width="1.5" marker-end="url(#a2)"/>
+  <rect x="310" y="180" width="130" height="36" rx="6" fill="#fef2f2" stroke="#ef4444" stroke-width="1"/>
+  <text x="375" y="203" text-anchor="middle" class="label-sm" fill="#ef4444" font-weight="600">Skip to fallback</text>
+
+  <!-- Execute failure paths -->
+  <line x1="550" y1="115" x2="550" y2="165" stroke="#ef4444" stroke-width="1.5"/>
+
+  <!-- Branch: timeout -->
+  <rect x="460" y="175" width="100" height="36" rx="6" fill="#fef9c3" stroke="#eab308" stroke-width="1"/>
+  <text x="510" y="198" text-anchor="middle" class="label-sm" fill="#854d0e" font-weight="600">Timeout?</text>
+  <line x1="550" y1="165" x2="510" y2="175" stroke="#ef4444" stroke-width="1.5" marker-end="url(#a2)"/>
+
+  <!-- Branch: classify -->
+  <rect x="580" y="175" width="120" height="36" rx="6" fill="#f3e8ff" stroke="#a855f7" stroke-width="1"/>
+  <text x="640" y="198" text-anchor="middle" class="label-sm" fill="#7e22ce" font-weight="600">Classify Error</text>
+  <line x1="550" y1="165" x2="620" y2="175" stroke="#ef4444" stroke-width="1.5" marker-end="url(#a2)"/>
+
+  <!-- Timeout → skip retries -->
+  <line x1="510" y1="211" x2="510" y2="240" stroke="#eab308" stroke-width="1.5" marker-end="url(#a2)"/>
+  <rect x="450" y="245" width="120" height="30" rx="6" fill="#fef3c7" stroke="#eab308" stroke-width="1"/>
+  <text x="510" y="265" text-anchor="middle" class="label-sm" fill="#854d0e">Skip retries</text>
+
+  <!-- Classify branches -->
+  <line x1="640" y1="211" x2="640" y2="230" stroke="#a855f7" stroke-width="1.5"/>
+
+  <!-- Transient -->
+  <rect x="580" y="240" width="80" height="28" rx="6" fill="#dcfce7" stroke="#22c55e" stroke-width="1"/>
+  <text x="620" y="259" text-anchor="middle" class="label-sm" fill="#166534">Transient</text>
+  <line x1="640" y1="230" x2="620" y2="240" stroke="#a855f7" stroke-width="1"/>
+
+  <!-- Permanent -->
+  <rect x="670" y="240" width="80" height="28" rx="6" fill="#fee2e2" stroke="#ef4444" stroke-width="1"/>
+  <text x="710" y="259" text-anchor="middle" class="label-sm" fill="#991b1b">Permanent</text>
+  <line x1="640" y1="230" x2="710" y2="240" stroke="#a855f7" stroke-width="1"/>
+
+  <!-- Transient → retry -->
+  <line x1="620" y1="268" x2="620" y2="295" stroke="#22c55e" stroke-width="1.5" marker-end="url(#a1)"/>
+  <rect x="570" y="300" width="100" height="28" rx="6" fill="#e0e7ff" stroke="#6366f1" stroke-width="1"/>
+  <text x="620" y="319" text-anchor="middle" class="label-sm" fill="#3730a3" font-weight="600">Retry (backoff)</text>
+
+  <!-- Permanent → fallback -->
+  <line x1="710" y1="268" x2="710" y2="295" stroke="#ef4444" stroke-width="1.5" marker-end="url(#a2)"/>
+  <rect x="660" y="300" width="100" height="28" rx="6" fill="#fef2f2" stroke="#ef4444" stroke-width="1"/>
+  <text x="710" y="319" text-anchor="middle" class="label-sm" fill="#ef4444" font-weight="600">Next provider</text>
+
+  <!-- Timeout → fallback too -->
+  <line x1="510" y1="275" x2="510" y2="300" stroke="#eab308" stroke-width="1.5"/>
+  <path d="M510 300 Q510 340 660 312" fill="none" stroke="#eab308" stroke-width="1.5" marker-end="url(#a2)"/>
+
+  <!-- Retry loops back -->
+  <path d="M570 314 Q490 314 490 100 Q490 90 490 90" fill="none" stroke="#6366f1" stroke-width="1.5" stroke-dasharray="4" marker-end="url(#a1)"/>
+  <text x="472" y="210" text-anchor="middle" class="label-sm" fill="#6366f1" transform="rotate(-90,472,210)">retry loop</text>
+
+  <!-- Legend -->
+  <rect x="30" y="345" width="12" height="12" rx="2" fill="#22c55e"/>
+  <text x="48" y="356" class="label-sm">Success path</text>
+  <rect x="140" y="345" width="12" height="12" rx="2" fill="#ef4444"/>
+  <text x="158" y="356" class="label-sm">Failure / fallback</text>
+  <rect x="260" y="345" width="12" height="12" rx="2" fill="#eab308"/>
+  <text x="278" y="356" class="label-sm">Timeout (skip retries)</text>
+  <rect x="410" y="345" width="12" height="12" rx="2" fill="#6366f1"/>
+  <text x="428" y="356" class="label-sm">Retry with backoff</text>
+</svg>
diff --git a/docs/assets/routing.svg b/docs/assets/routing.svg
new file mode 100644
index 0000000..bf520ef
--- /dev/null
+++ b/docs/assets/routing.svg
@@ -0,0 +1,108 @@
+<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 620 340" width="620" height="340">
+  <style>
+    text { font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', sans-serif; }
+    .title { font-size: 15px; font-weight: 700; fill: #1e1b4b; }
+    .role { font-size: 12px; font-weight: 600; fill: #1e1b4b; }
+    .provider { font-size: 11px; font-weight: 600; }
+    .reason { font-size: 9px; fill: #6b7280; }
+  </style>
+  <defs>
+    <marker id="ar" markerWidth="7" markerHeight="5" refX="7" refY="2.5" orient="auto">
+      <path d="M0,0 L7,2.5 L0,5" fill="#6366f1"/>
+    </marker>
+    <marker id="ar-dim" markerWidth="7" markerHeight="5" refX="7" refY="2.5" orient="auto">
+      <path d="M0,0 L7,2.5 L0,5" fill="#d1d5db"/>
+    </marker>
+  </defs>
+
+  <rect width="620" height="340" rx="12" fill="#fafafe"/>
+  <text x="310" y="28" text-anchor="middle" class="title">Role-Based CLI Routing</text>
+
+  <!-- Role column -->
+  <text x="80" y="58" text-anchor="middle" class="role" fill="#6366f1">Agent Role</text>
+
+  <!-- Provider columns -->
+  <text x="250" y="58" text-anchor="middle" class="role" fill="#6366f1">Primary</text>
+  <text x="400" y="58" text-anchor="middle" class="role" fill="#9ca3af">Fallback 1</text>
+  <text x="540" y="58" text-anchor="middle" class="role" fill="#d1d5db">Fallback 2</text>
+
+  <!-- Row backgrounds -->
+  <rect x="10" y="68" width="600" height="42" rx="4" fill="#f8fafc"/>
+  <rect x="10" y="110" width="600" height="42" rx="4" fill="#ffffff"/>
+  <rect x="10" y="152" width="600" height="42" rx="4" fill="#f8fafc"/>
+  <rect x="10" y="194" width="600" height="42" rx="4" fill="#ffffff"/>
+  <rect x="10" y="236" width="600" height="42" rx="4" fill="#f8fafc"/>
+  <rect x="10" y="278" width="600" height="42" rx="4" fill="#ffffff"/>
+
+  <!-- Manager -->
+  <rect x="25" y="76" width="110" height="26" rx="6" fill="#eef2ff" stroke="#6366f1" stroke-width="1"/>
+  <text x="80" y="94" text-anchor="middle" class="role">Manager</text>
+  <rect x="200" y="76" width="100" height="26" rx="13" fill="#dcfce7" stroke="#22c55e" stroke-width="1.2"/>
+  <text x="250" y="94" text-anchor="middle" class="provider" fill="#166534">Gemini</text>
+  <line x1="300" y1="89" x2="340" y2="89" stroke="#d1d5db" stroke-width="1.5" marker-end="url(#ar-dim)"/>
+  <rect x="350" y="76" width="100" height="26" rx="13" fill="#f3f4f6" stroke="#d1d5db" stroke-width="1"/>
+  <text x="400" y="94" text-anchor="middle" class="provider" fill="#9ca3af">Claude</text>
+  <line x1="450" y1="89" x2="480" y2="89" stroke="#e5e7eb" stroke-width="1.5" marker-end="url(#ar-dim)"/>
+  <rect x="490" y="76" width="100" height="26" rx="13" fill="#fafafa" stroke="#e5e7eb" stroke-width="1"/>
+  <text x="540" y="94" text-anchor="middle" class="provider" fill="#d1d5db">Codex</text>
+
+  <!-- Coordinator -->
+  <rect x="25" y="118" width="110" height="26" rx="6" fill="#eef2ff" stroke="#6366f1" stroke-width="1"/>
+  <text x="80" y="136" text-anchor="middle" class="role">Coordinator</text>
+  <rect x="200" y="118" width="100" height="26" rx="13" fill="#dbeafe" stroke="#3b82f6" stroke-width="1.2"/>
+  <text x="250" y="136" text-anchor="middle" class="provider" fill="#1e40af">Claude</text>
+  <line x1="300" y1="131" x2="340" y2="131" stroke="#d1d5db" stroke-width="1.5" marker-end="url(#ar-dim)"/>
+  <rect x="350" y="118" width="100" height="26" rx="13" fill="#f3f4f6" stroke="#d1d5db" stroke-width="1"/>
+  <text x="400" y="136" text-anchor="middle" class="provider" fill="#9ca3af">Gemini</text>
+  <line x1="450" y1="131" x2="480" y2="131" stroke="#e5e7eb" stroke-width="1.5" marker-end="url(#ar-dim)"/>
+  <rect x="490" y="118" width="100" height="26" rx="13" fill="#fafafa" stroke="#e5e7eb" stroke-width="1"/>
+  <text x="540" y="136" text-anchor="middle" class="provider" fill="#d1d5db">Codex</text>
+
+  <!-- Developer -->
+  <rect x="25" y="160" width="110" height="26" rx="6" fill="#eef2ff" stroke="#6366f1" stroke-width="1"/>
+  <text x="80" y="178" text-anchor="middle" class="role">Developer</text>
+  <rect x="200" y="160" width="100" height="26" rx="13" fill="#fce7f3" stroke="#ec4899" stroke-width="1.2"/>
+  <text x="250" y="178" text-anchor="middle" class="provider" fill="#9d174d">Codex</text>
+  <line x1="300" y1="173" x2="340" y2="173" stroke="#d1d5db" stroke-width="1.5" marker-end="url(#ar-dim)"/>
+  <rect x="350" y="160" width="100" height="26" rx="13" fill="#f3f4f6" stroke="#d1d5db" stroke-width="1"/>
+  <text x="400" y="178" text-anchor="middle" class="provider" fill="#9ca3af">Claude</text>
+  <line x1="450" y1="173" x2="480" y2="173" stroke="#e5e7eb" stroke-width="1.5" marker-end="url(#ar-dim)"/>
+  <rect x="490" y="160" width="100" height="26" rx="13" fill="#fafafa" stroke="#e5e7eb" stroke-width="1"/>
+  <text x="540" y="178" text-anchor="middle" class="provider" fill="#d1d5db">Gemini</text>
+
+  <!-- Researcher -->
+  <rect x="25" y="202" width="110" height="26" rx="6" fill="#eef2ff" stroke="#6366f1" stroke-width="1"/>
+  <text x="80" y="220" text-anchor="middle" class="role">Researcher</text>
+  <rect x="200" y="202" width="100" height="26" rx="13" fill="#dcfce7" stroke="#22c55e" stroke-width="1.2"/>
+  <text x="250" y="220" text-anchor="middle" class="provider" fill="#166534">Gemini</text>
+  <line x1="300" y1="215" x2="340" y2="215" stroke="#d1d5db" stroke-width="1.5" marker-end="url(#ar-dim)"/>
+  <rect x="350" y="202" width="100" height="26" rx="13" fill="#f3f4f6" stroke="#d1d5db" stroke-width="1"/>
+  <text x="400" y="220" text-anchor="middle" class="provider" fill="#9ca3af">Claude</text>
+  <line x1="450" y1="215" x2="480" y2="215" stroke="#e5e7eb" stroke-width="1.5" marker-end="url(#ar-dim)"/>
+  <rect x="490" y="202" width="100" height="26" rx="13" fill="#fafafa" stroke="#e5e7eb" stroke-width="1"/>
+  <text x="540" y="220" text-anchor="middle" class="provider" fill="#d1d5db">Codex</text>
+
+  <!-- Reviewer -->
+  <rect x="25" y="244" width="110" height="26" rx="6" fill="#eef2ff" stroke="#6366f1" stroke-width="1"/>
+  <text x="80" y="262" text-anchor="middle" class="role">Reviewer</text>
+  <rect x="200" y="244" width="100" height="26" rx="13" fill="#dbeafe" stroke="#3b82f6" stroke-width="1.2"/>
+  <text x="250" y="262" text-anchor="middle" class="provider" fill="#1e40af">Claude</text>
+  <line x1="300" y1="257" x2="340" y2="257" stroke="#d1d5db" stroke-width="1.5" marker-end="url(#ar-dim)"/>
+  <rect x="350" y="244" width="100" height="26" rx="13" fill="#f3f4f6" stroke="#d1d5db" stroke-width="1"/>
+  <text x="400" y="262" text-anchor="middle" class="provider" fill="#9ca3af">Gemini</text>
+  <line x1="450" y1="257" x2="480" y2="257" stroke="#e5e7eb" stroke-width="1.5" marker-end="url(#ar-dim)"/>
+  <rect x="490" y="244" width="100" height="26" rx="13" fill="#fafafa" stroke="#e5e7eb" stroke-width="1"/>
+  <text x="540" y="262" text-anchor="middle" class="provider" fill="#d1d5db">Codex</text>
+
+  <!-- Architect -->
+  <rect x="25" y="286" width="110" height="26" rx="6" fill="#eef2ff" stroke="#6366f1" stroke-width="1"/>
+  <text x="80" y="304" text-anchor="middle" class="role">Architect</text>
+  <rect x="200" y="286" width="100" height="26" rx="13" fill="#dbeafe" stroke="#3b82f6" stroke-width="1.2"/>
+  <text x="250" y="304" text-anchor="middle" class="provider" fill="#1e40af">Claude</text>
+  <line x1="300" y1="299" x2="340" y2="299" stroke="#d1d5db" stroke-width="1.5" marker-end="url(#ar-dim)"/>
+  <rect x="350" y="286" width="100" height="26" rx="13" fill="#f3f4f6" stroke="#d1d5db" stroke-width="1"/>
+  <text x="400" y="304" text-anchor="middle" class="provider" fill="#9ca3af">Gemini</text>
+  <line x1="450" y1="299" x2="480" y2="299" stroke="#e5e7eb" stroke-width="1.5" marker-end="url(#ar-dim)"/>
+  <rect x="490" y="286" width="100" height="26" rx="13" fill="#fafafa" stroke="#e5e7eb" stroke-width="1"/>
+  <text x="540" y="304" text-anchor="middle" class="provider" fill="#d1d5db">Codex</text>
+</svg>
diff --git a/package.json b/package.json
index 5dc8504..e05e0b7 100644
--- a/package.json
+++ b/package.json
@@ -32,8 +32,8 @@
   ],
   "dependencies": {
     "@opencode-ai/plugin": "^1.2.26",
-    "cockatiel": "^3.2.1",
-    "execa": "^9.6.1"
+    "execa": "^9.6.1",
+    "zod": "^3.23.0"
   },
   "devDependencies": {
     "@types/bun": "latest",
diff --git a/src/circuit-breaker.ts b/src/circuit-breaker.ts
index a56488b..e1e267d 100644
--- a/src/circuit-breaker.ts
+++ b/src/circuit-breaker.ts
@@ -3,14 +3,14 @@
  *
  * States:
  *   closed    → normal operation, requests pass through
- *   open      → too many failures, requests are blocked
+ *   open      → too many failures OR too many timeouts, requests are blocked
  *   half-open → cooldown elapsed, one probe request allowed
  *
  * Transitions:
- *   closed  →(N failures)→  open
- *   open    →(cooldown)→    half-open
- *   half-open →(success)→   closed
- *   half-open →(failure)→   open
+ *   closed  →(N failures OR M timeouts)→  open
+ *   open    →(cooldown)→                  half-open
+ *   half-open →(success)→                 closed
+ *   half-open →(failure/timeout)→         open
  */
 
 export type CircuitState = "closed" | "open" | "half-open"
@@ -18,15 +18,21 @@ export type CircuitState = "closed" | "open" | "half-open"
 export interface CircuitBreaker {
   state: CircuitState
   failures: number
+  timeouts: number
   successes: number
   lastFailure: number | null
   lastSuccess: number | null
   openedAt: number | null
+  totalExecutions: number
+  totalFailures: number
+  totalTimeouts: number
 }
 
 export interface BreakerConfig {
   /** Consecutive failures before opening the circuit */
   failureThreshold: number
+  /** Consecutive timeouts before opening (higher than failures: slow ≠ broken) */
+  timeoutThreshold: number
   /** Ms to wait before transitioning from open → half-open */
   cooldownMs: number
   /** Successes in half-open needed to close the circuit */
@@ -35,6 +41,7 @@ export interface BreakerConfig {
 
 export const DEFAULT_BREAKER_CONFIG: BreakerConfig = {
   failureThreshold: 3,
+  timeoutThreshold: 5,
   cooldownMs: 60_000,
   halfOpenSuccessThreshold: 1,
 }
@@ -43,10 +50,14 @@ export function createBreaker(): CircuitBreaker {
   return {
     state: "closed",
     failures: 0,
+    timeouts: 0,
     successes: 0,
     lastFailure: null,
     lastSuccess: null,
     openedAt: null,
+    totalExecutions: 0,
+    totalFailures: 0,
+    totalTimeouts: 0,
   }
 }
 
@@ -75,8 +86,10 @@ export function recordSuccess(
   config: BreakerConfig = DEFAULT_BREAKER_CONFIG,
   now: number = Date.now(),
 ): void {
+  breaker.totalExecutions++
   breaker.lastSuccess = now
   breaker.failures = 0
+  breaker.timeouts = 0
 
   if (breaker.state === "half-open") {
     breaker.successes++
@@ -93,6 +106,8 @@ export function recordFailure(
   config: BreakerConfig = DEFAULT_BREAKER_CONFIG,
   now: number = Date.now(),
 ): void {
+  breaker.totalExecutions++
+  breaker.totalFailures++
   breaker.failures++
   breaker.lastFailure = now
 
@@ -107,3 +122,25 @@ export function recordFailure(
     breaker.openedAt = now
   }
 }
+
+export function recordTimeout(
+  breaker: CircuitBreaker,
+  config: BreakerConfig = DEFAULT_BREAKER_CONFIG,
+  now: number = Date.now(),
+): void {
+  breaker.totalExecutions++
+  breaker.totalTimeouts++
+  breaker.timeouts++
+  breaker.lastFailure = now
+
+  if (breaker.state === "half-open") {
+    breaker.state = "open"
+    breaker.openedAt = now
+    return
+  }
+
+  if (breaker.timeouts >= config.timeoutThreshold) {
+    breaker.state = "open"
+    breaker.openedAt = now
+  }
+}
diff --git a/src/cli-defs.ts b/src/cli-defs.ts
index c1ac716..128e5dd 100644
--- a/src/cli-defs.ts
+++ b/src/cli-defs.ts
@@ -20,22 +20,22 @@ export const CLI_DEFS: Record<CliName, CliDef> = {
   claude: {
     name: "claude",
     description: "Anthropic Claude — strong reasoning, code analysis, complex logic",
-    strengths: ["reasoning", "code-analysis", "debugging", "architecture"],
+    strengths: ["reasoning", "code-analysis", "debugging", "architecture", "planning"],
     binary: "claude",
     buildArgs: (prompt, mode) =>
       mode === "analyze"
-        ? ["-p", prompt, "--max-turns", "10"]
+        ? ["-p", prompt]
         : ["-p", prompt, "--allowedTools", ""],
     buildStdinArgs: (mode) =>
       mode === "analyze"
-        ? ["-p", "-", "--max-turns", "10"]
+        ? ["-p", "-"]
         : ["-p", "-", "--allowedTools", ""],
     fallbackOrder: ["gemini", "codex"],
   },
   gemini: {
     name: "gemini",
     description: "Google Gemini — research, trends, broad knowledge, large context",
-    strengths: ["research", "trends", "knowledge", "large-context"],
+    strengths: ["research", "trends", "knowledge", "large-context", "web-search"],
     binary: "gemini",
     buildArgs: (prompt, _mode) => ["-e", "none", "-p", prompt],
     buildStdinArgs: (_mode) => ["-e", "none"],
@@ -44,7 +44,7 @@ export const CLI_DEFS: Record<CliName, CliDef> = {
   codex: {
     name: "codex",
     description: "OpenAI Codex — code generation, edits, refactoring",
-    strengths: ["code-generation", "edits", "refactoring"],
+    strengths: ["code-generation", "edits", "refactoring", "full-auto"],
     binary: "codex",
     buildArgs: (prompt, _mode) => ["exec", prompt, "--full-auto"],
     buildStdinArgs: (_mode) => ["exec", "-", "--full-auto"],
@@ -53,3 +53,38 @@ export const CLI_DEFS: Record<CliName, CliDef> = {
 }
 
 export const ALL_CLI_NAMES: CliName[] = ["claude", "gemini", "codex"]
+
+/**
+ * Generate CLI-specific args that hint at timeout constraints.
+ * Claude: --max-turns scales with available time (~1 turn per 30s).
+ * Gemini/Codex: no known timeout flags.
+ */
+export function buildTimeoutArgs(
+  provider: CliName,
+  remainingSeconds: number,
+): string[] {
+  switch (provider) {
+    case "claude": {
+      const maxTurns = Math.max(2, Math.min(25, Math.floor(remainingSeconds / 30)))
+      return ["--max-turns", String(maxTurns)]
+    }
+    case "gemini":
+      return []
+    case "codex":
+      return []
+  }
+}
+
+// ── Role-based routing ──────────────────────────────────────────────────────
+
+export const AGENT_ROLES = ["manager", "coordinator", "developer", "researcher", "reviewer", "architect"] as const
+export type AgentRole = (typeof AGENT_ROLES)[number]
+
+export const ROLE_ROUTING: Record<AgentRole, { primary: CliName; fallbacks: CliName[] }> = {
+  manager: { primary: "gemini", fallbacks: ["claude", "codex"] },
+  coordinator: { primary: "claude", fallbacks: ["gemini", "codex"] },
+  developer: { primary: "codex", fallbacks: ["claude", "gemini"] },
+  researcher: { primary: "gemini", fallbacks: ["claude", "codex"] },
+  reviewer: { primary: "claude", fallbacks: ["gemini", "codex"] },
+  architect: { primary: "claude", fallbacks: ["gemini", "codex"] },
+}
diff --git a/src/detection.ts b/src/detection.ts
index fff225b..0c453b1 100644
--- a/src/detection.ts
+++ b/src/detection.ts
@@ -1,11 +1,14 @@
 /**
  * CLI Auto-Detection — probes the system for installed CLI binaries.
+ * Caches results for 5 minutes to avoid repeated filesystem lookups.
  */
 
 import { execa } from "execa"
 import { IS_WINDOWS } from "./platform"
 import { CLI_DEFS, ALL_CLI_NAMES, type CliName } from "./cli-defs"
 
+const CACHE_TTL_MS = 5 * 60 * 1000 // 5 minutes
+
 export interface CliAvailability {
   installed: boolean
   path: string | null
@@ -13,25 +16,43 @@ export interface CliAvailability {
   checkedAt: number
 }
 
+interface CacheEntry {
+  result: CliAvailability
+  timestamp: number
+}
+
+const cache = new Map<CliName, CacheEntry>()
+
+function isCacheValid(entry: CacheEntry): boolean {
+  return Date.now() - entry.timestamp < CACHE_TTL_MS
+}
+
 export async function detectCli(name: CliName): Promise<CliAvailability> {
+  const cached = cache.get(name)
+  if (cached && isCacheValid(cached)) return cached.result
+
   const def = CLI_DEFS[name]
   const whichBin = IS_WINDOWS ? "where" : "which"
 
   try {
-    const { stdout } = await execa(whichBin, [def.binary], { timeout: 5_000 })
-    const path = stdout.trim().split("\n")[0] ?? null
+    const { stdout } = await execa(whichBin, [def.binary], { timeout: 5_000, windowsHide: true })
+    const path = stdout.trim().split(/\r?\n/)[0] ?? null
 
     let version: string | null = null
     try {
-      const vResult = await execa(def.binary, ["--version"], { timeout: 5_000 })
-      version = vResult.stdout.trim().split("\n")[0] ?? null
+      const vResult = await execa(def.binary, ["--version"], { timeout: 5_000, windowsHide: true })
+      version = vResult.stdout.trim().split(/\r?\n/)[0] ?? null
     } catch {
       // version check is best-effort
     }
 
-    return { installed: true, path, version, checkedAt: Date.now() }
+    const result: CliAvailability = { installed: true, path, version, checkedAt: Date.now() }
+    cache.set(name, { result, timestamp: Date.now() })
+    return result
   } catch {
-    return { installed: false, path: null, version: null, checkedAt: Date.now() }
+    const result: CliAvailability = { installed: false, path: null, version: null, checkedAt: Date.now() }
+    cache.set(name, { result, timestamp: Date.now() })
+    return result
   }
 }
 
@@ -47,3 +68,7 @@ export async function detectAllClis(): Promise<Map<CliName, CliAvailability>> {
   }
   return results
 }
+
+export function getDetectionCache(): Map<CliName, CliAvailability> {
+  return new Map([...cache].map(([k, v]) => [k, v.result]))
+}
diff --git a/src/executor.ts b/src/executor.ts
index b8f886b..7abcfa6 100644
--- a/src/executor.ts
+++ b/src/executor.ts
@@ -1,66 +1,114 @@
 /**
  * Core Execution Engine — runs a CLI binary via execa with structured output.
+ * Returns structured results (never throws) to support global time budget.
  */
 
 import { execa } from "execa"
+import path from "node:path"
+import os from "node:os"
 import type { CliDef } from "./cli-defs"
+import { buildTimeoutArgs } from "./cli-defs"
 import { getSafeEnv } from "./safe-env"
 import { redactSecrets } from "./redact"
+import { IS_WINDOWS } from "./platform"
 
 /** Prompts longer than this (chars) are delivered via stdin to avoid OS arg-length limits. */
 export const STDIN_THRESHOLD = 30_000
+const MAX_BUFFER = 10 * 1024 * 1024 // 10MB
 
 export interface ExecResult {
   stdout: string
   stderr: string
+  exitCode: number
   durationMs: number
+  timedOut: boolean
+}
+
+/**
+ * On Windows, CLIs installed via npm/scoop/cargo may be .cmd/.bat shims.
+ * Wrap with `cmd /c` so execa can execute them without a shell.
+ */
+function resolveCommand(binary: string): { file: string; prefix: string[] } {
+  if (!IS_WINDOWS) return { file: binary, prefix: [] }
+
+  const ext = path.extname(binary).toLowerCase()
+  if (ext === ".cmd" || ext === ".bat") {
+    return { file: "cmd", prefix: ["/c", binary] }
+  }
+
+  const pathext = (process.env.PATHEXT || "").toLowerCase()
+  if (pathext.includes(".cmd") || pathext.includes(".bat")) {
+    return { file: "cmd", prefix: ["/c", binary] }
+  }
+
+  return { file: binary, prefix: [] }
+}
+
+/** Enhance PATH on Windows with common CLI install locations */
+function getEnhancedPath(): string | undefined {
+  if (!IS_WINDOWS) return undefined
+
+  const home = os.homedir()
+  const extraPaths = [
+    path.join(home, "AppData", "Roaming", "npm"),
+    path.join(home, "scoop", "shims"),
+    path.join(home, ".cargo", "bin"),
+    path.join(home, "AppData", "Local", "pnpm"),
+  ]
+
+  const currentPath = process.env.PATH || ""
+  return [...extraPaths, currentPath].join(path.delimiter)
 }
 
 export async function executeCliOnce(
   def: CliDef,
   prompt: string,
   mode: string,
-  timeoutMs: number,
+  timeoutSeconds: number,
   signal?: AbortSignal,
 ): Promise<ExecResult> {
   const useStdin = def.buildStdinArgs != null && prompt.length > STDIN_THRESHOLD
-  const args = useStdin ? def.buildStdinArgs!(mode) : def.buildArgs(prompt, mode)
-  const start = Date.now()
-
-  const result = await execa(def.binary, args, {
-    timeout: timeoutMs,
-    maxBuffer: 10 * 1024 * 1024,
-    reject: false,
-    windowsHide: true,
-    env: getSafeEnv(),
-    ...(useStdin ? { input: prompt } : {}),
-    ...(signal ? { cancelSignal: signal } : {}),
-  })
+  const baseArgs = useStdin ? def.buildStdinArgs!(mode) : def.buildArgs(prompt, mode)
+  const timeoutHints = buildTimeoutArgs(def.name, timeoutSeconds)
+  const args = [...baseArgs, ...timeoutHints]
 
-  const durationMs = Date.now() - start
+  const { file, prefix } = resolveCommand(def.binary)
+  const finalArgs = [...prefix, ...args]
 
-  if (result.isCanceled) {
-    throw Object.assign(new Error(`CLI '${def.name}' was canceled`), { canceled: true })
+  const env = getSafeEnv()
+  const enhancedPath = getEnhancedPath()
+  if (enhancedPath) {
+    env.PATH = enhancedPath
   }
 
-  if (result.timedOut) {
-    throw Object.assign(new Error(`CLI '${def.name}' timed out after ${timeoutMs}ms`), {
-      timedOut: true,
-    })
-  }
+  const start = Date.now()
 
-  if (result.failed && result.exitCode !== 0) {
-    const rawMsg = result.stderr?.trim() || result.message || `Exit code ${result.exitCode}`
-    const msg = redactSecrets(rawMsg)
-    throw Object.assign(new Error(`CLI '${def.name}' failed: ${msg}`), {
-      exitCode: result.exitCode,
+  try {
+    const result = await execa(file, finalArgs, {
+      timeout: timeoutSeconds * 1000,
+      maxBuffer: MAX_BUFFER,
+      reject: false,
+      windowsHide: true,
+      env,
+      ...(useStdin ? { input: prompt } : {}),
+      ...(signal ? { cancelSignal: signal } : {}),
     })
-  }
 
-  return {
-    stdout: result.stdout ?? "",
-    stderr: result.stderr ?? "",
-    durationMs,
+    return {
+      stdout: result.stdout || "",
+      stderr: redactSecrets(result.stderr || ""),
+      exitCode: result.exitCode ?? 1,
+      durationMs: Date.now() - start,
+      timedOut: result.timedOut ?? false,
+    }
+  } catch (error: any) {
+    return {
+      stdout: "",
+      stderr: redactSecrets(error.message || "Execution failed"),
+      exitCode: 1,
+      durationMs: Date.now() - start,
+      timedOut: !!error.timedOut,
+    }
   }
 }
 
diff --git a/src/index.ts b/src/index.ts
index 266c4a7..fb24e5f 100644
--- a/src/index.ts
+++ b/src/index.ts
@@ -7,6 +7,8 @@
  * Tools exposed:
  *   cli_exec   — Execute a CLI with full resilience pipeline
  *   cli_status — Health check and observability dashboard
+ *   cli_list   — List installed CLI providers
+ *   cli_route  — Recommend best CLI by agent role
  *
  * Hook:
  *   experimental.chat.system.transform — injects CLI availability into agent prompts
@@ -18,7 +20,7 @@ import { tool } from "@opencode-ai/plugin"
 import { z } from "zod"
 
 import { PLATFORM } from "./platform"
-import { CLI_DEFS, ALL_CLI_NAMES, type CliName } from "./cli-defs"
+import { CLI_DEFS, ALL_CLI_NAMES, AGENT_ROLES, ROLE_ROUTING, type CliName } from "./cli-defs"
 import { createBreaker, DEFAULT_BREAKER_CONFIG } from "./circuit-breaker"
 import { DEFAULT_RETRY_CONFIG } from "./retry"
 import { detectAllClis, type CliAvailability } from "./detection"
@@ -89,12 +91,13 @@ export default ((ctx) => {
 ## External CLI Tools (${PLATFORM})
 Use \`cli_exec\` to call external LLMs — it handles OS differences, timeout, retry, and automatic fallback.
 Use \`cli_status\` to check health and availability of all CLI providers.
+Use \`cli_list\` to see installed providers. Use \`cli_route\` for role-based CLI recommendations.
 
 | CLI | Description | Strengths | Health |
 |-----|-------------|-----------|--------|
 ${rows}
 ${unavailableNote}
-Features: auto-retry (${DEFAULT_RETRY_CONFIG.maxRetries}x with backoff), circuit breaker per CLI, fallback to next available provider.
+Features: auto-retry (${DEFAULT_RETRY_CONFIG.maxRetries}x with backoff), circuit breaker per CLI, fallback to next available provider, global time budget.
 Rules: One concern per call. Split large requests. Include "CLI Consultations" in output.
 `
   }
@@ -107,7 +110,8 @@ Rules: One concern per call. Split large requests. Include "CLI Consultations" i
         name: "cli_exec",
         description:
           "Execute an external CLI (claude, gemini, codex) with automatic OS detection, timeout, " +
-          "retry with exponential backoff, circuit breaker protection, and fallback to alternative providers.",
+          "retry with exponential backoff, circuit breaker protection, and fallback to alternative providers. " +
+          "Uses a global time budget shared across all retries and fallbacks.",
         parameters: z.object({
           cli: z.enum(["claude", "gemini", "codex"]).describe("Primary CLI to invoke"),
           prompt: z.string().min(1).max(100_000).describe("The prompt to send to the CLI"),
@@ -121,7 +125,7 @@ Rules: One concern per call. Split large requests. Include "CLI Consultations" i
             .min(10)
             .max(1800)
             .default(720)
-            .describe("Max seconds before killing the process"),
+            .describe("Global timeout budget in seconds (covers all retries and fallbacks)"),
           allow_fallback: z
             .boolean()
             .default(true)
@@ -174,7 +178,9 @@ Rules: One concern per call. Split large requests. Include "CLI Consultations" i
               circuit_breaker: {
                 state: breaker.state,
                 consecutive_failures: breaker.failures,
+                consecutive_timeouts: breaker.timeouts,
                 failure_threshold: DEFAULT_BREAKER_CONFIG.failureThreshold,
+                timeout_threshold: DEFAULT_BREAKER_CONFIG.timeoutThreshold,
                 cooldown_seconds: DEFAULT_BREAKER_CONFIG.cooldownMs / 1000,
                 opened_at: breaker.openedAt ? new Date(breaker.openedAt).toISOString() : null,
                 last_failure: breaker.lastFailure
@@ -183,6 +189,9 @@ Rules: One concern per call. Split large requests. Include "CLI Consultations" i
                 last_success: breaker.lastSuccess
                   ? new Date(breaker.lastSuccess).toISOString()
                   : null,
+                total_executions: breaker.totalExecutions,
+                total_failures: breaker.totalFailures,
+                total_timeouts: breaker.totalTimeouts,
               },
               usage: {
                 total_calls: stats.calls,
@@ -207,12 +216,75 @@ Rules: One concern per call. Split large requests. Include "CLI Consultations" i
             },
             breaker_config: {
               failure_threshold: DEFAULT_BREAKER_CONFIG.failureThreshold,
+              timeout_threshold: DEFAULT_BREAKER_CONFIG.timeoutThreshold,
               cooldown_seconds: DEFAULT_BREAKER_CONFIG.cooldownMs / 1000,
             },
             providers,
           }
         },
       }),
+
+      tool({
+        name: "cli_list",
+        description: "List installed CLI providers with their paths, versions, and strengths.",
+        parameters: z.object({}),
+        execute: async () => {
+          await detectionPromise
+
+          const installed: { provider: CliName; path: string | null; version: string | null; strengths: string[] }[] = []
+          for (const name of ALL_CLI_NAMES) {
+            const avail = cliAvailability.get(name)
+            if (avail?.installed) {
+              installed.push({
+                provider: name,
+                path: avail.path,
+                version: avail.version,
+                strengths: CLI_DEFS[name].strengths,
+              })
+            }
+          }
+
+          return {
+            installed_count: installed.length,
+            providers: installed,
+          }
+        },
+      }),
+
+      tool({
+        name: "cli_route",
+        description:
+          "Suggest the best CLI for a task based on agent role. " +
+          "Returns recommended provider with reasoning and fallback chain.",
+        parameters: z.object({
+          role: z.enum(AGENT_ROLES).describe("Agent role (manager, coordinator, developer, researcher, reviewer, architect)"),
+          task_description: z.string().optional().describe("Brief task description for context"),
+        }),
+        execute: async ({ role, task_description }) => {
+          await detectionPromise
+
+          const routing = ROLE_ROUTING[role]
+          const chain = [routing.primary, ...routing.fallbacks] as CliName[]
+
+          const availability: Record<string, boolean> = {}
+          for (const provider of chain) {
+            const det = cliAvailability.get(provider)
+            const breaker = breakers.get(provider)!
+            availability[provider] = (det?.installed ?? false) && breaker.state !== "open"
+          }
+
+          const recommended = chain.find((p) => availability[p]) || routing.primary
+
+          return {
+            role,
+            task_description: task_description ?? null,
+            recommended_cli: recommended,
+            reasoning: `Role "${role}" maps to ${routing.primary} (${CLI_DEFS[routing.primary].strengths.join(", ")})${recommended !== routing.primary ? `. Falling back to ${recommended} because ${routing.primary} is unavailable.` : "."}`,
+            fallback_chain: chain,
+            availability,
+          }
+        },
+      }),
     ],
 
     hooks: {
diff --git a/src/resilience.ts b/src/resilience.ts
index 4a39e85..76027ca 100644
--- a/src/resilience.ts
+++ b/src/resilience.ts
@@ -1,6 +1,9 @@
 /**
  * Resilience Engine — orchestrates retry + circuit breaker + fallback
- * into a single execution pipeline.
+ * into a single execution pipeline with a global time budget.
+ *
+ * The global budget is shared across ALL retries and fallback attempts,
+ * preventing timeout multiplication (3 providers × 3 attempts × timeout).
  */
 
 import type { CliName } from "./cli-defs"
@@ -11,6 +14,7 @@ import {
   isBreakerAvailable,
   recordSuccess,
   recordFailure,
+  recordTimeout,
 } from "./circuit-breaker"
 import type { RetryConfig } from "./retry"
 import { DEFAULT_RETRY_CONFIG, calculateDelay, sleep } from "./retry"
@@ -21,7 +25,7 @@ import type { CircuitState } from "./circuit-breaker"
 import { classifyError, type ErrorClass } from "./error-classifier"
 import { redactSecrets } from "./redact"
 
-// ─── Structured Response (MCP pattern) ─────────────────────────────────────
+// ─── Structured Response ──────────────────────────────────────────────────
 
 export interface CliResponse {
   success: boolean
@@ -40,7 +44,7 @@ export interface CliResponse {
   max_attempts: number
 }
 
-// ─── Usage Stats ───────────────────────────────────────────────────────────
+// ─── Usage Stats ──────────────────────────────────────────────────────────
 
 export interface UsageStats {
   calls: number
@@ -48,7 +52,7 @@ export interface UsageStats {
   totalMs: number
 }
 
-// ─── Engine ────────────────────────────────────────────────────────────────
+// ─── Engine ───────────────────────────────────────────────────────────────
 
 export interface ResilienceContext {
   breakers: Map<CliName, CircuitBreaker>
@@ -59,6 +63,17 @@ export interface ResilienceContext {
   breakerConfig: BreakerConfig
 }
 
+/** Merge caller signal with budget signal (compatible with all runtimes) */
+function mergeAbortSignals(a?: AbortSignal, b?: AbortSignal): AbortSignal | undefined {
+  if (!a) return b
+  if (!b) return a
+  const controller = new AbortController()
+  const onAbort = () => controller.abort()
+  a.addEventListener("abort", onAbort, { once: true })
+  b.addEventListener("abort", onAbort, { once: true })
+  return controller.signal
+}
+
 export async function executeWithResilience(
   ctx: ResilienceContext,
   targetCli: CliName,
@@ -68,121 +83,167 @@ export async function executeWithResilience(
   allowFallback: boolean,
   signal?: AbortSignal,
 ): Promise<CliResponse> {
-  const timeoutMs = timeoutSeconds * 1000
   const def = CLI_DEFS[targetCli]
   const fallbackChain: string[] = [targetCli]
+  const errors: string[] = []
 
   // Build execution order: target first, then fallbacks
   const executionOrder: CliName[] = [targetCli]
   if (allowFallback) {
-    const available = ALL_CLI_NAMES.filter((n) => {
-      const avail = ctx.availability.get(n)
-      return avail?.installed !== false
-    })
     for (const fb of def.fallbackOrder) {
-      if (available.includes(fb)) executionOrder.push(fb)
+      const avail = ctx.availability.get(fb)
+      if (avail?.installed !== false) executionOrder.push(fb)
     }
   }
 
-  for (const cliName of executionOrder) {
-    const currentDef = CLI_DEFS[cliName]
-    const breaker = ctx.breakers.get(cliName)!
-    const stats = ctx.usageStats.get(cliName)!
+  // Global time budget: entire chain (retries + fallbacks) must fit within timeoutSeconds
+  const globalDeadline = Date.now() + timeoutSeconds * 1000
+  const budgetController = new AbortController()
+  const budgetTimeout = setTimeout(() => budgetController.abort(), timeoutSeconds * 1000)
+  const mergedSignal = mergeAbortSignals(signal, budgetController.signal)
+
+  try {
+    for (const cliName of executionOrder) {
+      const remaining = globalDeadline - Date.now()
+      if (remaining <= 0) {
+        errors.push(`${cliName}: global budget exhausted`)
+        break
+      }
 
-    // Check circuit breaker
-    if (!isBreakerAvailable(breaker, ctx.breakerConfig)) {
-      if (cliName !== targetCli) fallbackChain.push(`${cliName}(circuit-open)`)
-      continue
-    }
+      const currentDef = CLI_DEFS[cliName]
+      const breaker = ctx.breakers.get(cliName)!
+      const stats = ctx.usageStats.get(cliName)!
 
-    // Check availability
-    const avail = ctx.availability.get(cliName)
-    if (avail?.installed === false) {
-      if (cliName !== targetCli) fallbackChain.push(`${cliName}(not-installed)`)
-      continue
-    }
+      // Check circuit breaker
+      if (!isBreakerAvailable(breaker, ctx.breakerConfig)) {
+        if (cliName !== targetCli) fallbackChain.push(`${cliName}(circuit-open)`)
+        errors.push(`${cliName}: circuit breaker open`)
+        continue
+      }
 
-    if (cliName !== targetCli) fallbackChain.push(cliName)
+      // Check availability
+      const avail = ctx.availability.get(cliName)
+      if (avail?.installed === false) {
+        if (cliName !== targetCli) fallbackChain.push(`${cliName}(not-installed)`)
+        errors.push(`${cliName}: not installed`)
+        continue
+      }
 
-    // Retry loop
-    for (let attempt = 0; attempt <= ctx.retryConfig.maxRetries; attempt++) {
-      if (signal?.aborted) break
+      if (cliName !== targetCli) fallbackChain.push(cliName)
 
-      if (attempt > 0) {
-        const delay = calculateDelay(attempt - 1, ctx.retryConfig)
-        await sleep(delay)
-      }
+      // Retry loop
+      for (let attempt = 0; attempt <= ctx.retryConfig.maxRetries; attempt++) {
+        if (mergedSignal?.aborted) {
+          errors.push(`${cliName}: aborted`)
+          break
+        }
+
+        const remainingSeconds = Math.max(1, Math.floor((globalDeadline - Date.now()) / 1000))
+        if (remainingSeconds <= 1) {
+          errors.push(`${cliName}: global budget exhausted`)
+          break
+        }
+
+        if (attempt > 0) {
+          const delay = calculateDelay(attempt - 1, ctx.retryConfig)
+          try {
+            await sleep(delay, mergedSignal)
+          } catch {
+            errors.push(`${cliName}: aborted during retry backoff`)
+            break
+          }
+        }
 
-      try {
         stats.calls++
-        const result = await executeCliOnce(currentDef, prompt, mode, timeoutMs, signal)
-
-        recordSuccess(breaker, ctx.breakerConfig)
-        stats.totalMs += result.durationMs
-
-        return {
-          success: true,
-          cli: cliName,
-          platform: ctx.platform,
-          stdout: result.stdout,
-          stderr: result.stderr,
-          duration_ms: result.durationMs,
-          timed_out: false,
-          used_fallback: cliName !== targetCli,
-          fallback_chain: fallbackChain,
-          error: null,
-          error_class: null,
-          circuit_state: breaker.state,
-          attempt: attempt + 1,
-          max_attempts: ctx.retryConfig.maxRetries + 1,
+        const result = await executeCliOnce(currentDef, prompt, mode, remainingSeconds, mergedSignal)
+
+        if (result.exitCode === 0 && result.stdout) {
+          recordSuccess(breaker, ctx.breakerConfig)
+          stats.totalMs += result.durationMs
+
+          return {
+            success: true,
+            cli: cliName,
+            platform: ctx.platform,
+            stdout: result.stdout,
+            stderr: result.stderr,
+            duration_ms: result.durationMs,
+            timed_out: false,
+            used_fallback: cliName !== targetCli,
+            fallback_chain: fallbackChain,
+            error: null,
+            error_class: null,
+            circuit_state: breaker.state,
+            attempt: attempt + 1,
+            max_attempts: ctx.retryConfig.maxRetries + 1,
+          }
         }
-      } catch (err: unknown) {
-        stats.failures++
 
-        const errorClass = classifyError(err)
+        // Process timeout: skip retries, move to next provider immediately
+        if (result.timedOut) {
+          const err = redactSecrets(`${cliName}: process timeout (${result.durationMs}ms) — skipping retries`)
+          errors.push(err)
+          stats.failures++
+          recordTimeout(breaker, ctx.breakerConfig)
+          break
+        }
 
-        // permanent and crash errors: skip retries, fallback immediately
+        // Classify error for retry decision
+        const errorClass = classifyError({ message: result.stderr, exitCode: result.exitCode })
+
+        // Permanent and crash errors: skip retries, fallback immediately
         if (errorClass === "permanent" || errorClass === "crash") {
+          const err = redactSecrets(`${cliName}: ${result.stderr || "non-retryable failure"}`)
+          errors.push(err)
+          stats.failures++
           recordFailure(breaker, ctx.breakerConfig)
-          break // try next CLI in fallback chain
+          break
         }
 
-        // rate_limit: wait longer before retrying
-        if (errorClass === "rate_limit") {
+        // Rate limit: wait longer before retrying
+        if (errorClass === "rate_limit" && attempt < ctx.retryConfig.maxRetries) {
           const rateLimitDelay = calculateDelay(attempt + 1, {
             ...ctx.retryConfig,
             baseDelayMs: ctx.retryConfig.baseDelayMs * 3,
           })
-          await sleep(rateLimitDelay)
+          try {
+            await sleep(rateLimitDelay, mergedSignal)
+          } catch {
+            errors.push(`${cliName}: aborted during rate-limit backoff`)
+            break
+          }
         }
 
         const isLastAttempt = attempt === ctx.retryConfig.maxRetries
         if (isLastAttempt) {
+          const err = redactSecrets(`${cliName}: exhausted retries — ${result.stderr}`)
+          errors.push(err)
+          stats.failures++
           recordFailure(breaker, ctx.breakerConfig)
-          break // try next CLI in fallback chain
         }
-        // transient or rate_limit — loop continues with retry
       }
+
+      if (mergedSignal?.aborted) break
     }
-  }
 
-  // All CLIs exhausted
-  return {
-    success: false,
-    cli: targetCli,
-    platform: ctx.platform,
-    stdout: "",
-    stderr: "",
-    duration_ms: 0,
-    timed_out: false,
-    used_fallback: fallbackChain.length > 1,
-    fallback_chain: fallbackChain,
-    error: redactSecrets(
-      `All CLI providers exhausted. Tried: ${fallbackChain.join(" → ")}. Check cli_status for details.`,
-    ),
-    error_class: "transient",
-    circuit_state: ctx.breakers.get(targetCli)!.state,
-    attempt: ctx.retryConfig.maxRetries + 1,
-    max_attempts: ctx.retryConfig.maxRetries + 1,
+    // All CLIs exhausted
+    return {
+      success: false,
+      cli: targetCli,
+      platform: ctx.platform,
+      stdout: "",
+      stderr: "",
+      duration_ms: 0,
+      timed_out: false,
+      used_fallback: fallbackChain.length > 1,
+      fallback_chain: fallbackChain,
+      error: redactSecrets(`All providers failed: ${errors.join("; ")}`),
+      error_class: "transient",
+      circuit_state: ctx.breakers.get(targetCli)!.state,
+      attempt: ctx.retryConfig.maxRetries + 1,
+      max_attempts: ctx.retryConfig.maxRetries + 1,
+    }
+  } finally {
+    clearTimeout(budgetTimeout)
   }
 }
diff --git a/src/retry.ts b/src/retry.ts
index 0a237b2..8e4e982 100644
--- a/src/retry.ts
+++ b/src/retry.ts
@@ -31,8 +31,23 @@ export function calculateDelay(
   return Math.max(0, Math.round(capped + jitter))
 }
 
-export function sleep(ms: number): Promise<void> {
-  return new Promise((resolve) => setTimeout(resolve, ms))
+/** Abort-aware sleep — rejects immediately if signal fires during the wait. */
+export function sleep(ms: number, signal?: AbortSignal): Promise<void> {
+  return new Promise((resolve, reject) => {
+    if (signal?.aborted) {
+      reject(new Error("aborted"))
+      return
+    }
+    const timer = setTimeout(resolve, ms)
+    signal?.addEventListener(
+      "abort",
+      () => {
+        clearTimeout(timer)
+        reject(new Error("aborted"))
+      },
+      { once: true },
+    )
+  })
 }
 
 export function isRetryableError(error: unknown): boolean {
diff --git a/src/safe-env.ts b/src/safe-env.ts
index cd3540a..423245f 100644
--- a/src/safe-env.ts
+++ b/src/safe-env.ts
@@ -1,9 +1,13 @@
 /**
  * Environment Variable Filtering — only passes safe variables to
  * spawned CLI processes, preventing accidental secret leakage.
+ *
+ * CLIs handle their own auth inline (claude login, gcloud auth, etc.)
+ * so we just need system essentials + proxy settings.
  */
 
 export const SAFE_ENV_VARS = [
+  // System essentials
   "PATH",
   "HOME",
   "USER",
@@ -11,14 +15,24 @@ export const SAFE_ENV_VARS = [
   "SHELL",
   "LANG",
   "LC_ALL",
-  "ANTHROPIC_API_KEY",
-  "GOOGLE_API_KEY",
-  "OPENAI_API_KEY",
-  "GEMINI_API_KEY",
-  "CODEX_API_KEY",
+  // Windows
+  "USERPROFILE",
+  "SYSTEMROOT",
+  "SYSTEMDRIVE",
+  "COMSPEC",
+  "PATHEXT",
+  "TEMP",
+  "TMP",
+  "APPDATA",
+  "LOCALAPPDATA",
+  "PROGRAMFILES",
+  // Proxy
   "HTTP_PROXY",
   "HTTPS_PROXY",
   "NO_PROXY",
+  "http_proxy",
+  "https_proxy",
+  "no_proxy",
 ]
 
 export function getSafeEnv(): Record<string, string> {
diff --git a/tests/circuit-breaker.test.ts b/tests/circuit-breaker.test.ts
index 00ce857..4e99407 100644
--- a/tests/circuit-breaker.test.ts
+++ b/tests/circuit-breaker.test.ts
@@ -4,12 +4,14 @@ import {
   isBreakerAvailable,
   recordSuccess,
   recordFailure,
+  recordTimeout,
   DEFAULT_BREAKER_CONFIG,
   type BreakerConfig,
 } from "../src/circuit-breaker"
 
 const config: BreakerConfig = {
   failureThreshold: 3,
+  timeoutThreshold: 5,
   cooldownMs: 5_000,
   halfOpenSuccessThreshold: 1,
 }
@@ -19,6 +21,8 @@ describe("Circuit Breaker", () => {
     const b = createBreaker()
     expect(b.state).toBe("closed")
     expect(b.failures).toBe(0)
+    expect(b.timeouts).toBe(0)
+    expect(b.totalExecutions).toBe(0)
   })
 
   it("remains closed after fewer failures than threshold", () => {
@@ -92,4 +96,53 @@ describe("Circuit Breaker", () => {
     const b = createBreaker()
     expect(isBreakerAvailable(b, config)).toBe(true)
   })
+
+  it("tracks total executions across success and failure", () => {
+    const b = createBreaker()
+    recordSuccess(b, config)
+    recordFailure(b, config)
+    recordSuccess(b, config)
+    expect(b.totalExecutions).toBe(3)
+    expect(b.totalFailures).toBe(1)
+  })
+})
+
+describe("Circuit Breaker — Timeout Tracking", () => {
+  it("remains closed after fewer timeouts than threshold", () => {
+    const b = createBreaker()
+    for (let i = 0; i < 4; i++) recordTimeout(b, config)
+    expect(b.state).toBe("closed")
+    expect(b.timeouts).toBe(4)
+  })
+
+  it("opens after reaching timeout threshold (5)", () => {
+    const b = createBreaker()
+    for (let i = 0; i < 5; i++) recordTimeout(b, config)
+    expect(b.state).toBe("open")
+    expect(b.totalTimeouts).toBe(5)
+  })
+
+  it("resets timeout counter on success", () => {
+    const b = createBreaker()
+    recordTimeout(b, config)
+    recordTimeout(b, config)
+    recordSuccess(b, config)
+    expect(b.timeouts).toBe(0)
+  })
+
+  it("re-opens immediately on timeout in half-open state", () => {
+    const b = createBreaker()
+    const now = 1000
+    for (let i = 0; i < 5; i++) recordTimeout(b, config, now)
+    isBreakerAvailable(b, config, now + 5_000) // trigger half-open
+
+    recordTimeout(b, config, now + 5_001)
+    expect(b.state).toBe("open")
+  })
+
+  it("timeout threshold is higher than failure threshold (slow ≠ broken)", () => {
+    expect(DEFAULT_BREAKER_CONFIG.timeoutThreshold).toBeGreaterThan(
+      DEFAULT_BREAKER_CONFIG.failureThreshold,
+    )
+  })
 })
diff --git a/tests/cli-defs.test.ts b/tests/cli-defs.test.ts
index 9114a9f..4894726 100644
--- a/tests/cli-defs.test.ts
+++ b/tests/cli-defs.test.ts
@@ -1,5 +1,5 @@
 import { describe, it, expect } from "bun:test"
-import { CLI_DEFS, ALL_CLI_NAMES, type CliName } from "../src/cli-defs"
+import { CLI_DEFS, ALL_CLI_NAMES, AGENT_ROLES, ROLE_ROUTING, buildTimeoutArgs, type CliName } from "../src/cli-defs"
 
 describe("CLI Definitions", () => {
   it("defines all three CLIs", () => {
@@ -53,7 +53,6 @@ describe("CLI Definitions", () => {
       const args = CLI_DEFS.claude.buildStdinArgs!("analyze")
       expect(args).toContain("-p")
       expect(args).toContain("-")
-      expect(args).toContain("--max-turns")
     })
 
     it("gemini stdin mode does not include prompt", () => {
@@ -79,10 +78,10 @@ describe("CLI Definitions", () => {
       expect(args).toContain("test prompt")
     })
 
-    it("claude analyze mode includes --max-turns", () => {
+    it("claude analyze mode uses -p flag", () => {
       const args = CLI_DEFS.claude.buildArgs("test prompt", "analyze")
-      expect(args).toContain("--max-turns")
-      expect(args).toContain("10")
+      expect(args).toContain("-p")
+      expect(args).toContain("test prompt")
     })
 
     it("gemini builds correct args", () => {
@@ -99,3 +98,66 @@ describe("CLI Definitions", () => {
     })
   })
 })
+
+describe("buildTimeoutArgs", () => {
+  it("claude gets --max-turns based on remaining seconds", () => {
+    const args = buildTimeoutArgs("claude", 300)
+    expect(args).toContain("--max-turns")
+    expect(args).toContain("10") // 300/30 = 10
+  })
+
+  it("claude max-turns clamps at minimum 2", () => {
+    const args = buildTimeoutArgs("claude", 15)
+    expect(args).toContain("2")
+  })
+
+  it("claude max-turns clamps at maximum 25", () => {
+    const args = buildTimeoutArgs("claude", 1800)
+    expect(args).toContain("25")
+  })
+
+  it("gemini returns empty array", () => {
+    expect(buildTimeoutArgs("gemini", 300)).toEqual([])
+  })
+
+  it("codex returns empty array", () => {
+    expect(buildTimeoutArgs("codex", 300)).toEqual([])
+  })
+})
+
+describe("Role Routing", () => {
+  it("defines all 6 agent roles", () => {
+    expect(AGENT_ROLES).toHaveLength(6)
+    expect(AGENT_ROLES).toContain("manager")
+    expect(AGENT_ROLES).toContain("developer")
+    expect(AGENT_ROLES).toContain("architect")
+  })
+
+  it("each role maps to a valid primary provider", () => {
+    for (const role of AGENT_ROLES) {
+      expect(ALL_CLI_NAMES).toContain(ROLE_ROUTING[role].primary)
+    }
+  })
+
+  it("each role has valid fallbacks", () => {
+    for (const role of AGENT_ROLES) {
+      const routing = ROLE_ROUTING[role]
+      for (const fb of routing.fallbacks) {
+        expect(ALL_CLI_NAMES).toContain(fb)
+      }
+      expect(routing.fallbacks).not.toContain(routing.primary)
+    }
+  })
+
+  it("developer routes to codex", () => {
+    expect(ROLE_ROUTING.developer.primary).toBe("codex")
+  })
+
+  it("researcher routes to gemini", () => {
+    expect(ROLE_ROUTING.researcher.primary).toBe("gemini")
+  })
+
+  it("architect routes to claude", () => {
+    expect(ROLE_ROUTING.architect.primary).toBe("claude")
+  })
+})
diff --git a/tests/safe-env.test.ts b/tests/safe-env.test.ts
index a55ee35..375e9a8 100644
--- a/tests/safe-env.test.ts
+++ b/tests/safe-env.test.ts
@@ -6,16 +6,26 @@ describe("SAFE_ENV_VARS", () => {
     expect(SAFE_ENV_VARS).toContain("PATH")
   })
 
-  it("includes API key vars", () => {
-    expect(SAFE_ENV_VARS).toContain("ANTHROPIC_API_KEY")
-    expect(SAFE_ENV_VARS).toContain("GOOGLE_API_KEY")
-    expect(SAFE_ENV_VARS).toContain("OPENAI_API_KEY")
+  it("does NOT include API key vars (CLIs handle their own auth)", () => {
+    expect(SAFE_ENV_VARS).not.toContain("ANTHROPIC_API_KEY")
+    expect(SAFE_ENV_VARS).not.toContain("GOOGLE_API_KEY")
+    expect(SAFE_ENV_VARS).not.toContain("OPENAI_API_KEY")
   })
 
-  it("includes proxy vars", () => {
+  it("includes Windows system vars", () => {
+    expect(SAFE_ENV_VARS).toContain("USERPROFILE")
+    expect(SAFE_ENV_VARS).toContain("SYSTEMROOT")
+    expect(SAFE_ENV_VARS).toContain("APPDATA")
+    expect(SAFE_ENV_VARS).toContain("PATHEXT")
+  })
+
+  it("includes proxy vars with both casings", () => {
     expect(SAFE_ENV_VARS).toContain("HTTP_PROXY")
     expect(SAFE_ENV_VARS).toContain("HTTPS_PROXY")
     expect(SAFE_ENV_VARS).toContain("NO_PROXY")
+    expect(SAFE_ENV_VARS).toContain("http_proxy")
+    expect(SAFE_ENV_VARS).toContain("https_proxy")
+    expect(SAFE_ENV_VARS).toContain("no_proxy")
   })
 })
 

Feature	Windows	macOS / Linux
Binary detection	`where`	`which`
`.cmd/.bat` shims	Auto-wrapped with `cmd /c`	N/A
PATH augmentation	npm, scoop, cargo, pnpm dirs	Standard PATH
Large prompts (>30KB)	Delivered via `stdin` to avoid OS arg-length limits
Environment	Allowlisted vars only (no secrets leak to subprocesses)