diff --git a/docs/TODO.md b/docs/TODO.md index a70e0bfbd..097620200 100644 --- a/docs/TODO.md +++ b/docs/TODO.md @@ -1,25 +1,140 @@ # Documentation TODO -## Installation -- [x] Add Homebrew install instructions for Mac/Linux -- [x] Mention that cagent comes pre-installed with Docker Desktop -- [x] Explain why `task build-local` is recommended on Windows (Docker Buildx cross-compilation) +## New Pages -## Quick Start -- [x] Clarify that Step 1 (install binary) and Step 2 (build from source) are two alternative options, not sequential steps -- [x] Add a path to get started from a built-in example (`cagent run examples/...`) or the default agent -- [x] Show running an agent from the registry (`cagent run agentcatalog/...`) +- [x] **Go SDK** — Not documented anywhere. The `examples/golibrary/` directory shows how to use cagent as a Go library. Needs a dedicated page. *(Completed: pages/guides/go-sdk.html)* +- [x] **Hooks** — `hooks` agent config (`pre_tool_use`, `post_tool_use`, `session_start`, `session_end`) is a significant feature with no documentation page. Covers running shell commands at various agent lifecycle points. *(Completed: pages/configuration/hooks.html)* +- [x] **Permissions** — Top-level `permissions` config with `allow`/`deny` glob patterns for tool call approval. Mentioned briefly in TUI page but has no dedicated reference. *(Completed: pages/configuration/permissions.html)* +- [x] **Sandbox Mode** — Shell tool `sandbox` config runs commands in Docker containers. Includes `image` and `paths` (bind mounts with `:ro` support). Not documented. *(Completed: pages/configuration/sandbox.html)* +- [x] **Structured Output** — Agent-level `structured_output` config (name, description, schema, strict). Forces model responses into a JSON schema. Not documented. 
*(Completed: pages/configuration/structured-output.html)* +- [x] **Model Routing** — Model-level `routing` config with rules that route requests to different models based on example phrases (rule-based router). *(Completed: pages/configuration/routing.html)* +- [x] **Custom Providers (top-level `providers` section)** — The `providers` top-level config key for defining reusable provider definitions with `api_type`, `base_url`, `token_key`. *(Completed: Added to pages/configuration/overview.html)* +- [x] **LSP Tool** — `type: lsp` toolset that provides Language Server Protocol integration (diagnostics, code actions, references, rename, etc.). Not documented anywhere. *(Completed: pages/tools/lsp.html)* +- [x] **User Prompt Tool** — `type: user_prompt` toolset that allows agents to ask the user for input mid-conversation. Not documented. *(Completed: pages/tools/user-prompt.html)* +- [x] **API Tool** — `api_config` on toolsets for defining HTTP API tools with endpoint, method, headers, args, and output_schema. Not documented. *(Completed: pages/tools/api.html)* +- [ ] **Branching Sessions** — TUI feature (v1.20.6) allowing editing previous messages to create branches. Mentioned in TUI page but could use more detail. -## Tools -- [x] Add missing built-in tools documentation (think, todo, memory, fetch, script, etc.) -- [x] Expand Tool Config page with more detailed content and examples for each tool type +## Missing Details in Existing Pages -## Features -- [x] Fix broken TUI demo GIF link on the home page +### Configuration > Agents (`agents.html`) -## Core Concepts -- [x] Move Agent Distribution from Features to Core Concepts (packaging & sharing is a fundamental concept) +- [x] `welcome_message` — Agent property not listed in schema or properties table *(Added)* +- [x] `handoffs` — Agent property for listing agents that can be handed off to (different from `sub_agents`). Not documented. 
*(Added)* +- [x] `add_prompt_files` — Agent property for including additional prompt files. *(Added)* +- [x] `add_description_parameter` — Agent property. *(Added)* +- [x] `code_mode_tools` — Agent property. *(Added)* +- [x] `hooks` — Agent property. Not shown in schema or properties table. *(Added with link to hooks page)* +- [x] `structured_output` — Agent property. Not shown in schema or properties table. *(Added with link to structured-output page)* +- [x] `defer` — Tool deferral configuration. *(Added with examples)* +- [x] `permissions` — Permission configuration. *(Added with link to permissions page)* +- [x] `sandbox` — Sandbox mode configuration. *(Added with link to sandbox page)* -## New Pages -- [ ] Remote runtime — remote runtime/server mode is not documented anywhere -- [ ] Changelog / What's New — could add a page or section sourced from `CHANGELOG.md` +### Configuration > Tools (`tools.html`) + +- [x] **LSP toolset** (`type: lsp`) — Language Server Protocol integration. *(Added with link to dedicated page)* +- [x] **User Prompt toolset** (`type: user_prompt`) — User input collection. *(Added with link to dedicated page)* +- [x] **API toolset** (`type: api`) — HTTP API tools. *(Added with link to dedicated page)* +- [x] **Handoff toolset** (`type: handoff`) — A2A agent delegation. *(Added)* +- [x] **A2A toolset** (`type: a2a`) — Toolset for connecting to remote A2A agents with `name` and `url`. *(Added)* +- [x] **Shared todo** (`shared: true`) — Todo toolset option for sharing todos across agents. *(Added)* +- [x] **Filesystem `post_edit`** — Post-edit commands that run after file edits (e.g., auto-format). *(Added)* +- [x] **Filesystem `ignore_vcs`** — Option to ignore VCS (.gitignore) files. *(Added)* +- [x] **Shell `env`** — Environment variables for shell/script/mcp/lsp tools. *(Added)* +- [x] **Fetch `timeout`** — Fetch tool timeout configuration. 
*(Added)* +- [x] **Script tool format** — Updated to show the correct `shell` map format with args, required, env, working_dir. *(Fixed)* +- [x] **MCP `config`** — The `config` field on MCP toolsets. *(Added)* + +### Configuration > Models (`models.html`) + +- [x] `track_usage` — Model property to track token usage. *(Added)* +- [x] `token_key` — Model property for specifying the env var holding the API token. *(Added)* +- [x] `routing` — Model property for rule-based routing. *(Added with link to routing page)* +- [x] `base_url` — Added examples showing how to use it with custom/self-hosted endpoints. *(Added)* + +### Configuration > Overview (`overview.html`) + +- [x] `metadata` — Top-level config section (author, license, description, readme, version). *(Added)* +- [x] `permissions` — Top-level config section. *(Added link to permissions page)* +- [x] `providers` — Top-level section. *(Added full documentation)* +- [x] Config `version` field — Current version is "5" but not documented what it means or how migration works. *(Added)* +- [x] **Advanced configuration cards** — Added cards linking to Hooks, Permissions, Sandbox, and Structured Output pages. *(Done)* + +### Features > CLI (`cli.html`) + +- [x] `--prompt-file` flag — Explanation of how it works (includes file contents as system context). *(Added)* +- [x] `--session` with relative references — e.g., `-1` for last session, `-2` for second to last. *(Added)* +- [x] Multi-turn conversations in `cagent exec` — Added example. *(Added)* +- [x] Queueing multiple messages: `cagent run question1 question2 ...` *(Added)* +- [x] `cagent eval` flags — Added examples with flags. *(Added)* +- [x] `cagent build` command — *(Added)* +- [ ] `--exit-on-stdin-eof` flag — Hidden flag, low priority. +- [ ] `--keep-containers` flag for eval — Already documented in eval page. 
+ +### Features > TUI (`tui.html`) + +- [x] Ctrl+R reverse history search — *(Added with dedicated section)* +- [x] `/title` command for renaming sessions — *(Already documented)* +- [x] `/think` command to toggle thinking at runtime — *(Already documented)* +- [x] Custom themes and hot-reloading — *(Already documented)* +- [x] Ctrl+Z to suspend TUI — *(Added)* +- [x] Ctrl+L audio listening shortcut — *(Added)* +- [x] Ctrl+X to clear queued messages — *(Added)* +- [x] Permissions view dialog — *(Mentioned)* +- [x] Model picker / switching during session — *(Already documented)* +- [ ] Branching sessions (edit previous messages) — Mentioned but could have more detail. +- [ ] Double-click title to edit — Minor feature. + +### Features > Skills (`skills.html`) + +- [x] Skill invocation via slash commands — *(Added)* +- [x] Recursive `~/.agents/skills` directory support — *(Clarified in table)* + +### Features > Evaluation (`evaluation.html`) + +- [x] `--keep-containers` flag — *(Already documented)* +- [x] Session database produced for investigation — *(Added note)* +- [x] Debugging tip for failed evals — *(Added callout)* + +### Providers + +- [x] **Mistral** — Listed as built-in alias but has no dedicated page or usage examples. *(Completed: pages/providers/mistral.html)* +- [x] **xAI (Grok)** — *(Completed: pages/providers/xai.html)* +- [x] **Nebius** — *(Completed: pages/providers/nebius.html)* +- [x] **Ollama** — Can be used via custom providers. *(Completed: pages/providers/local.html - covers Ollama, vLLM, LocalAI)* + +### Features > RAG (`rag.html`) + +- [x] `respect_vcs` option — *(Added with default value)* +- [x] `return_full_content` results option — *(Added)* +- [ ] Code-aware chunking (`code_aware: true`) with tree-sitter — Partially documented, the option is shown in examples. + +### Community > Troubleshooting + +- [x] Common errors: context window exceeded, max iterations reached, model fallback behavior. 
*(Added)* +- [x] Debugging with `--debug` and `--log-file` — *(Already documented)* + +## Tips & Best Practices + +- [x] *(Completed: pages/guides/tips.html)* - Comprehensive tips page covering: + - [x] **Tip: Using `--yolo` mode** — Auto-approve all tool calls. Security implications and when it's appropriate. + - [x] **Tip: Environment variable interpolation in commands** — Commands support `${env.VAR}` and `${env.VAR || 'default'}` JavaScript template syntax. + - [x] **Tip: Fallback model strategy** — Best practices for choosing fallback models. + - [x] **Tip: Deferred tools for performance** — Use `defer: true` to load tools only when needed. + - [x] **Tip: Combining handoffs and sub_agents** — Explain the difference. + - [x] **Tip: Using the `auto` model** — The special `auto` model value for automatic model selection. + - [x] **Tip: Model aliases and pinning** — cagent automatically resolves model aliases to pinned versions. + - [x] **Tip: User-defined default model** — Users can define their own default model in global configuration. *(Added)* + - [x] **Tip: Usage on GitHub** — Example of the PR reviewer. 
*(Added)* + +## Navigation Updates + +- [x] Added Model Routing to Configuration section +- [x] Added xAI (Grok) and Nebius to Model Providers section +- [x] Added Guides section with Tips & Best Practices and Go SDK + +## Remaining Low-Priority Items + +- [ ] Branching sessions — More detailed documentation (currently mentioned) +- [ ] Double-click title to edit in TUI — Minor feature +- [ ] `--exit-on-stdin-eof` flag — Hidden flag for integration +- [ ] Code-aware chunking detail — Already shown in examples diff --git a/docs/js/app.js b/docs/js/app.js index c74cf22cb..9e14188ac 100644 --- a/docs/js/app.js +++ b/docs/js/app.js @@ -29,6 +29,19 @@ const NAV = [ { title: 'Agent Config', page: 'configuration/agents' }, { title: 'Model Config', page: 'configuration/models' }, { title: 'Tool Config', page: 'configuration/tools' }, + { title: 'Hooks', page: 'configuration/hooks' }, + { title: 'Permissions', page: 'configuration/permissions' }, + { title: 'Sandbox Mode', page: 'configuration/sandbox' }, + { title: 'Structured Output', page: 'configuration/structured-output' }, + { title: 'Model Routing', page: 'configuration/routing' }, + ], + }, + { + heading: 'Built-in Tools', + items: [ + { title: 'LSP Tool', page: 'tools/lsp' }, + { title: 'User Prompt Tool', page: 'tools/user-prompt' }, + { title: 'API Tool', page: 'tools/api' }, ], }, { @@ -55,9 +68,20 @@ const NAV = [ { title: 'Google Gemini', page: 'providers/google' }, { title: 'AWS Bedrock', page: 'providers/bedrock' }, { title: 'Docker Model Runner', page: 'providers/dmr' }, + { title: 'Mistral', page: 'providers/mistral' }, + { title: 'xAI (Grok)', page: 'providers/xai' }, + { title: 'Nebius', page: 'providers/nebius' }, + { title: 'Local Models', page: 'providers/local' }, { title: 'Custom Providers', page: 'providers/custom' }, ], }, + { + heading: 'Guides', + items: [ + { title: 'Tips & Best Practices', page: 'guides/tips' }, + { title: 'Go SDK', page: 'guides/go-sdk' }, + ], + }, { heading: 'Community', 
items: [ diff --git a/docs/pages/community/troubleshooting.html b/docs/pages/community/troubleshooting.html index 5c89eb10a..ef6e3d51f 100644 --- a/docs/pages/community/troubleshooting.html +++ b/docs/pages/community/troubleshooting.html @@ -1,6 +1,49 @@

Troubleshooting

Common issues and how to resolve them when working with cagent.

+

Common Errors

+ +

Context Window Exceeded

+ +

The conversation plus prompt exceeded the model's context window. The provider returns context_length_exceeded or a similar error.

+ + + +
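Typical mitigations are to cap how much history is resent and to bound response size. A sketch using the num_history_items (agent) and max_tokens (model) options documented on the Agent Config and Model Config pages; the values shown are illustrative:

```yaml
agents:
  root:
    model: claude
    description: Assistant with a bounded context
    instruction: You are a helpful assistant.
    num_history_items: 30   # resend at most the last 30 history items

models:
  claude:
    provider: anthropic
    model: claude-sonnet-4-0
    max_tokens: 8192        # bound the size of each response
```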

Max Iterations Reached

+ +

The agent hit its max_iterations limit without completing the task.

+ + + +
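If the task legitimately needs more tool-calling loops, raise the limit. A sketch (the value is illustrative; the Agent Config page suggests 20–50 as typical for development agents):

```yaml
agents:
  root:
    model: openai/gpt-4o
    description: Development assistant
    instruction: You are a helpful coding assistant.
    max_iterations: 50   # 0 (the default) means unlimited
```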

Model Fallback Triggered

+ +

When the primary model fails, cagent automatically switches to fallback models. Look for log messages like "Switching to fallback model".

+ + + +

Configure fallback behavior in your agent config:

+ +
agents:
+  root:
+    model: anthropic/claude-sonnet-4-0
+    fallback:
+      models: [openai/gpt-4o, openai/gpt-4o-mini]
+      retries: 2      # retries per model for 5xx errors
+      cooldown: 1m    # how long to stick with fallback after 429
+

Debug Mode

The first step for any issue is enabling debug logging. This provides detailed information about what cagent is doing internally.

diff --git a/docs/pages/configuration/agents.html b/docs/pages/configuration/agents.html index 449703362..11984a7f1 100644 --- a/docs/pages/configuration/agents.html +++ b/docs/pages/configuration/agents.html @@ -8,20 +8,40 @@

Full Schema

model: string # Required: model reference description: string # Required: what this agent does instruction: string # Required: system prompt - sub_agents: [list] # Optional: sub-agent names + sub_agents: [list] # Optional: sub-agent names toolsets: [list] # Optional: tool configurations rag: [list] # Optional: RAG source references - fallback: # Optional: fallback config + fallback: # Optional: fallback config models: [list] retries: 2 cooldown: 1m add_date: boolean # Optional: add date to context add_environment_info: boolean # Optional: add env info to context + add_prompt_files: [list] # Optional: include additional prompt files + add_description_parameter: bool # Optional: add description to tool schema + code_mode_tools: boolean # Optional: enable code mode tool format max_iterations: int # Optional: max tool-calling loops num_history_items: int # Optional: limit conversation history - skills: boolean # Optional: enable skill discovery - commands: # Optional: named prompts - name: "prompt text" + skills: boolean # Optional: enable skill discovery + commands: # Optional: named prompts + name: "prompt text" + welcome_message: string # Optional: message shown at session start + handoffs: [list] # Optional: list of A2A handoff agents + defer: [list] or true # Optional: tools to load on demand + hooks: # Optional: lifecycle hooks + pre_tool_use: [list] + post_tool_use: [list] + session_start: [list] + session_end: [list] + permissions: # Optional: tool execution control + allow: [list] + deny: [list] + sandbox: # Optional: shell isolation + image: string + paths: [list] + structured_output: # Optional: constrain output format + name: string + schema: object
💡 See also
@@ -67,6 +87,18 @@

Properties Reference

add_environment_infoboolean✗ When true, injects working directory, OS, CPU architecture, and git info into context. + + add_prompt_filesarray✗ + List of file paths whose contents are appended to the system prompt. Useful for including coding standards, guidelines, or additional context. + + + add_description_parameterboolean✗ + When true, adds agent descriptions as a parameter in tool schemas. Helps with tool selection in multi-agent scenarios. + + + code_mode_toolsboolean✗ + When true, formats tool responses in a code-optimized format with structured output schemas. Useful for MCP gateway and programmatic access. + max_iterationsint✗ Maximum number of tool-calling loops. Default: unlimited (0). Set this to prevent infinite loops. @@ -87,6 +119,34 @@

Properties Reference

commandsobject✗ Named prompts that can be run with cagent run config.yaml /command_name. + + welcome_messagestring✗ + Message displayed to the user when a session starts. Useful for providing context or instructions. + + + handoffsarray✗ + List of A2A agent configurations this agent can delegate to. See A2A Protocol. + + + deferarray/boolean✗ + Tools to load on-demand rather than at startup. Set to true to defer all tools, or provide a list of tool names. + + + hooksobject✗ + Lifecycle hooks for running commands at various points. See Hooks. + + + permissionsobject✗ + Control which tools are auto-approved, require confirmation, or are blocked. See Permissions. + + + sandboxobject✗ + Run shell commands in an isolated Docker container. See Sandbox Mode. + + + structured_outputobject✗ + Constrain agent output to match a JSON schema. See Structured Output. + @@ -95,6 +155,52 @@

Properties Reference

Default is 0 (unlimited). Always set max_iterations for agents with powerful tools like shell to prevent infinite loops. A value of 20–50 is typical for development agents.

+

Welcome Message

+ +

Display a message when users start a session:

+ +
agents:
+  assistant:
+    model: openai/gpt-4o
+    description: Development assistant
+    instruction: You are a helpful coding assistant.
+    welcome_message: |
+      👋 Welcome! I'm your development assistant.
+      
+      I can help you with:
+      - Writing and reviewing code
+      - Running tests and debugging
+      - Explaining concepts
+      
+      What would you like to work on?
+ +

Deferred Tool Loading

+ +

Load tools on-demand to speed up agent startup:

+ +
agents:
+  root:
+    model: anthropic/claude-sonnet-4-0
+    description: Multi-purpose assistant
+    instruction: You have access to many tools.
+    toolsets:
+      - type: mcp
+        ref: docker:github-official
+      - type: mcp
+        ref: docker:slack
+      - type: filesystem
+    # Defer all tools - load when first used
+    defer: true
+ +

Or defer specific tools:

+ +
agents:
+  root:
+    model: openai/gpt-4o
+    defer:
+      - "mcp:github:*"    # Defer all GitHub tools
+      - "mcp:slack:*"     # Defer all Slack tools
+
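structured_output has no worked example on this page. A minimal sketch (the field set of name, description, schema, and strict follows the dedicated Structured Output page; the schema contents are illustrative):

```yaml
agents:
  root:
    model: openai/gpt-4o
    description: Code reviewer
    instruction: Review the given code and report issues.
    structured_output:
      name: review_result
      description: Structured verdict of a code review
      strict: true
      schema:
        type: object
        properties:
          verdict: { type: string }
          issues: { type: array, items: { type: string } }
        required: [verdict, issues]
```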

Fallback Configuration

Automatically switch to backup models when the primary fails:

@@ -160,6 +266,7 @@

Complete Example

instruction: | You are a technical lead. Analyze requests and delegate to the right specialist. Always review work before responding. + welcome_message: "👋 I'm your tech lead. How can I help today?" sub_agents: [developer, researcher] add_date: true add_environment_info: true @@ -169,6 +276,10 @@

Complete Example

- type: think commands: review: "Review all recent code changes for issues" + hooks: + session_start: + - type: command + command: "./scripts/setup.sh" developer: model: claude @@ -180,6 +291,17 @@

Complete Example

- type: shell - type: think - type: todo + permissions: + allow: + - "read_*" + - "shell:cmd=go*" + - "shell:cmd=npm*" + deny: + - "shell:cmd=sudo*" + sandbox: + image: golang:1.23-alpine + paths: + - "." researcher: model: openai/gpt-4o diff --git a/docs/pages/configuration/hooks.html b/docs/pages/configuration/hooks.html new file mode 100644 index 000000000..bfc1bef90 --- /dev/null +++ b/docs/pages/configuration/hooks.html @@ -0,0 +1,233 @@ +

Hooks

+

Run shell commands at various points during agent execution for deterministic control over behavior.

+ +

Overview

+ +

Hooks allow you to execute shell commands or scripts at key points in an agent's lifecycle. They provide deterministic control that works alongside the LLM's behavior, enabling validation, logging, environment setup, and more.

+ +
+
ℹ️ Use Cases
+ +
+ +

Hook Types

+ +

There are four hook event types:

+ + + + + + + + + +
EventWhen it firesCan block?
pre_tool_useBefore a tool call executesYes
post_tool_useAfter a tool completes successfullyNo
session_startWhen a session begins or resumesNo
session_endWhen a session terminatesNo
+ +

Configuration

+ +
agents:
+  root:
+    model: openai/gpt-4o
+    description: An agent with hooks
+    instruction: You are a helpful assistant.
+    hooks:
+      # Run before specific tools
+      pre_tool_use:
+        - matcher: "shell|edit_file"
+          hooks:
+            - type: command
+              command: "./scripts/validate-command.sh"
+              timeout: 30
+      
+      # Run after all tool calls
+      post_tool_use:
+        - matcher: "*"
+          hooks:
+            - type: command
+              command: "./scripts/log-tool-call.sh"
+      
+      # Run when session starts
+      session_start:
+        - type: command
+          command: "./scripts/setup-env.sh"
+      
+      # Run when session ends
+      session_end:
+        - type: command
+          command: "./scripts/cleanup.sh"
+ +

Matcher Patterns

+ +

The matcher field uses regex patterns to match tool names:

+ + + + + + + + + +
PatternMatches
*All tools
shellOnly the shell tool
shell|edit_fileEither shell or edit_file
mcp:.*All MCP tools (regex)
+ +

Hook Input

+ +

Hooks receive JSON input via stdin with context about the event:

+ +
{
+  "session_id": "abc123",
+  "cwd": "/path/to/project",
+  "hook_event_name": "pre_tool_use",
+  "tool_name": "shell",
+  "tool_use_id": "call_xyz",
+  "tool_input": {
+    "cmd": "rm -rf /tmp/cache",
+    "cwd": "."
+  }
+}
+ +

Input Fields by Event Type

+ + + + + + + + + + + + + + +
Fieldpre_tool_usepost_tool_usesession_startsession_end
session_id
cwd
hook_event_name
tool_name
tool_use_id
tool_input
tool_response
source
reason
+ +

The source field for session_start can be: startup, resume, clear, or compact.

+

The reason field for session_end can be: clear, logout, prompt_input_exit, or other.

+ +
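For instance, a session_start hook can branch on source so that expensive setup runs only on a fresh startup, not on resume, clear, or compact. A sketch, written as a shell function for illustration (a real hook is a standalone script reading stdin, and the grep-based extraction is a simplification of the jq parsing used in the example scripts on this page):

```shell
# Hypothetical session_start hook: run one-time setup only when the
# session source is "startup". The setup work is represented by a
# stderr log line here.
on_session_start() {
  input=$(cat)
  if printf '%s' "$input" | grep -q '"source"[[:space:]]*:[[:space:]]*"startup"'; then
    echo "one-time setup would run here" >&2   # placeholder for real setup
  fi
  # Always let the session continue
  echo '{"continue": true}'
}
```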

Hook Output

+ +

Hooks communicate back via JSON output to stdout:

+ +
{
+  "continue": true,
+  "stop_reason": "Optional message when continue=false",
+  "suppress_output": false,
+  "system_message": "Warning message to show user",
+  "decision": "allow",
+  "reason": "Explanation for the decision",
+  "hook_specific_output": {
+    "hook_event_name": "pre_tool_use",
+    "permission_decision": "allow",
+    "permission_decision_reason": "Command is safe",
+    "updated_input": { "cmd": "modified command" }
+  }
+}
+ +

Output Fields

+ + + + + + + + + + + +
FieldTypeDescription
continuebooleanWhether to continue execution (default: true)
stop_reasonstringMessage to show when continue=false
suppress_outputbooleanHide stdout from transcript
system_messagestringWarning message to display to user
decisionstringFor blocking: block to prevent operation
reasonstringExplanation for the decision
+ +

Pre-Tool-Use Specific Output

+ +

The hook_specific_output for pre_tool_use supports:

+ + + + + + + + +
FieldTypeDescription
permission_decisionstringallow, deny, or ask
permission_decision_reasonstringExplanation for the decision
updated_inputobjectModified tool input (replaces original)
+ +

Exit Codes

+ +

Hook exit codes have special meaning:

+ + + + + + + + +
Exit CodeMeaning
0Success — continue normally
2Blocking error — stop the operation
OtherError — logged but execution continues
+ +

Example: Validation Script

+ +

A simple pre-tool-use hook that blocks dangerous shell commands:

+ +
#!/bin/bash
+# scripts/validate-command.sh
+
+# Read JSON input from stdin
+INPUT=$(cat)
+TOOL_NAME=$(echo "$INPUT" | jq -r '.tool_name')
+CMD=$(echo "$INPUT" | jq -r '.tool_input.cmd // empty')
+
+# Block dangerous commands
+if [[ "$TOOL_NAME" == "shell" ]]; then
+  if [[ "$CMD" =~ ^sudo ]] || [[ "$CMD" =~ rm.*-rf ]]; then
+    echo '{"decision": "block", "reason": "Dangerous command blocked by policy"}'
+    exit 2
+  fi
+fi
+
+# Allow everything else
+echo '{"decision": "allow"}'
+exit 0
+ +

Example: Audit Logging

+ +

A post-tool-use hook that logs all tool calls:

+ +
#!/bin/bash
+# scripts/log-tool-call.sh
+
+INPUT=$(cat)
+TIMESTAMP=$(date -u +"%Y-%m-%dT%H:%M:%SZ")
+TOOL_NAME=$(echo "$INPUT" | jq -r '.tool_name')
+SESSION_ID=$(echo "$INPUT" | jq -r '.session_id')
+
+# Append to audit log
+echo "$TIMESTAMP | $SESSION_ID | $TOOL_NAME" >> ./audit.log
+
+# Don't block execution
+echo '{"continue": true}'
+exit 0
+ +

Timeout

+ +

Hooks have a default timeout of 60 seconds. You can customize this per hook:

+ +
hooks:
+  pre_tool_use:
+    - matcher: "*"
+      hooks:
+        - type: command
+          command: "./slow-validation.sh"
+          timeout: 120  # 2 minutes
+ +
+
⚠️ Performance
+

Hooks run synchronously and can slow down agent execution. Keep hook scripts fast and efficient. Consider using suppress_output: true for logging hooks to reduce noise.

+
diff --git a/docs/pages/configuration/models.html b/docs/pages/configuration/models.html index 7a6f4bf3f..259f45469 100644 --- a/docs/pages/configuration/models.html +++ b/docs/pages/configuration/models.html @@ -13,8 +13,11 @@

Full Schema

frequency_penalty: float # Optional: 0.0–2.0 presence_penalty: float # Optional: 0.0–2.0 base_url: string # Optional: custom API endpoint + token_key: string # Optional: env var for API token thinking_budget: string|int # Optional: reasoning effort parallel_tool_calls: boolean # Optional: allow parallel tool calls + track_usage: boolean # Optional: track token usage + routing: [list] # Optional: rule-based model routing provider_opts: # Optional: provider-specific options key: value @@ -30,9 +33,12 @@

Properties Reference

top_pfloat✗Nucleus sampling threshold frequency_penaltyfloat✗Penalize repeated tokens (0.0–2.0) presence_penaltyfloat✗Encourage topic diversity (0.0–2.0) - base_urlstring✗Custom API endpoint URL + base_urlstring✗Custom API endpoint URL (for self-hosted or proxied endpoints) + token_keystring✗Environment variable name containing the API token (overrides provider default) thinking_budgetstring/int✗Reasoning effort control parallel_tool_callsboolean✗Allow model to call multiple tools at once + track_usageboolean✗Track and report token usage for this model + routingarray✗Rule-based routing to different models. See Model Routing. provider_optsobject✗Provider-specific options (see provider pages) @@ -128,3 +134,30 @@

Examples by Provider

max_tokens: 8192

For detailed provider setup, see the Model Providers section.

+ +

Custom Endpoints

+ +

Use base_url to point to custom or self-hosted endpoints:

+ +
models:
+  # Azure OpenAI
+  azure_gpt:
+    provider: openai
+    model: gpt-4o
+    base_url: https://my-resource.openai.azure.com/openai/deployments/gpt-4o
+    token_key: AZURE_OPENAI_API_KEY
+
+  # Self-hosted vLLM
+  local_llama:
+    provider: openai  # vLLM is OpenAI-compatible
+    model: meta-llama/Llama-3.2-3B-Instruct
+    base_url: http://localhost:8000/v1
+
+  # Proxy or gateway
+  proxied:
+    provider: openai
+    model: gpt-4o
+    base_url: https://proxy.internal.company.com/openai/v1
+    token_key: INTERNAL_API_KEY
+ +

See Local Models for more examples of custom endpoints.

diff --git a/docs/pages/configuration/overview.html b/docs/pages/configuration/overview.html index 8263fc310..fb7607d77 100644 --- a/docs/pages/configuration/overview.html +++ b/docs/pages/configuration/overview.html @@ -3,16 +3,25 @@

Configuration Overview

File Structure

-

A cagent YAML config has three main sections:

+

A cagent YAML config has these main sections:

-
# 1. Models — define AI models with their parameters
+
# 1. Version — configuration schema version (optional but recommended)
+version: 5
+
+# 2. Metadata — optional agent metadata for distribution
+metadata:
+  author: my-org
+  description: My helpful agent
+  version: "1.0.0"
+
+# 3. Models — define AI models with their parameters
 models:
   claude:
     provider: anthropic
     model: claude-sonnet-4-0
     max_tokens: 64000
 
-# 2. Agents — define AI agents with their behavior
+# 4. Agents — define AI agents with their behavior
 agents:
   root:
     model: claude
@@ -21,12 +30,25 @@ 

File Structure

toolsets: - type: think -# 3. Providers — optional custom provider definitions +# 5. RAG — define retrieval-augmented generation sources (optional) +rag: + docs: + docs: ["./docs"] + strategies: + - type: chunked-embeddings + model: openai/text-embedding-3-small + +# 6. Providers — optional custom provider definitions providers: my_provider: api_type: openai_chatcompletions base_url: https://api.example.com/v1 - token_key: MY_API_KEY
+ token_key: MY_API_KEY + +# 7. Permissions — global tool permission rules (optional) +permissions: + allow: ["read_*"] + deny: ["shell:cmd=sudo*"]

Minimal Config

@@ -61,7 +83,7 @@

Config Sections

🤖

Agent Config

-

All agent properties: model, instruction, tools, sub-agents, and more.

+

All agent properties: model, instruction, tools, sub-agents, permissions, hooks, and more.

🧠
@@ -71,7 +93,32 @@

Model Config

🔧

Tool Config

-

Built-in tools, MCP tools, Docker MCP, and tool filtering.

+

Built-in tools, MCP tools, Docker MCP, LSP, API tools, and tool filtering.

+
+ + +

Advanced Configuration

+ +
+ +
+

Hooks

+

Run shell commands at lifecycle events like tool calls and session start/end.

+
+ +
🔐
+

Permissions

+

Control which tools auto-approve, require confirmation, or are blocked.

+
+ +
📦
+

Sandbox Mode

+

Run shell commands in an isolated Docker container for security.

+
+ +
📋
+

Structured Output

+

Constrain agent responses to match a specific JSON schema.

@@ -113,3 +160,76 @@

JSON Schema

For editor autocompletion and validation, use the cagent JSON Schema. Add this to the top of your YAML file:

# yaml-language-server: $schema=https://raw.githubusercontent.com/docker/cagent/main/cagent-schema.json
+ +

Config Versioning

+ +

cagent configs are versioned. The current version is 5. Add the version at the top of your config:

+ +
version: 5
+
+agents:
+  root:
+    model: openai/gpt-4o
+    # ...
+ +

When you load an older config, cagent automatically migrates it to the latest schema. Including the version explicitly is recommended to ensure consistent behavior.

+ +

Metadata Section

+ +

Optional metadata for agent distribution via OCI registries:

+ +
metadata:
+  author: my-org
+  license: Apache-2.0
+  description: A helpful coding assistant
+  readme: |  # Displayed in registries
+    This agent helps with coding tasks.
+  version: "1.0.0"
+ + + + + + + + + + +
FieldDescription
authorAuthor or organization name
licenseLicense identifier (e.g., Apache-2.0, MIT)
descriptionShort description for the agent
readmeLonger markdown description
versionSemantic version string
+ +

See Agent Distribution for publishing agents to registries.

+ +

Custom Providers Section

+ +

Define reusable provider configurations for custom or self-hosted endpoints:

+ +
providers:
+  azure:
+    api_type: openai_chatcompletions
+    base_url: https://my-resource.openai.azure.com/openai/deployments/gpt-4o
+    token_key: AZURE_OPENAI_API_KEY
+  
+  internal_llm:
+    api_type: openai_chatcompletions
+    base_url: https://llm.internal.company.com/v1
+    token_key: INTERNAL_API_KEY
+
+models:
+  azure_gpt:
+    provider: azure  # References the custom provider
+    model: gpt-4o
+
+agents:
+  root:
+    model: azure_gpt
+ + + + + + + + +
FieldDescription
api_typeAPI schema: openai_chatcompletions (default) or openai_responses
base_urlBase URL for the API endpoint
token_keyEnvironment variable name for the API token
+ +

See Custom Providers for more details.

diff --git a/docs/pages/configuration/permissions.html b/docs/pages/configuration/permissions.html new file mode 100644 index 000000000..3715dd001 --- /dev/null +++ b/docs/pages/configuration/permissions.html @@ -0,0 +1,181 @@ +

Permissions

+

Control which tools can execute automatically, require confirmation, or are blocked entirely.

+ +

Overview

+ +

Permissions provide fine-grained control over tool execution. You can configure which tools are auto-approved (run without asking), which require user confirmation, and which are completely blocked.

+ +
+
ℹ️ Evaluation Order
+

Permissions are evaluated in this order: Deny → Allow → Ask. Deny patterns take priority, then allow patterns, and anything else defaults to asking for user confirmation.

+
+ +

Configuration

+ +
agents:
+  root:
+    model: openai/gpt-4o
+    description: Agent with permission controls
+    instruction: You are a helpful assistant.
+    permissions:
+      # Auto-approve these tools (no confirmation needed)
+      allow:
+        - "read_file"
+        - "read_*"           # Glob patterns
+        - "shell:cmd=ls*"    # With argument matching
+      
+      # Block these tools entirely
+      deny:
+        - "shell:cmd=sudo*"
+        - "shell:cmd=rm*-rf*"
+        - "dangerous_tool"
+ +

Pattern Syntax

+ +

Permissions support glob-style patterns with optional argument matching:

+ +

Simple Patterns

+ + + + + + + + + +
PatternMatches
shellExact match for shell tool
read_*Any tool starting with read_
mcp:github:*Any GitHub MCP tool
*All tools
+ +
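These patterns drop directly into the allow/deny lists. A minimal sketch combining the patterns from the table above (tool names are illustrative):

```yaml
permissions:
  allow:
    - "read_*"          # any tool whose name starts with read_
    - "mcp:github:*"    # any tool from the GitHub MCP server
  deny:
    - "shell"           # exact match: block the shell tool entirely
```

Anything not matched by either list falls through to the default Ask behavior.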

Argument Matching

+ +

You can match tools based on their argument values using tool:arg=pattern syntax:

+ +
permissions:
+  allow:
+    # Allow shell only when cmd starts with "ls" or "cat"
+    - "shell:cmd=ls*"
+    - "shell:cmd=cat*"
+    
+    # Allow edit_file only in specific directory
+    - "edit_file:path=/home/user/safe/*"
+  
+  deny:
+    # Block shell with sudo
+    - "shell:cmd=sudo*"
+    
+    # Block writes to system directories
+    - "write_file:path=/etc/*"
+    - "write_file:path=/usr/*"
+ +

Multiple Argument Conditions

+ +

Chain multiple argument conditions with colons. All conditions must match:

+ +
permissions:
+  allow:
+    # Allow shell with ls in current directory
+    - "shell:cmd=ls*:cwd=."
+  
+  deny:
+    # Block shell with rm -rf anywhere
+    - "shell:cmd=rm*:cmd=*-rf*"
+ +

Glob Pattern Rules

+ +

Patterns follow filepath.Match semantics with some extensions:

+ + + +

Matching is case-insensitive.

+ +
+
💡 Trailing Wildcards
+

Trailing wildcards like sudo* match any characters including spaces, so sudo* matches sudo rm -rf /.

+
+ +

Decision Types

+ + + + + + + + +
DecisionBehavior
AllowTool executes immediately without user confirmation
AskUser must confirm before tool executes (default)
DenyTool is blocked and returns an error to the agent
+ +

Examples

+ +

Read-Only Agent

+ +

Allow all read operations, block all writes:

+ +
permissions:
+  allow:
+    - "read_file"
+    - "read_multiple_files"
+    - "list_directory"
+    - "directory_tree"
+    - "search_files_content"
+  deny:
+    - "write_file"
+    - "edit_file"
+    - "shell"
+ +

Safe Shell Agent

+ +

Allow specific safe commands, block dangerous ones:

+ +
permissions:
+  allow:
+    - "shell:cmd=ls*"
+    - "shell:cmd=cat*"
+    - "shell:cmd=grep*"
+    - "shell:cmd=find*"
+    - "shell:cmd=head*"
+    - "shell:cmd=tail*"
+    - "shell:cmd=wc*"
+  deny:
+    - "shell:cmd=sudo*"
+    - "shell:cmd=rm*"
+    - "shell:cmd=mv*"
+    - "shell:cmd=chmod*"
+    - "shell:cmd=chown*"
+ +

MCP Tool Permissions

+ +

Control MCP tools by their qualified names:

+ +
permissions:
+  allow:
+    # Allow all GitHub read operations
+    - "mcp:github:get_*"
+    - "mcp:github:list_*"
+    - "mcp:github:search_*"
+  deny:
+    # Block destructive GitHub operations
+    - "mcp:github:delete_*"
+    - "mcp:github:close_*"
+ +

Combining with Hooks

+ +

Permissions work alongside hooks. The evaluation order is:

+ +
  1. Check deny patterns — if matched, tool is blocked
  2. Check allow patterns — if matched, tool is auto-approved
  3. Run pre_tool_use hooks — hooks can allow, deny, or ask
  4. If no decision, ask the user for confirmation
+ +

Hooks can override allow decisions but cannot override deny decisions.

+ +
+
⚠️ Security Note
+

Permissions are enforced client-side. They help prevent accidental operations but should not be relied upon as a security boundary for untrusted agents. For stronger isolation, use sandbox mode.

+
diff --git a/docs/pages/configuration/routing.html b/docs/pages/configuration/routing.html new file mode 100644 index 000000000..8dbb9f6aa --- /dev/null +++ b/docs/pages/configuration/routing.html @@ -0,0 +1,160 @@ +

Model Routing

+

Route requests to different models based on the content of user messages.

+ +

Overview

+ +

Model routing lets you define a "router" model that automatically selects the best underlying model based on the user's message. This is useful for cost optimization, specialized handling, or load balancing across models.

+ +
+
ℹ️ How It Works
+

cagent uses NLP-based text similarity (via Bleve full-text search) to match user messages against example phrases you define. The route with the best-matching examples wins, and that model handles the request.

+
+ +

Configuration

+ +

Add routing rules to any model definition. The model's provider/model fields become the fallback when no route matches:

+ +
models:
+  smart_router:
+    # Fallback model when no routing rule matches
+    provider: openai
+    model: gpt-4o-mini
+    
+    # Routing rules
+    routing:
+      - model: anthropic/claude-sonnet-4-0
+        examples:
+          - "Write a detailed technical document"
+          - "Help me architect this system"
+          - "Review this code for security issues"
+          - "Explain this complex algorithm"
+      
+      - model: openai/gpt-4o
+        examples:
+          - "Generate some creative ideas"
+          - "Write a story about"
+          - "Help me brainstorm"
+          - "Come up with names for"
+      
+      - model: openai/gpt-4o-mini
+        examples:
+          - "What time is it"
+          - "Convert this to JSON"
+          - "Simple math calculation"
+          - "Translate this word"
+
+agents:
+  root:
+    model: smart_router
+    description: Assistant with intelligent model routing
+    instruction: You are a helpful assistant.
+ +

Routing Rules

+ +

Each routing rule has:

+ + + + + + + +
FieldTypeRequiredDescription
modelstringTarget model (inline format or reference to models section)
examplesarrayExample phrases that should route to this model
+ +

Matching Behavior

+ +

The router:

+ +
  1. Extracts the last user message from the conversation
  2. Searches all examples using full-text search
  3. Aggregates match scores by route (best score per route wins)
  4. Selects the route with the highest overall score
  5. Falls back to the base model if no good match is found
+ +
+
💡 Writing Good Examples
+ +
+ +

Use Cases

+ +

Cost Optimization

+ +

Route simple queries to cheaper models:

+ +
models:
+  cost_optimizer:
+    provider: openai
+    model: gpt-4o-mini  # Cheap fallback
+    routing:
+      - model: anthropic/claude-sonnet-4-0
+        examples:
+          - "Complex analysis"
+          - "Detailed research"
+          - "Multi-step reasoning"
+ +

Specialized Models

+ +

Route coding tasks to code-specialized models:

+ +
models:
+  task_router:
+    provider: openai
+    model: gpt-4o  # General fallback
+    routing:
+      - model: anthropic/claude-sonnet-4-0
+        examples:
+          - "Write code"
+          - "Debug this function"
+          - "Review my implementation"
+          - "Fix this bug"
+      - model: openai/gpt-4o
+        examples:
+          - "Write a blog post"
+          - "Help me with writing"
+          - "Summarize this document"
+ +

Load Balancing

+ +

Distribute load across equivalent models from different providers:

+ +
models:
+  load_balancer:
+    provider: openai
+    model: gpt-4o
+    routing:
+      - model: anthropic/claude-sonnet-4-0
+        examples:
+          - "First request pattern"
+          - "Another request type"
+      - model: google/gemini-2.5-flash
+        examples:
+          - "Different request pattern"
+          - "Alternative query style"
+ +

Debugging

+ +

Enable debug logging to see routing decisions:

+ +
$ cagent run config.yaml --debug
+ +

Look for log entries like:

+ +
"Rule-based router selected model" router=smart_router selected_model=anthropic/claude-sonnet-4-0
+"Route matched" model=anthropic/claude-sonnet-4-0 score=2.45
+ +
+
⚠️ Limitations
+ +
diff --git a/docs/pages/configuration/sandbox.html b/docs/pages/configuration/sandbox.html new file mode 100644 index 000000000..cfbf55f6d --- /dev/null +++ b/docs/pages/configuration/sandbox.html @@ -0,0 +1,152 @@ +

Sandbox Mode

+

Run shell commands in an isolated Docker container for enhanced security.

+ +

Overview

+ +

Sandbox mode runs shell tool commands inside a Docker container instead of directly on the host system. This provides an additional layer of isolation, limiting the potential impact of unintended or malicious commands.

+ +
+
ℹ️ Requirements
+

Sandbox mode requires Docker to be installed and running on the host system.

+
+ +

Configuration

+ +
agents:
+  root:
+    model: openai/gpt-4o
+    description: Agent with sandboxed shell
+    instruction: You are a helpful assistant.
+    toolsets:
+      - type: shell
+    sandbox:
+      image: alpine:latest    # Docker image to use
+      paths:                  # Directories to mount
+        - "."                 # Current directory (read-write)
+        - "/data:ro"          # Read-only mount
+ +

Properties

+ + + + + + + +
PropertyTypeDefaultDescription
imagestringalpine:latestDocker image to use for the sandbox container
pathsarray[]Host paths to mount into the container
+ +

Path Mounting

+ +

Paths can be specified with optional access modes:

+ + + + + + + + + + +
FormatDescription
/pathMount with read-write access (default)
/path:rwExplicitly read-write
/path:roRead-only mount
.Current working directory
./relativeRelative path (resolved from working directory)
+ +

Paths are mounted at the same location inside the container as on the host, so file paths in commands work the same way.

+ +
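For instance, the mount forms above can be combined in a single sandbox block (paths are illustrative):

```yaml
sandbox:
  image: alpine:latest
  paths:
    - "."            # current directory, read-write by default
    - "./build:rw"   # relative path, explicitly read-write
    - "/data:ro"     # absolute path, read-only
```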

Example: Development Agent

+ +
agents:
+  developer:
+    model: anthropic/claude-sonnet-4-0
+    description: Development agent with sandboxed shell
+    instruction: |
+      You are a software developer. Use the shell tool to run
+      build commands and tests. Your shell runs in a sandbox.
+    toolsets:
+      - type: shell
+      - type: filesystem
+    sandbox:
+      image: node:20-alpine    # Node.js environment
+      paths:
+        - "."                  # Project directory
+        - "/tmp:rw"            # Temp directory for builds
+ +

How It Works

+ +
  1. When the agent first uses the shell tool, cagent starts a Docker container
  2. The container runs with the specified image and mounted paths
  3. Shell commands execute inside the container via docker exec
  4. The container persists for the session (commands share state)
  5. When the session ends, the container is automatically stopped and removed
+ +

Container Configuration

+ +

Sandbox containers are started with these Docker options:

+ + + +

Orphan Container Cleanup

+ +

If cagent crashes or is killed, sandbox containers may be left running. cagent automatically cleans up orphaned containers from previous runs when it starts. Containers are identified by labels and the PID of the cagent process that created them.

+ +

Choosing an Image

+ +

Select a Docker image that has the tools your agent needs:

+ + + + + + + + + + +
Use CaseSuggested Image
General scriptingalpine:latest
Node.js developmentnode:20-alpine
Python developmentpython:3.12-alpine
Go developmentgolang:1.23-alpine
Full Linux environmentubuntu:24.04
+ +
+
💡 Custom Images
+

For complex setups, build a custom Docker image with all required tools pre-installed. This avoids installation time during agent execution.

+
+ +
+
⚠️ Limitations
+ +
+ +

Combining with Permissions

+ +

For defense in depth, combine sandbox mode with permissions:

+ +
agents:
+  root:
+    model: openai/gpt-4o
+    description: Secure development agent
+    instruction: You are a helpful assistant.
+    toolsets:
+      - type: shell
+      - type: filesystem
+    sandbox:
+      image: node:20-alpine
+      paths:
+        - ".:rw"
+    permissions:
+      allow:
+        - "shell:cmd=npm*"
+        - "shell:cmd=node*"
+        - "shell:cmd=ls*"
+      deny:
+        - "shell:cmd=sudo*"
+        - "shell:cmd=curl*"
+        - "shell:cmd=wget*"
diff --git a/docs/pages/configuration/structured-output.html b/docs/pages/configuration/structured-output.html new file mode 100644 index 000000000..0445ed56a --- /dev/null +++ b/docs/pages/configuration/structured-output.html @@ -0,0 +1,213 @@ +

Structured Output

+

Force the agent to respond with JSON matching a specific schema.

+ +

Overview

+ +

Structured output constrains the agent's responses to match a predefined JSON schema. This is useful for building agents that need to produce machine-readable output for downstream processing, API responses, or integration with other systems.

+ +
+
ℹ️ When to Use
+ +
+ +

Configuration

+ +
agents:
+  analyzer:
+    model: openai/gpt-4o
+    description: Code analyzer that outputs structured results
+    instruction: |
+      Analyze the provided code and identify issues.
+      Return your findings in the structured format.
+    structured_output:
+      name: analysis_result
+      description: Code analysis findings
+      strict: true
+      schema:
+        type: object
+        properties:
+          issues:
+            type: array
+            items:
+              type: object
+              properties:
+                severity:
+                  type: string
+                  enum: ["error", "warning", "info"]
+                line:
+                  type: integer
+                message:
+                  type: string
+              required: ["severity", "line", "message"]
+          summary:
+            type: string
+        required: ["issues", "summary"]
+ +
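With this configuration, a response from the analyzer must conform to the schema. It might look like the following (values are illustrative):

```json
{
  "issues": [
    {
      "severity": "warning",
      "line": 42,
      "message": "Variable 'result' is assigned but never used"
    }
  ],
  "summary": "One minor issue found; no errors."
}
```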

Properties

+ + + + + + + + + +
PropertyTypeRequiredDescription
namestringName identifier for the output schema
descriptionstringDescription of what the output represents
strictbooleanEnforce strict schema validation (default: false)
schemaobjectJSON Schema defining the output structure
+ +

Schema Format

+ +

The schema follows the JSON Schema specification. Common schema shapes:

+ +

Simple Object

+ +
schema:
+  type: object
+  properties:
+    name:
+      type: string
+    count:
+      type: integer
+    active:
+      type: boolean
+  required: ["name", "count"]
+ +

Array of Objects

+ +
schema:
+  type: object
+  properties:
+    items:
+      type: array
+      items:
+        type: object
+        properties:
+          id:
+            type: string
+          value:
+            type: number
+        required: ["id", "value"]
+  required: ["items"]
+ +

Enum Values

+ +
schema:
+  type: object
+  properties:
+    status:
+      type: string
+      enum: ["pending", "approved", "rejected"]
+    priority:
+      type: string
+      enum: ["low", "medium", "high", "critical"]
+  required: ["status"]
+ +

Strict Mode

+ +

When strict: true, the model is constrained to only produce output that exactly matches the schema. This provides stronger guarantees but may limit the model's flexibility.

+ +
+
+

strict: false (default)

+

Model aims to match schema but may include additional fields or slight variations.

+
+
+

strict: true

+

Model output is constrained to exactly match the schema. Stronger guarantees.

+
+
+ +

Provider Support

+ +

Structured output support varies by provider:

+ + + + + + + + + + +
ProviderSupportNotes
OpenAI✓ FullNative JSON mode with schema validation
Anthropic✓ FullTool-based structured output
Google Gemini✓ FullNative JSON mode
AWS Bedrock✓ PartialDepends on underlying model
DMR⚠️ LimitedDepends on model capabilities
+ +

Example: Data Extraction Agent

+ +
agents:
+  extractor:
+    model: openai/gpt-4o
+    description: Extract structured data from text
+    instruction: |
+      Extract contact information from the provided text.
+      Return all found contacts in the structured format.
+    structured_output:
+      name: contacts
+      description: Extracted contact information
+      strict: true
+      schema:
+        type: object
+        properties:
+          contacts:
+            type: array
+            items:
+              type: object
+              properties:
+                name:
+                  type: string
+                  description: Full name of the contact
+                email:
+                  type: string
+                  description: Email address
+                phone:
+                  type: string
+                  description: Phone number
+                company:
+                  type: string
+                  description: Company or organization
+              required: ["name"]
+          total_found:
+            type: integer
+            description: Total number of contacts found
+        required: ["contacts", "total_found"]
+ +

Example: Classification Agent

+ +
agents:
+  classifier:
+    model: anthropic/claude-sonnet-4-0
+    description: Classify support tickets
+    instruction: |
+      Classify the support ticket into the appropriate category
+      and priority level based on its content.
+    structured_output:
+      name: ticket_classification
+      strict: true
+      schema:
+        type: object
+        properties:
+          category:
+            type: string
+            enum: ["billing", "technical", "account", "feature_request", "other"]
+          priority:
+            type: string
+            enum: ["low", "medium", "high", "urgent"]
+          confidence:
+            type: number
+            minimum: 0
+            maximum: 1
+            description: Confidence score between 0 and 1
+          reasoning:
+            type: string
+            description: Brief explanation for the classification
+        required: ["category", "priority", "confidence"]
+ +
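A conforming classification might look like this (values are illustrative; note that reasoning may be omitted since it is not listed in required):

```json
{
  "category": "billing",
  "priority": "high",
  "confidence": 0.92,
  "reasoning": "The user disputes a charge on their most recent invoice."
}
```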
+
⚠️ Tool Limitations
+

When using structured output, the agent typically cannot use tools since its response format is constrained to the schema. Design your agent workflow accordingly — structured output agents work best for single-turn analysis or extraction tasks.

+
diff --git a/docs/pages/configuration/tools.html b/docs/pages/configuration/tools.html index 3772c28e8..a760d5660 100644 --- a/docs/pages/configuration/tools.html +++ b/docs/pages/configuration/tools.html @@ -8,7 +8,11 @@

Built-in Tools

Filesystem

Read, write, list, search, and navigate files in the working directory.

toolsets:
-  - type: filesystem
+  - type: filesystem
+    ignore_vcs: false   # Optional: ignore .gitignore files
+    post_edit:          # Optional: run commands after file edits
+      - path: "*.go"
+        cmd: "gofmt -w ${file}"
@@ -23,6 +27,16 @@

Filesystem

OperationDescription
+ + + + + + + + +
PropertyTypeDefaultDescription
ignore_vcsbooleanfalseWhen true, ignores .gitignore patterns and includes all files
post_editarray[]Commands to run after editing files matching a path pattern
post_edit[].pathstringGlob pattern for files (e.g., *.go, src/**/*.ts)
post_edit[].cmdstringCommand to run (use ${file} for the edited file path)
+
💡 Tip

The filesystem tool resolves paths relative to the working directory. Agents can also use absolute paths.

@@ -31,10 +45,21 @@

Filesystem

Shell

Execute arbitrary shell commands. Each call runs in a fresh, isolated shell session — no state persists between calls.

toolsets:
-  - type: shell
+  - type: shell
+    env:                # Optional: environment variables
+      MY_VAR: "value"
+      PATH: "${PATH}:/custom/bin"

The agent has access to the full system shell and environment variables. Commands have a default 30-second timeout. Requires user confirmation unless --yolo is used.

+ + + + + + +
PropertyTypeDescription
envobjectEnvironment variables to set for all shell commands
sandboxobjectRun commands in a Docker container. See Sandbox Mode.
+

Think

Step-by-step reasoning scratchpad. The agent writes its thoughts without producing visible output — ideal for planning, decomposition, and decision-making.

toolsets:
@@ -45,7 +70,8 @@ 

Think

Todo

Task list management. Agents can create, update, and track tasks with status (pending, in-progress, completed).

toolsets:
-  - type: todo
+  - type: todo
+    shared: false   # Optional: share todos across agents
@@ -57,6 +83,13 @@

Todo

OperationDescription
+ + + + + +
PropertyTypeDefaultDescription
sharedbooleanfalseWhen true, todos are shared across all agents in a multi-agent config
+

Memory

Persistent key-value storage backed by SQLite. Data survives across sessions, letting agents remember context, user preferences, and past decisions.

toolsets:
@@ -73,37 +106,152 @@ 

Memory

Fetch

Make HTTP requests to external APIs and web services.

toolsets:
-  - type: fetch
+  - type: fetch
+    timeout: 30   # Optional: request timeout in seconds

Supports GET, POST, PUT, DELETE, and other HTTP methods. The agent can set headers, send request bodies, and receive response data. Useful for calling REST APIs, reading web pages, and downloading content.

+ + + + + +
PropertyTypeDefaultDescription
timeoutint30Request timeout in seconds
+

Script

Define custom shell scripts as named tools. Unlike the generic shell tool, scripts are predefined and can be given descriptive names — ideal for exposing safe, well-scoped operations.

+ +

Simple format:

toolsets:
   - type: script
-    scripts:
-      - name: run_tests
+    shell:
+      run_tests:
+        cmd: task test
         description: Run the project test suite
-        command: task test
-      - name: lint
+      lint:
+        cmd: task lint
         description: Run the linter
-        command: task lint
-      - name: deploy_staging
-        description: Deploy to the staging environment
-        command: ./scripts/deploy.sh staging
+      deploy:
+        cmd: ./scripts/deploy.sh ${env}
+        description: Deploy to an environment
+        args:
+          env:
+            type: string
+            enum: [staging, production]
+        required: [env]
PropertyTypeDescription
scripts[].namestringTool name the agent sees and calls
scripts[].descriptionstringDescription shown to the model for tool selection
scripts[].commandstringShell command to execute when the tool is called
shell.<name>.cmdstringShell command to execute (supports ${arg} interpolation)
shell.<name>.descriptionstringDescription shown to the model
shell.<name>.argsobjectParameter definitions (JSON Schema properties)
shell.<name>.requiredarrayRequired parameter names
shell.<name>.envobjectEnvironment variables for this script
shell.<name>.working_dirstringWorking directory for script execution

Transfer Task

The transfer_task tool is automatically available when an agent has sub_agents. Allows delegating tasks to sub-agents. No configuration needed — it's enabled implicitly.

+

LSP (Language Server Protocol)

+

Connect to language servers for code intelligence: go-to-definition, find references, diagnostics, and more.

+
toolsets:
+  - type: lsp
+    command: gopls
+    args: []
+    file_types: [".go"]
+ + + + + + + + + +
PropertyTypeDescription
commandstringLSP server executable command
argsarrayCommand-line arguments for the LSP server
envobjectEnvironment variables for the LSP process
file_typesarrayFile extensions this LSP handles
+ +

See LSP Tool for full documentation.

+ +

User Prompt

+

Ask users questions and collect interactive input during agent execution.

+
toolsets:
+  - type: user_prompt
+ +

The agent can use this tool to ask questions, present choices, or collect information from the user. Supports JSON Schema for structured input validation.

+ +

See User Prompt Tool for full documentation.

+ +

API

+

Create custom tools that call HTTP APIs without writing code.

+
toolsets:
+  - type: api
+    name: get_weather
+    method: GET
+    endpoint: "https://api.weather.example/v1/current?city=${city}"
+    instruction: Get current weather for a city
+    args:
+      city:
+        type: string
+        description: City name
+    required: ["city"]
+    headers:
+      Authorization: "Bearer ${env.WEATHER_API_KEY}"
+ + + + + + + + + + + +
PropertyTypeDescription
namestringTool name
methodstringHTTP method: GET or POST
endpointstringURL with ${param} interpolation
argsobjectParameter definitions
requiredarrayRequired parameter names
headersobjectHTTP headers (supports ${env.VAR})
+ +

See API Tool for full documentation.

+ +

Handoff

+

Delegate tasks to remote agents via the A2A (Agent-to-Agent) protocol.

+
toolsets:
+  - type: handoff
+    name: research_agent
+    description: Specialized research agent
+    url: "http://localhost:8080/a2a"
+    timeout: 5m
+ + + + + + + + + +
PropertyTypeDescription
namestringTool name for delegation
descriptionstringDescription for the agent
urlstringA2A server endpoint URL
timeoutstringRequest timeout (default: 5m)
+ +

See A2A Protocol for full documentation.

+ +

A2A (Agent-to-Agent)

+

Connect to remote agents via the A2A protocol. Similar to handoff but configured as a toolset.

+
toolsets:
+  - type: a2a
+    name: research_agent
+    url: "http://localhost:8080/a2a"
+ + + + + + + +
PropertyTypeDescription
namestringTool name for the remote agent
urlstringA2A server endpoint URL
+ +

See A2A Protocol for full documentation.

+

MCP Tools

Extend agents with external tools via the Model Context Protocol.

@@ -128,6 +276,7 @@

Docker MCP (Recommended)

refstringDocker MCP reference (docker:name)
toolsarrayOptional: only expose these tools
instructionstringCustom instructions injected into the agent's context
+configanyMCP server-specific configuration (passed during initialization)
@@ -220,6 +369,11 @@

Combined Example

  - type: todo
  - type: memory
    path: ./dev.db
+  - type: user_prompt
+  # LSP for code intelligence
+  - type: lsp
+    command: gopls
+    file_types: [".go"]
  # Custom scripts
  - type: script
    scripts:
@@ -229,6 +383,12 @@

Combined Example

      - name: lint
        description: Run the linter
        command: task lint
+  # Custom API tool
+  - type: api
+    name: get_status
+    method: GET
+    endpoint: "https://api.example.com/status"
+    instruction: Check service health
  # Docker MCP tools
  - type: mcp
    ref: docker:github-official
diff --git a/docs/pages/features/cli.html b/docs/pages/features/cli.html
index 3b85ecc21..14130b21e 100644
--- a/docs/pages/features/cli.html
+++ b/docs/pages/features/cli.html
@@ -19,8 +19,8 @@

cagent run

-a, --agent <name>Run a specific agent from the config --yoloAuto-approve all tool calls --model <ref>Override model(s). Use provider/model for all agents, or agent=provider/model for specific agents. Comma-separate multiple overrides. - --session <id>Resume a previous session. Supports relative refs (-1, -2) - --prompt-file <path>Include file contents as system context + --session <id>Resume a previous session. Supports relative refs (-1 = last, -2 = second to last) + --prompt-file <path>Include file contents as additional system context (repeatable) -c <name>Run a named command from the YAML config -d, --debugEnable debug logging --log-file <path>Custom debug log location @@ -35,7 +35,11 @@

cagent run

$ cagent run agent.yaml --model anthropic/claude-sonnet-4-0
$ cagent run agent.yaml --model "dev=openai/gpt-4o,reviewer=anthropic/claude-sonnet-4-0"
$ cagent run agent.yaml --session -1   # resume last session
-$ cagent run agent.yaml -c df # run named command
+$ cagent run agent.yaml -c df                      # run named command
+$ cagent run agent.yaml --prompt-file ./context.md # include file as context
+
+# Queue multiple messages (processed in sequence)
+$ cagent run agent.yaml "question 1" "question 2" "question 3"

cagent exec

Run an agent in non-interactive (headless) mode. No TUI — output goes to stdout.

@@ -116,7 +120,21 @@

cagent push / cagent pull

cagent eval

Run agent evaluations.

-
$ cagent eval eval-config.yaml
+
$ cagent eval eval-config.yaml
+
+# With flags
+$ cagent eval agent.yaml ./evals -c 8              # 8 concurrent evaluations
+$ cagent eval agent.yaml --keep-containers         # Keep containers for debugging
+$ cagent eval agent.yaml --only "auth*"            # Only run matching evals
+ +

cagent build

+

Build agent configuration into a distributable format.

+ +
$ cagent build [config] [flags]
+
+# Examples
+$ cagent build agent.yaml
+$ cagent build agent.yaml -o ./dist

cagent alias

Manage agent aliases for quick access.

diff --git a/docs/pages/features/evaluation.html b/docs/pages/features/evaluation.html index 0bb6985b8..bca8a52e6 100644 --- a/docs/pages/features/evaluation.html +++ b/docs/pages/features/evaluation.html @@ -195,10 +195,16 @@

Output

+
+
💡 Debugging Failed Evals
+

Use --keep-containers to preserve containers after evaluation. You can then inspect them with docker exec to understand why an eval failed. The session database (.db file) contains the full conversation history for each eval.

+
+
$ cagent eval demo.yaml ./evals
 
   ✓ Counting Files in Local Folder
diff --git a/docs/pages/features/rag.html b/docs/pages/features/rag.html
index 2b7cbc4ae..378d5aa8a 100644
--- a/docs/pages/features/rag.html
+++ b/docs/pages/features/rag.html
@@ -168,13 +168,13 @@ 

Configuration Reference

Top-Level RAG Fields

- + - - - - - + + + + +
FieldTypeDescription
FieldTypeDefaultDescription
docs[]stringDocument paths/directories (shared across strategies)
descriptionstringHuman-readable description of this RAG source
respect_vcsbooleanRespect .gitignore files (default: true)
strategies[]objectArray of retrieval strategy configurations
resultsobjectPost-processing: fusion, reranking, deduplication, final limit
docs[]stringDocument paths/directories (shared across strategies)
descriptionstringHuman-readable description of this RAG source
respect_vcsbooleantrueRespect .gitignore files when indexing documents
strategies[]objectArray of retrieval strategy configurations
resultsobjectPost-processing: fusion, reranking, deduplication, final limit
@@ -239,8 +239,10 @@

Results (Post-Processing)

fusion.strategystringrrfFusion method: rrf, weighted, or max fusion.kint60RRF rank constant - deduplicateboolfalseRemove duplicate results - limitint5Final number of results + deduplicatebooltrueRemove duplicate results + limitint15Final number of results + include_scoreboolfalseInclude relevance scores in results + return_full_contentboolfalseReturn full document content instead of just matched chunks reranking.modelstring—Reranking model reference reranking.top_kint(all)Only rerank top K results reranking.thresholdfloat0.5Minimum relevance score after reranking diff --git a/docs/pages/features/skills.html b/docs/pages/features/skills.html index f8204bd54..d2b39fce4 100644 --- a/docs/pages/features/skills.html +++ b/docs/pages/features/skills.html @@ -53,9 +53,9 @@

Global

- - - + + +
PathSearch Type
~/.codex/skills/Recursive
~/.claude/skills/Flat
~/.agents/skills/Recursive
~/.codex/skills/Recursive (searches all subdirectories)
~/.claude/skills/Flat (immediate children only)
~/.agents/skills/Recursive (searches all subdirectories)
@@ -68,6 +68,22 @@

Project (from git root to current directory)

+

Invoking Skills

+ +

Skills can be invoked in multiple ways:

+ +
  • Automatic: The agent detects when your request matches a skill's description and loads it automatically
  • Explicit: Reference the skill name in your prompt: "Use the create-dockerfile skill to..."
  • Slash command: Use /{skill-name} to invoke a skill directly
+ +
# In the TUI, invoke skill directly:
+/create-dockerfile
+
+# Or mention it in your message:
+"Create a dockerfile for my Python app (use the create-dockerfile skill)"
+

Precedence

When multiple skills share the same name:

diff --git a/docs/pages/features/tui.html b/docs/pages/features/tui.html index ac234cc2a..efcc62063 100644 --- a/docs/pages/features/tui.html +++ b/docs/pages/features/tui.html @@ -120,13 +120,20 @@

Keyboard Shortcuts

Ctrl+KOpen command palette
Ctrl+MSwitch model
- Ctrl+RReverse history search
- Ctrl+LAudio listening mode
- Ctrl+ZSuspend to background
+ Ctrl+RReverse history search (search previous inputs)
+ Ctrl+LStart audio listening mode (voice input)
+ Ctrl+ZSuspend TUI to background (resume with fg)
+ Ctrl+XClear queued messages
EscapeCancel current operation
+ EnterSend message (or newline with Shift+Enter)
+ Up/DownNavigate message history
+

History Search

+ +

Press Ctrl+R to enter incremental history search mode. Start typing to filter through your previous inputs. Press Enter to select a match, or Escape to cancel.

+

Theming

Customize the TUI appearance with built-in or custom themes:

diff --git a/docs/pages/guides/go-sdk.html b/docs/pages/guides/go-sdk.html new file mode 100644 index 000000000..f9db9b769 --- /dev/null +++ b/docs/pages/guides/go-sdk.html @@ -0,0 +1,344 @@ +

Go SDK

+

Use cagent as a Go library to embed AI agents in your applications.

+ +

Overview

+ +

cagent can be used as a Go library, allowing you to build AI agents directly into your Go applications. This gives you full programmatic control over agent creation, tool integration, and execution.

+ +
+
ℹ️ Import Path
+
import "github.com/docker/cagent/pkg/..."
+
+ +

Core Packages

+ + + + + + + + + + + + + + +
PackagePurpose
pkg/agentAgent creation and configuration
pkg/runtimeAgent execution and event streaming
pkg/sessionConversation state management
pkg/teamMulti-agent team composition
pkg/toolsTool interface and utilities
pkg/tools/builtinBuilt-in tools (shell, filesystem, etc.)
pkg/model/provider/*Model provider clients
pkg/config/latestConfiguration types
pkg/environmentEnvironment and secrets
+ +

Basic Example

+ +

Create a simple agent and run it:

+ +
package main
+
+import (
+    "context"
+    "fmt"
+    "log"
+    "os/signal"
+    "syscall"
+
+    "github.com/docker/cagent/pkg/agent"
+    "github.com/docker/cagent/pkg/config/latest"
+    "github.com/docker/cagent/pkg/environment"
+    "github.com/docker/cagent/pkg/model/provider/openai"
+    "github.com/docker/cagent/pkg/runtime"
+    "github.com/docker/cagent/pkg/session"
+    "github.com/docker/cagent/pkg/team"
+)
+
+func main() {
+    ctx, cancel := signal.NotifyContext(context.Background(), 
+        syscall.SIGINT, syscall.SIGTERM)
+    defer cancel()
+
+    if err := run(ctx); err != nil {
+        log.Fatal(err)
+    }
+}
+
+func run(ctx context.Context) error {
+    // Create model provider
+    llm, err := openai.NewClient(
+        ctx,
+        &latest.ModelConfig{
+            Provider: "openai",
+            Model:    "gpt-4o",
+        },
+        environment.NewDefaultProvider(),
+    )
+    if err != nil {
+        return err
+    }
+
+    // Create agent
+    assistant := agent.New(
+        "root",
+        "You are a helpful assistant.",
+        agent.WithModel(llm),
+        agent.WithDescription("A helpful assistant"),
+    )
+
+    // Create team and runtime
+    t := team.New(team.WithAgents(assistant))
+    rt, err := runtime.New(t)
+    if err != nil {
+        return err
+    }
+
+    // Run with a user message
+    sess := session.New(
+        session.WithUserMessage("What is 2 + 2?"),
+    )
+    
+    messages, err := rt.Run(ctx, sess)
+    if err != nil {
+        return err
+    }
+
+    // Print the response
+    fmt.Println(messages[len(messages)-1].Message.Content)
+    return nil
+}
+ +

Custom Tools

+ +

Define custom tools for your agent:

+ +
package main
+
+import (
+    "context"
+    "encoding/json"
+    "fmt"
+
+    "github.com/docker/cagent/pkg/agent"
+    "github.com/docker/cagent/pkg/tools"
+)
+
+// Define the tool's input schema
+type AddNumbersArgs struct {
+    A int `json:"a"`
+    B int `json:"b"`
+}
+
+// Implement the tool handler
+func addNumbers(_ context.Context, toolCall tools.ToolCall) (*tools.ToolCallResult, error) {
+    var args AddNumbersArgs
+    if err := json.Unmarshal([]byte(toolCall.Function.Arguments), &args); err != nil {
+        return nil, err
+    }
+
+    result := args.A + args.B
+    return tools.ResultSuccess(fmt.Sprintf("%d", result)), nil
+}
+
+func main() {
+    // Create the tool definition
+    addTool := tools.Tool{
+        Name:        "add",
+        Category:    "math",
+        Description: "Add two numbers together",
+        Parameters:  tools.MustSchemaFor[AddNumbersArgs](),
+        Handler:     addNumbers,
+    }
+
+    // Use with an agent (llm is a model provider client, created as in the Basic Example)
+    calculator := agent.New(
+        "root",
+        "You are a calculator. Use the add tool for arithmetic.",
+        agent.WithModel(llm),
+        agent.WithTools(addTool),
+    )
+    // ...
+}
+ +

Streaming Responses

+ +

Process events as they happen:

+ +
func runStreaming(ctx context.Context, rt *runtime.Runtime, sess *session.Session) error {
+    events := rt.RunStream(ctx, sess)
+    
+    for event := range events {
+        switch e := event.(type) {
+        case *runtime.StreamStartedEvent:
+            fmt.Println("Stream started")
+            
+        case *runtime.AgentChoiceEvent:
+            // Print response chunks as they arrive
+            fmt.Print(e.Content)
+            
+        case *runtime.ToolCallEvent:
+            fmt.Printf("\n[Tool call: %s]\n", e.ToolCall.Function.Name)
+            
+        case *runtime.ToolCallConfirmationEvent:
+            // Auto-approve tool calls
+            rt.Resume(ctx, runtime.ResumeRequest{
+                Type: runtime.ResumeTypeApproveSession,
+            })
+            
+        case *runtime.ToolCallResponseEvent:
+            fmt.Printf("[Tool response: %s]\n", e.Response)
+            
+        case *runtime.StreamStoppedEvent:
+            fmt.Println("\nStream stopped")
+            
+        case *runtime.ErrorEvent:
+            return fmt.Errorf("error: %s", e.Error)
+        }
+    }
+    
+    return nil
+}
+ +

Multi-Agent Teams

+ +

Create agents that delegate to sub-agents:

+ +
package main
+
+import (
+    "github.com/docker/cagent/pkg/agent"
+    "github.com/docker/cagent/pkg/model/provider"
+    "github.com/docker/cagent/pkg/team"
+    "github.com/docker/cagent/pkg/tools/builtin"
+)
+
+func createTeam(llm provider.Provider) *team.Team {
+    // Create a child agent
+    researcher := agent.New(
+        "researcher",
+        "You research topics thoroughly.",
+        agent.WithModel(llm),
+        agent.WithDescription("Research specialist"),
+    )
+
+    // Create root agent with sub-agents
+    coordinator := agent.New(
+        "root",
+        "You coordinate research tasks.",
+        agent.WithModel(llm),
+        agent.WithDescription("Team coordinator"),
+        agent.WithSubAgents(researcher),
+        agent.WithToolSets(builtin.NewTransferTaskTool()),
+    )
+
+    return team.New(team.WithAgents(coordinator, researcher))
+}
+ +

Built-in Tools

+ +

Use cagent's built-in tools:

+ +
import (
+    "os"
+
+    "github.com/docker/cagent/pkg/agent"
+    "github.com/docker/cagent/pkg/config"
+    "github.com/docker/cagent/pkg/model/provider"
+    "github.com/docker/cagent/pkg/tools/builtin"
+)
+
+func createAgentWithBuiltinTools(llm provider.Provider) *agent.Agent {
+    // Runtime config for tools that need it
+    rtConfig := &config.RuntimeConfig{
+        Config: config.Config{
+            WorkingDir: "/path/to/workdir",
+        },
+    }
+
+    return agent.New(
+        "root",
+        "You are a developer assistant.",
+        agent.WithModel(llm),
+        agent.WithToolSets(
+            // Shell tool for running commands
+            builtin.NewShellTool(os.Environ(), rtConfig, nil),
+            // Filesystem tools
+            builtin.NewFilesystemTool(rtConfig.Config.WorkingDir, nil),
+            // Think tool for reasoning
+            builtin.NewThinkTool(),
+            // Todo tool for task tracking
+            builtin.NewTodoTool(false), // false = not shared
+        ),
+    )
+}
+ +

Using Different Providers

+ +
import (
+    "github.com/docker/cagent/pkg/model/provider/anthropic"
+    "github.com/docker/cagent/pkg/model/provider/gemini"
+    "github.com/docker/cagent/pkg/model/provider/openai"
+)
+
+// OpenAI
+openaiClient, _ := openai.NewClient(ctx, &latest.ModelConfig{
+    Provider: "openai",
+    Model:    "gpt-4o",
+}, env)
+
+// Anthropic
+anthropicClient, _ := anthropic.NewClient(ctx, &latest.ModelConfig{
+    Provider: "anthropic",
+    Model:    "claude-sonnet-4-0",
+    MaxTokens: 64000,
+}, env)
+
+// Google Gemini
+geminiClient, _ := gemini.NewClient(ctx, &latest.ModelConfig{
+    Provider: "google",
+    Model:    "gemini-2.5-flash",
+}, env)
+ +

Session Options

+ +
import "github.com/docker/cagent/pkg/session"
+
+sess := session.New(
+    // Set a title for the session
+    session.WithTitle("Code Review Task"),
+    
+    // Add user message
+    session.WithUserMessage("Review this code for bugs"),
+    
+    // Limit iterations
+    session.WithMaxIterations(20),
+    
+    // Include a file attachment
+    session.WithUserMessage("main.go", codeContent),
+)
+ +

Error Handling

+ +
messages, err := rt.Run(ctx, sess)
+if err != nil {
+    if errors.Is(err, context.Canceled) {
+        // User cancelled
+        log.Println("Operation cancelled")
+        return nil
+    }
+    if errors.Is(err, context.DeadlineExceeded) {
+        // Timeout
+        log.Println("Operation timed out")
+        return nil
+    }
+    // Other error
+    return fmt.Errorf("runtime error: %w", err)
+}
+
+// Check for errors in the event stream
+for event := range rt.RunStream(ctx, sess) {
+    if errEvent, ok := event.(*runtime.ErrorEvent); ok {
+        return fmt.Errorf("stream error: %s", errEvent.Error)
+    }
+}
+ +

Complete Example

+ +

See the examples/golibrary directory for complete working examples:

+ +
    +
  • simple/ — Basic agent with no tools
  • +
  • tool/ — Custom tool implementation
  • +
  • stream/ — Streaming event handling
  • +
  • multi/ — Multi-agent with sub-agents
  • +
  • builtintool/ — Using built-in tools
  • +
diff --git a/docs/pages/guides/tips.html b/docs/pages/guides/tips.html new file mode 100644 index 000000000..7c0823575 --- /dev/null +++ b/docs/pages/guides/tips.html @@ -0,0 +1,349 @@ +

Tips & Best Practices

+

Expert guidance for building effective, efficient, and secure agents.

+ +

Configuration Tips

+ +

Auto Mode for Quick Start

+ +

Don't have a config file? cagent can automatically detect your available API keys and use an appropriate model:

+ +
# Automatically uses the best available provider
+cagent run
+
+# Provider priority: OpenAI → Anthropic → Google → Mistral → DMR
+ +

The special auto model value also works in configs:

+ +
agents:
+  root:
+    model: auto  # Uses best available provider
+    description: Adaptive assistant
+    instruction: You are a helpful assistant.
+ +

Environment Variable Interpolation

+ +

Commands support JavaScript template literal syntax for environment variables:

+ +
agents:
+  root:
+    model: openai/gpt-4o
+    description: Deployment assistant
+    instruction: You help with deployments.
+    commands:
+      # Simple variable
+      greet: "Hello ${env.USER}!"
+      
+      # With default value
+      deploy: "Deploy to ${env.ENV || 'staging'}"
+      
+      # Multiple variables
+      release: "Release ${env.PROJECT} v${env.VERSION || '1.0.0'}"
+ +

Model Aliases Are Auto-Pinned

+ +

cagent automatically resolves model aliases to their latest pinned versions. This ensures reproducible behavior:

+ +
# You write:
+model: anthropic/claude-sonnet-4-5
+
+# cagent resolves to:
+# anthropic/claude-sonnet-4-5-20250929 (or latest available)
+ +

To use a specific version, specify it explicitly in your config.

+ +
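For example, to opt out of alias resolution, pin the dated release directly (the exact version string below is illustrative):

```yaml
agents:
  root:
    # Pin an exact dated release instead of the rolling alias
    model: anthropic/claude-sonnet-4-5-20250929
    description: Pinned assistant
    instruction: You are a helpful assistant.
```

Pinning trades automatic updates for strict reproducibility.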

Performance Tips

+ +

Defer Tools for Faster Startup

+ +

Large MCP toolsets can slow down agent startup. Use defer to load tools on-demand:

+ +
agents:
+  root:
+    model: openai/gpt-4o
+    description: Multi-tool assistant
+    instruction: You have many tools available.
+    toolsets:
+      - type: mcp
+        ref: docker:github-official
+      - type: mcp
+        ref: docker:slack
+      - type: mcp
+        ref: docker:linear
+    # Load all tools on first use
+    defer: true
+ +

Or defer specific tools:

+ +
defer:
+  - "mcp:github:*"    # Defer GitHub tools
+  - "mcp:slack:*"     # Defer Slack tools
+ +

Filter MCP Tools

+ +

Many MCP servers expose dozens of tools. Filter to only what you need:

+ +
toolsets:
+  - type: mcp
+    ref: docker:github-official
+    # Only expose these specific tools
+    tools:
+      - list_issues
+      - create_issue
+      - get_pull_request
+      - create_pull_request
+ +

Exposing fewer tools speeds up tool selection and reduces confusion for the model.

+ +

Set max_iterations

+ +

Always set max_iterations for agents with powerful tools to prevent infinite loops:

+ +
agents:
+  developer:
+    model: anthropic/claude-sonnet-4-0
+    description: Development assistant
+    instruction: You are a developer.
+    max_iterations: 30  # Reasonable limit for development tasks
+    toolsets:
+      - type: filesystem
+      - type: shell
+ +

Typical values: 20-30 for development agents, 10-15 for simple tasks.

+ +

Reliability Tips

+ +

Use Fallback Models

+ +

Configure fallback models for resilience against provider outages or rate limits:

+ +
agents:
+  root:
+    model: anthropic/claude-sonnet-4-0
+    description: Reliable assistant
+    instruction: You are a helpful assistant.
+    fallback:
+      models:
+        # Different provider for resilience
+        - openai/gpt-4o
+        # Cheaper model as last resort
+        - openai/gpt-4o-mini
+      retries: 2      # Retry 5xx errors twice
+      cooldown: 1m    # Stick with fallback for 1 min after rate limit
+ +

Best practices for fallback chains:

+
    +
  • Use different providers for true redundancy
  • +
  • Order by preference (best first)
  • +
  • Include a cheaper/faster model as last resort
  • +
+ +

Use Think Tool for Complex Tasks

+ +

The think tool dramatically improves reasoning quality with minimal overhead:

+ +
toolsets:
+  - type: think  # Always include for complex agents
+ +

The agent uses it as a scratchpad for planning and decision-making.

+ +

Security Tips

+ +

Use --yolo Mode Carefully

+ +

The --yolo flag auto-approves all tool calls without confirmation:

+ +
# Auto-approve everything (use with caution!)
+cagent run agent.yaml --yolo
+ +

When it's appropriate:

+
    +
  • CI/CD pipelines with controlled inputs
  • +
  • Automated testing
  • +
  • Agents with only safe, read-only tools
  • +
+ +

When to avoid:

+
    +
  • Interactive sessions with untested prompts
  • +
  • Agents with shell or filesystem write access
  • +
  • Any situation where unreviewed actions could cause harm
  • +
+ +

Combine Permissions with Sandbox

+ +

For defense in depth, use both permissions and sandbox mode:

+ +
agents:
+  secure_dev:
+    model: anthropic/claude-sonnet-4-0
+    description: Secure development assistant
+    instruction: You are a secure coding assistant.
+    toolsets:
+      - type: filesystem
+      - type: shell
+    # Layer 1: Permission controls
+    permissions:
+      allow:
+        - "read_*"
+        - "shell:cmd=go*"
+        - "shell:cmd=npm*"
+      deny:
+        - "shell:cmd=sudo*"
+        - "shell:cmd=rm*-rf*"
+    # Layer 2: Container isolation
+    sandbox:
+      image: golang:1.23-alpine
+      paths:
+        - ".:rw"
+ +

Use Hooks for Audit Logging

+ +

Log all tool calls for compliance or debugging:

+ +
agents:
+  audited:
+    model: openai/gpt-4o
+    description: Audited assistant
+    instruction: You are a helpful assistant.
+    hooks:
+      post_tool_use:
+        - matcher: "*"
+          hooks:
+            - type: command
+              command: "./scripts/audit-log.sh"
+ +

Multi-Agent Tips

+ +

Handoffs vs Sub-Agents

+ +

Understand the difference between sub_agents and handoffs:

+ +
+
+

sub_agents (transfer_task)

+

Delegates task to a child, waits for result, then continues. The parent remains in control.

+
sub_agents: [researcher, writer]
+
+
+

handoffs (A2A)

+

Transfers control entirely to another agent (possibly remote). One-way handoff.

+
handoffs:
+  - name: specialist
+    url: http://...
+
+
+ +

Give Sub-Agents Clear Descriptions

+ +

The root agent uses descriptions to decide which sub-agent to delegate to:

+ +
agents:
+  root:
+    model: anthropic/claude-sonnet-4-0
+    description: Technical lead
+    instruction: Delegate to specialists based on the task.
+    sub_agents: [frontend, backend, devops]
+
+  frontend:
+    model: openai/gpt-4o
+    # Good: specific and actionable
+    description: |
+      Frontend specialist. Handles React, TypeScript, CSS, 
+      UI components, and browser-related issues.
+
+  backend:
+    model: openai/gpt-4o
+    # Good: clear domain boundaries
+    description: |
+      Backend specialist. Handles APIs, databases, 
+      server logic, and Go/Python code.
+
+  devops:
+    model: openai/gpt-4o
+    description: |
+      DevOps specialist. Handles CI/CD, Docker, Kubernetes,
+      infrastructure, and deployment pipelines.
+ +

Debugging Tips

+ +

Enable Debug Logging

+ +

Use the --debug flag to see detailed execution logs:

+ +
# Default log location: ~/.cagent/cagent.debug.log
+cagent run agent.yaml --debug
+
+# Custom log location
+cagent run agent.yaml --debug --log-file ./debug.log
+ +

Check Token Usage

+ +

Use the /usage command during a session to see token consumption:

+ +
/usage
+
+Token Usage:
+  Input:  12,456 tokens
+  Output:  3,789 tokens
+  Total:  16,245 tokens
+ +

Compact Long Sessions

+ +

If a session gets too long, use /compact to summarize and reduce context:

+ +
/compact
+
+Session compacted. Summary generated and history trimmed.
+ +

More Tips

+ +

User-Defined Default Model

+ +

Set your preferred default model in ~/.config/cagent/config.yaml:

+ +
settings:
+  default_model: anthropic/claude-sonnet-4-0
+ +

This model is used when you run cagent run without a config file.

+ +

GitHub PR Reviewer Example

+ +

Use cagent as a GitHub Actions PR reviewer:

+ +
# .github/workflows/pr-review.yml
+name: PR Review
+on:
+  pull_request:
+    types: [opened, synchronize]
+
+jobs:
+  review:
+    runs-on: ubuntu-latest
+    steps:
+      - uses: actions/checkout@v4
+      - name: Run cagent review
+        env:
+          ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}
+          GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}
+        run: |
+          # Install cagent
+          curl -fsSL https://get.cagent.dev | sh
+          
+          # Run the review
+          cagent exec reviewer.yaml --yolo \
+            "Review PR #${{ github.event.pull_request.number }}"
+ +

With a simple reviewer agent:

+ +
# reviewer.yaml
+agents:
+  root:
+    model: anthropic/claude-sonnet-4-0
+    description: PR reviewer
+    instruction: |
+      Review pull requests for code quality, bugs, and security issues.
+      Be constructive and specific in your feedback.
+    toolsets:
+      - type: mcp
+        ref: docker:github-official
+      - type: think
diff --git a/docs/pages/providers/local.html b/docs/pages/providers/local.html new file mode 100644 index 000000000..8bf6b39b7 --- /dev/null +++ b/docs/pages/providers/local.html @@ -0,0 +1,194 @@ +

Local Models (Ollama, vLLM, LocalAI)

+

Run cagent with locally hosted models for privacy, offline use, or cost savings.

+ +

Overview

+ +

cagent can connect to any OpenAI-compatible local model server. This guide covers the most popular options:

+ +
    +
  • Ollama — Easy-to-use local model runner
  • +
  • vLLM — High-performance inference server
  • +
  • LocalAI — OpenAI-compatible API for various backends
  • +
+ +
+
💡 Docker Model Runner
+

For the easiest local model experience, consider Docker Model Runner which is built into Docker Desktop and requires no additional setup.

+
+ +

Ollama

+ +

Ollama is a popular tool for running LLMs locally. cagent includes a built-in ollama alias for easy configuration.

+ +

Setup

+ +
  1. Install Ollama from ollama.ai
  2. Pull a model:
     ollama pull llama3.2
     ollama pull qwen2.5-coder
  3. Start the Ollama server (usually runs automatically):
     ollama serve
+ +

Configuration

+ +

Use the built-in ollama alias:

+ +
agents:
+  root:
+    model: ollama/llama3.2
+    description: Local assistant
+    instruction: You are a helpful assistant.
+ +

The ollama alias automatically uses:

+
    +
  • Base URL: http://localhost:11434/v1
  • +
  • API Type: OpenAI-compatible
  • +
  • No API key required
  • +
+ +

Custom Port or Host

+ +

If Ollama runs on a different host or port:

+ +
models:
+  my_ollama:
+    provider: ollama
+    model: llama3.2
+    base_url: http://192.168.1.100:11434/v1
+
+agents:
+  root:
+    model: my_ollama
+    description: Remote Ollama assistant
+    instruction: You are a helpful assistant.
+ +

Popular Ollama Models

+ + + + + + + + + + + +
ModelSizeBest For
llama3.23BGeneral purpose, fast
llama3.18BBetter reasoning
qwen2.5-coder7BCode generation
mistral7BGeneral purpose
codellama7BCode tasks
deepseek-coder6.7BCode generation
+ +

vLLM

+ +

vLLM is a high-performance inference server optimized for throughput.

+ +

Setup

+ +
# Install vLLM
+pip install vllm
+
+# Start the server
+python -m vllm.entrypoints.openai.api_server \
+  --model meta-llama/Llama-3.2-3B-Instruct \
+  --port 8000
+ +

Configuration

+ +
providers:
+  vllm:
+    api_type: openai_chatcompletions
+    base_url: http://localhost:8000/v1
+
+agents:
+  root:
+    model: vllm/meta-llama/Llama-3.2-3B-Instruct
+    description: vLLM-powered assistant
+    instruction: You are a helpful assistant.
+ +

LocalAI

+ +

LocalAI provides an OpenAI-compatible API that works with various backends.

+ +

Setup

+ +
# Run with Docker
+docker run -p 8080:8080 --name local-ai \
+  -v ./models:/models \
+  localai/localai:latest-cpu
+ +

Configuration

+ +
providers:
+  localai:
+    api_type: openai_chatcompletions
+    base_url: http://localhost:8080/v1
+
+agents:
+  root:
+    model: localai/gpt4all-j
+    description: LocalAI assistant
+    instruction: You are a helpful assistant.
+ +

Generic Custom Provider

+ +

For any OpenAI-compatible server:

+ +
providers:
+  my_server:
+    api_type: openai_chatcompletions
+    base_url: http://localhost:8000/v1
+    # token_key: MY_API_KEY  # if auth required
+
+agents:
+  root:
+    model: my_server/model-name
+    description: Custom server assistant
+    instruction: You are a helpful assistant.
+ +

Performance Tips

+ +
+
ℹ️ Local Model Considerations
+
    +
  • Memory: Larger models need more RAM/VRAM. A 7B model typically needs 8-16GB RAM.
  • +
  • GPU: GPU acceleration dramatically improves speed. Check your server's GPU support.
  • +
  • Context length: Local models often have smaller context windows than cloud models.
  • +
  • Tool calling: Not all local models support function/tool calling. Test your model's capabilities.
  • +
+
+ +

Example: Offline Development Agent

+ +
agents:
+  developer:
+    model: ollama/qwen2.5-coder
+    description: Offline code assistant
+    instruction: |
+      You are a software developer working offline.
+      Focus on code quality and clear explanations.
+    max_iterations: 20
+    toolsets:
+      - type: filesystem
+      - type: shell
+      - type: think
+      - type: todo
+ +

Troubleshooting

+ +

Connection Refused

+

Ensure your model server is running and accessible:

+
curl http://localhost:11434/v1/models  # Ollama
+curl http://localhost:8000/v1/models   # vLLM
+ +

Model Not Found

+

Verify the model is downloaded/available:

+
ollama list  # List available Ollama models
+ +

Slow Responses

+
    +
  • Check if GPU acceleration is enabled
  • +
  • Try a smaller model
  • +
  • Reduce max_tokens in your config
  • +
diff --git a/docs/pages/providers/mistral.html b/docs/pages/providers/mistral.html new file mode 100644 index 000000000..d4855e936 --- /dev/null +++ b/docs/pages/providers/mistral.html @@ -0,0 +1,110 @@ +

Mistral

+

Use Mistral AI models with cagent.

+ +

Overview

+ +

Mistral AI provides powerful language models through an OpenAI-compatible API. cagent includes built-in support for Mistral as an alias provider.

+ +

Setup

+ +
  1. Get an API key from Mistral Console
  2. Set the environment variable:
     export MISTRAL_API_KEY=your-api-key
+ +

Usage

+ +

Inline Syntax

+ +

The simplest way to use Mistral:

+ +
agents:
+  root:
+    model: mistral/mistral-large-latest
+    description: Assistant using Mistral
+    instruction: You are a helpful assistant.
+ +

Named Model

+ +

For more control over parameters:

+ +
models:
+  mistral:
+    provider: mistral
+    model: mistral-large-latest
+    temperature: 0.7
+    max_tokens: 8192
+
+agents:
+  root:
+    model: mistral
+    description: Assistant using Mistral
+    instruction: You are a helpful assistant.
+ +

Available Models

+ + + + + + + + + + + + +
ModelDescriptionContext
mistral-large-latestMost capable Mistral model128K
mistral-medium-latestBalanced performance and cost128K
mistral-small-latestFast and cost-effective (default)128K
codestral-latestOptimized for code generation32K
open-mistral-nemoOpen-weight model128K
ministral-8b-latestCompact 8B parameter model128K
ministral-3b-latestSmallest Mistral model128K
+ +

Check the Mistral Models documentation for the latest available models.

+ +

Auto-Detection

+ +

When you run cagent run without specifying a config, cagent automatically detects available providers. If MISTRAL_API_KEY is set and higher-priority providers (OpenAI, Anthropic, Google) are not available, Mistral will be used with mistral-small-latest as the default model.

+ +

Extended Thinking

+ +

Mistral models support thinking mode through the OpenAI-compatible API. By default, cagent enables medium thinking effort:

+ +
models:
+  mistral:
+    provider: mistral
+    model: mistral-large-latest
+    thinking_budget: high  # minimal, low, medium, high, or none
+ +

To disable thinking:

+ +
models:
+  mistral:
+    provider: mistral
+    model: mistral-large-latest
+    thinking_budget: none
+ +

How It Works

+ +

Mistral is implemented as a built-in alias in cagent:

+ +
    +
  • API Type: OpenAI-compatible (openai_chatcompletions)
  • +
  • Base URL: https://api.mistral.ai/v1
  • +
  • Token Variable: MISTRAL_API_KEY
  • +
+ +

This means Mistral uses the same client as OpenAI, making it fully compatible with all OpenAI features supported by cagent.

+ +
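Based on the details above, the built-in alias behaves roughly as if you had declared the provider yourself with the top-level providers section (you normally don't need to do this):

```yaml
providers:
  mistral:
    api_type: openai_chatcompletions
    base_url: https://api.mistral.ai/v1
    token_key: MISTRAL_API_KEY
```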

Example: Code Assistant

+ +
agents:
+  coder:
+    model: mistral/codestral-latest
+    description: Expert code assistant
+    instruction: |
+      You are an expert programmer using Codestral.
+      Write clean, efficient, well-documented code.
+      Explain your reasoning when helpful.
+    toolsets:
+      - type: filesystem
+      - type: shell
+      - type: think
diff --git a/docs/pages/providers/nebius.html b/docs/pages/providers/nebius.html new file mode 100644 index 000000000..c5058ff73 --- /dev/null +++ b/docs/pages/providers/nebius.html @@ -0,0 +1,82 @@ +

Nebius

+

Use Nebius AI models with cagent.

+ +

Overview

+ +

Nebius provides AI models through an OpenAI-compatible API. cagent includes built-in support for Nebius as an alias provider.

+ +

Setup

+ +
  1. Get an API key from Nebius AI
  2. Set the environment variable:
     export NEBIUS_API_KEY=your-api-key
+ +

Usage

+ +

Inline Syntax

+ +

The simplest way to use Nebius:

+ +
agents:
+  root:
+    model: nebius/deepseek-ai/DeepSeek-V3
+    description: Assistant using Nebius
+    instruction: You are a helpful assistant.
+ +

Named Model

+ +

For more control over parameters:

+ +
models:
+  nebius_model:
+    provider: nebius
+    model: deepseek-ai/DeepSeek-V3
+    temperature: 0.7
+    max_tokens: 8192
+
+agents:
+  root:
+    model: nebius_model
+    description: Assistant using Nebius
+    instruction: You are a helpful assistant.
+ +

Available Models

+ +

Nebius hosts various open models. Check the Nebius documentation for the current model catalog.

+ + + + + + + + +
ModelDescription
deepseek-ai/DeepSeek-V3DeepSeek V3 model
Qwen/Qwen2.5-72B-InstructQwen 2.5 72B instruction-tuned
meta-llama/Llama-3.3-70B-InstructLlama 3.3 70B instruction-tuned
+ +

How It Works

+ +

Nebius is implemented as a built-in alias in cagent:

+ +
    +
  • API Type: OpenAI-compatible (openai_chatcompletions)
  • +
  • Base URL: https://api.studio.nebius.ai/v1
  • +
  • Token Variable: NEBIUS_API_KEY
  • +
+ +

Example: Code Assistant

+ +
agents:
+  coder:
+    model: nebius/deepseek-ai/DeepSeek-V3
+    description: Code assistant using DeepSeek
+    instruction: |
+      You are an expert programmer using DeepSeek V3.
+      Write clean, well-documented code.
+      Follow best practices for the language being used.
+    toolsets:
+      - type: filesystem
+      - type: shell
+      - type: think
diff --git a/docs/pages/providers/xai.html b/docs/pages/providers/xai.html new file mode 100644 index 000000000..1260936fb --- /dev/null +++ b/docs/pages/providers/xai.html @@ -0,0 +1,95 @@ +

xAI (Grok)

+

Use xAI's Grok models with cagent.

+ +

Overview

+ +

xAI provides the Grok family of models through an OpenAI-compatible API. cagent includes built-in support for xAI as an alias provider.

+ +

Setup

+ +
  1. Get an API key from xAI Console
  2. Set the environment variable:
     export XAI_API_KEY=your-api-key
+ +

Usage

+ +

Inline Syntax

+ +

The simplest way to use xAI:

+ +
agents:
+  root:
+    model: xai/grok-3
+    description: Assistant using Grok
+    instruction: You are a helpful assistant.
+ +

Named Model

+ +

For more control over parameters:

+ +
models:
+  grok:
+    provider: xai
+    model: grok-3
+    temperature: 0.7
+    max_tokens: 8192
+
+agents:
+  root:
+    model: grok
+    description: Assistant using Grok
+    instruction: You are a helpful assistant.
+ +

Available Models

+ + + + + + + + + + + +
ModelDescriptionContext
grok-3Latest and most capable Grok model131K
grok-3-fastFaster variant with lower latency131K
grok-3-miniCompact model for simpler tasks131K
grok-3-mini-fastFast variant of the mini model131K
grok-2Previous generation model128K
grok-visionVision-capable model32K
+ +

Check the xAI documentation for the latest available models.

+ +

Extended Thinking

+ +

Grok models support thinking mode through the OpenAI-compatible API:

+ +
models:
+  grok:
+    provider: xai
+    model: grok-3
+    thinking_budget: high  # minimal, low, medium, high, or none
+ +

How It Works

+ +

xAI is implemented as a built-in alias in cagent:

+ +
    +
  • API Type: OpenAI-compatible (openai_chatcompletions)
  • +
  • Base URL: https://api.x.ai/v1
  • +
  • Token Variable: XAI_API_KEY
  • +
+ +

Example: Research Assistant

+ +
agents:
+  researcher:
+    model: xai/grok-3
+    description: Research assistant with real-time knowledge
+    instruction: |
+      You are a research assistant using Grok.
+      Provide well-researched, factual responses.
+      Cite sources when available.
+    toolsets:
+      - type: mcp
+        ref: docker:duckduckgo
+      - type: think
diff --git a/docs/pages/tools/api.html b/docs/pages/tools/api.html new file mode 100644 index 000000000..56dd4bd7a --- /dev/null +++ b/docs/pages/tools/api.html @@ -0,0 +1,201 @@ +

API Tool

+

Create custom tools that call HTTP APIs.

+ +

Overview

+ +

The API tool type lets you define custom tools that make HTTP requests to external APIs. This is useful for integrating agents with REST APIs, webhooks, or any HTTP-based service without writing code.

+ +
+
ℹ️ When to Use
+
  • Integrating with REST APIs that don't have an MCP server
  • Simple HTTP operations (GET and POST)
  • Quick prototyping before building a full MCP server
+
+ +

Configuration

+ +
agents:
+  assistant:
+    model: openai/gpt-4o
+    description: Assistant with API access
+    instruction: You can look up weather information.
+    toolsets:
+      - type: api
+        name: get_weather
+        method: GET
+        endpoint: "https://api.weather.example/v1/current?city=${city}"
+        instruction: Get current weather for a city
+        args:
+          city:
+            type: string
+            description: City name to get weather for
+        required: ["city"]
+        headers:
+          Authorization: "Bearer ${env.WEATHER_API_KEY}"
+ +

Properties

| Property | Type | Description |
|---|---|---|
| name | string | Tool name (how the agent references it) |
| method | string | HTTP method: GET or POST |
| endpoint | string | URL endpoint (supports ${param} interpolation) |
| instruction | string | Description shown to the agent |
| args | object | Parameter definitions (JSON Schema properties) |
| required | array | List of required parameter names |
| headers | object | HTTP headers to include |
| output_schema | object | JSON Schema for the response (for documentation) |
+ +

HTTP Methods

+ +

GET Requests

+ +

For GET requests, parameters are interpolated into the URL:

+ +
toolsets:
+  - type: api
+    name: search_users
+    method: GET
+    endpoint: "https://api.example.com/users?q=${query}&limit=${limit}"
+    instruction: Search for users by name
+    args:
+      query:
+        type: string
+        description: Search query
+      limit:
+        type: integer
+        description: Maximum results (default 10)
+    required: ["query"]
+ +

POST Requests

+ +

For POST requests, parameters are sent as JSON in the request body:

+ +
toolsets:
+  - type: api
+    name: create_task
+    method: POST
+    endpoint: "https://api.example.com/tasks"
+    instruction: Create a new task
+    args:
+      title:
+        type: string
+        description: Task title
+      description:
+        type: string
+        description: Task description
+      priority:
+        type: string
+        enum: ["low", "medium", "high"]
+        description: Task priority
+    required: ["title"]
+    headers:
+      Content-Type: "application/json"
+      Authorization: "Bearer ${env.API_TOKEN}"
+ +

URL Interpolation

+ +

Use ${param} syntax to insert parameter values into URLs:

+ +
endpoint: "https://api.example.com/users/${user_id}/posts/${post_id}"
+ +

Parameter values are URL-encoded automatically.

+ +
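For example, with a hypothetical call where `user_id` is `jane doe` and `post_id` is `42`, the space is percent-encoded before the request is sent:

```yaml
endpoint: "https://api.example.com/users/${user_id}/posts/${post_id}"
# With user_id = "jane doe" and post_id = "42", the request goes to:
#   https://api.example.com/users/jane%20doe/posts/42
```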

Headers

+ +

Headers can include environment variables:

+ +
headers:
+  Authorization: "Bearer ${env.API_KEY}"
+  X-Custom-Header: "static-value"
+  Content-Type: "application/json"
+ +

Output Schema

+ +

Optionally document the expected response format:

+ +
toolsets:
+  - type: api
+    name: get_user
+    method: GET
+    endpoint: "https://api.example.com/users/${id}"
+    instruction: Get user details by ID
+    args:
+      id:
+        type: string
+        description: User ID
+    required: ["id"]
+    output_schema:
+      type: object
+      properties:
+        id:
+          type: string
+        name:
+          type: string
+        email:
+          type: string
+        created_at:
+          type: string
+ +

Example: GitHub API

+ +
agents:
+  github_assistant:
+    model: openai/gpt-4o
+    description: Assistant that can query GitHub
+    instruction: You can look up GitHub repositories and users.
+    toolsets:
+      - type: api
+        name: get_repo
+        method: GET
+        endpoint: "https://api.github.com/repos/${owner}/${repo}"
+        instruction: Get information about a GitHub repository
+        args:
+          owner:
+            type: string
+            description: Repository owner (user or org)
+          repo:
+            type: string
+            description: Repository name
+        required: ["owner", "repo"]
+        headers:
+          Accept: "application/vnd.github.v3+json"
+          Authorization: "Bearer ${env.GITHUB_TOKEN}"
+      
+      - type: api
+        name: get_user
+        method: GET
+        endpoint: "https://api.github.com/users/${username}"
+        instruction: Get information about a GitHub user
+        args:
+          username:
+            type: string
+            description: GitHub username
+        required: ["username"]
+        headers:
+          Accept: "application/vnd.github.v3+json"
+ +

Limitations

+ +
  • Only supports GET and POST methods
  • Response body is limited to 1 MB
  • 30-second timeout per request
  • Only HTTP and HTTPS URLs are supported
  • No support for file uploads or multipart forms
+ +
+
💡 For Complex APIs
+

For APIs that need authentication flows, pagination, or complex request/response handling, consider using an MCP server instead. The API tool is best for simple, stateless HTTP operations.

+
+ +
+
⚠️ Security
+

API keys and tokens in headers are visible in debug logs. Use environment variables (${env.VAR}) rather than hardcoding secrets in configuration files.

+
diff --git a/docs/pages/tools/lsp.html b/docs/pages/tools/lsp.html new file mode 100644 index 000000000..91be6a7ec --- /dev/null +++ b/docs/pages/tools/lsp.html @@ -0,0 +1,183 @@ +

LSP Tool

+

Connect to Language Server Protocol servers for code intelligence.

+ +

Overview

+ +

The LSP tool connects your agent to any Language Server Protocol (LSP) server, giving it code intelligence capabilities such as go-to-definition, find-references, and diagnostics.

+ +
+
ℹ️ What is LSP?
+

The Language Server Protocol is a standard for providing language features like autocomplete, go-to-definition, and diagnostics. Most programming languages have LSP servers available.

+
+ +

Configuration

+ +
agents:
+  developer:
+    model: anthropic/claude-sonnet-4-0
+    description: Code developer with LSP support
+    instruction: You are a software developer.
+    toolsets:
+      - type: lsp
+        command: gopls
+        args: []
+        file_types: [".go"]
+      - type: filesystem
+      - type: shell
+ +

Properties

| Property | Type | Description |
|---|---|---|
| command | string | LSP server executable command |
| args | array | Command-line arguments for the LSP server |
| env | object | Environment variables for the LSP process |
| file_types | array | File extensions this LSP server handles (e.g., [".go", ".mod"]) |
+ +

Available Tools

+ +

The LSP toolset provides these tools to the agent:

| Tool | Description |
|---|---|
| lsp_workspace | Get workspace info and available capabilities |
| lsp_hover | Get type info and documentation for a symbol |
| lsp_definition | Find where a symbol is defined |
| lsp_references | Find all references to a symbol |
| lsp_document_symbols | List all symbols in a file |
| lsp_workspace_symbols | Search symbols across the workspace |
| lsp_diagnostics | Get errors and warnings for a file |
| lsp_code_actions | Get available quick fixes and refactorings |
| lsp_rename | Rename a symbol across the workspace |
| lsp_format | Format a file |
| lsp_call_hierarchy | Find incoming/outgoing calls |
| lsp_type_hierarchy | Find supertypes/subtypes |
| lsp_implementations | Find interface implementations |
| lsp_signature_help | Get function signature at call site |
| lsp_inlay_hints | Get type annotations and parameter names |
+ +

Common LSP Servers

+ +

Here are configurations for popular languages:

+ +

Go (gopls)

+ +
toolsets:
+  - type: lsp
+    command: gopls
+    file_types: [".go"]
+ +

TypeScript/JavaScript (typescript-language-server)

+ +
toolsets:
+  - type: lsp
+    command: typescript-language-server
+    args: ["--stdio"]
+    file_types: [".ts", ".tsx", ".js", ".jsx"]
+ +

Python (pylsp)

+ +
toolsets:
+  - type: lsp
+    command: pylsp
+    file_types: [".py"]
+ +

Rust (rust-analyzer)

+ +
toolsets:
+  - type: lsp
+    command: rust-analyzer
+    file_types: [".rs"]
+ +

C/C++ (clangd)

+ +
toolsets:
+  - type: lsp
+    command: clangd
+    file_types: [".c", ".cpp", ".h", ".hpp"]
+ +

Multiple LSP Servers

+ +

You can configure multiple LSP servers for different file types:

+ +
agents:
+  polyglot:
+    model: anthropic/claude-sonnet-4-0
+    description: Multi-language developer
+    instruction: You are a full-stack developer.
+    toolsets:
+      - type: lsp
+        command: gopls
+        file_types: [".go"]
+      - type: lsp
+        command: typescript-language-server
+        args: ["--stdio"]
+        file_types: [".ts", ".tsx", ".js", ".jsx"]
+      - type: lsp
+        command: pylsp
+        file_types: [".py"]
+      - type: filesystem
+      - type: shell
+ +

Workflow Instructions

+ +

The LSP tool includes built-in instructions that guide the agent on how to use it effectively. The agent learns to:

+ +
  1. Start with lsp_workspace to understand available capabilities
  2. Use lsp_workspace_symbols to find relevant code
  3. Use lsp_references before modifying any symbol
  4. Check lsp_diagnostics after every code change
  5. Apply lsp_format after edits are complete
+ +
+
💡 Best Practice
+

Always include the filesystem tool alongside LSP. The agent needs filesystem access to read and write code files, while LSP provides intelligence about the code.

+
+ +

Capability Detection

+ +

Not all LSP servers support all features. The agent uses lsp_workspace to discover what's available:

+ +
Workspace Information:
+- Root: /path/to/project
+- Server: gopls v0.14.0
+- File types: .go
+
+Available Capabilities:
+- Hover: Yes
+- Go to Definition: Yes
+- Find References: Yes
+- Rename: Yes
+- Code Actions: Yes
+- Formatting: Yes
+- Call Hierarchy: Yes
+- Type Hierarchy: Yes
+...
+ +

Position Format

+ +

All LSP tools use 1-based line and character positions:

+ +
  • Line 1 is the first line of the file
  • Character 1 is the first character on a line
+ +
{
+  "file": "/path/to/file.go",
+  "line": 42,
+  "character": 15
+}
+ +
+
⚠️ Server Installation
+

The LSP server must be installed and available in the system PATH. cagent does not install LSP servers automatically. Install them using your language's package manager (e.g., go install golang.org/x/tools/gopls@latest).

+
diff --git a/docs/pages/tools/user-prompt.html b/docs/pages/tools/user-prompt.html new file mode 100644 index 000000000..80a4be87a --- /dev/null +++ b/docs/pages/tools/user-prompt.html @@ -0,0 +1,173 @@ +

User Prompt Tool

+

Ask the user questions and collect interactive input during agent execution.

+ +

Overview

+ +

The user prompt tool allows agents to ask questions and collect input from users during execution. This enables interactive workflows where the agent needs clarification, confirmation, or additional information before proceeding.

+ +
+
ℹ️ When to Use
+
  • Asking for clarification before proceeding
  • Collecting credentials or configuration values
  • Presenting choices and getting user decisions
  • Confirming destructive or important actions
+
+ +

Configuration

+ +
agents:
+  assistant:
+    model: openai/gpt-4o
+    description: Interactive assistant
+    instruction: |
+      You are a helpful assistant. When you need information
+      from the user, use the user_prompt tool to ask them.
+    toolsets:
+      - type: user_prompt
+      - type: filesystem
+      - type: shell
+ +

Tool Interface

+ +

The user_prompt tool takes these parameters:

| Parameter | Type | Description |
|---|---|---|
| message | string | The question or prompt to display |
| schema | object | JSON Schema defining expected response structure |
+ +

Response Format

+ +

The tool returns a JSON response:

+ +
{
+  "action": "accept",
+  "content": {
+    "field1": "user value",
+    "field2": true
+  }
+}
+ +

Action Values

| Action | Meaning |
|---|---|
| accept | User provided a response (check content) |
| decline | User declined to answer |
| cancel | User cancelled the prompt |
+ +
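A declined prompt, for example, comes back with no usable content. As a sketch (the exact payload shape beyond the `action` field is an assumption):

```json
{
  "action": "decline"
}
```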

Schema Examples

+ +

Simple String Input

+ +
{
+  "type": "string",
+  "title": "API Key",
+  "description": "Enter your API key"
+}
+ +

Multiple Choice

+ +
{
+  "type": "string",
+  "enum": ["development", "staging", "production"],
+  "title": "Environment",
+  "description": "Select the target environment"
+}
+ +

Boolean Confirmation

+ +
{
+  "type": "boolean",
+  "title": "Confirm",
+  "description": "Are you sure you want to proceed?"
+}
+ +

Object with Multiple Fields

+ +
{
+  "type": "object",
+  "properties": {
+    "username": {
+      "type": "string",
+      "description": "Your username"
+    },
+    "password": {
+      "type": "string",
+      "description": "Your password"
+    },
+    "remember": {
+      "type": "boolean",
+      "description": "Remember credentials"
+    }
+  },
+  "required": ["username", "password"]
+}
+ +

Number Input

+ +
{
+  "type": "integer",
+  "title": "Port Number",
+  "description": "Enter the port number (1024-65535)",
+  "minimum": 1024,
+  "maximum": 65535
+}
+ +

Example Usage

+ +

Here's how an agent might use the user prompt tool:

+ +
Agent: I need to deploy this application. Let me ask which environment to target.
+
+[Calls user_prompt with message: "Which environment should I deploy to?" 
+ and schema with enum: ["development", "staging", "production"]]
+
+User selects: "staging"
+
+Agent: Great, I'll deploy to staging. Let me confirm this action.
+
+[Calls user_prompt with message: "Deploy to staging? This will replace the current version."
+ and schema with type: "boolean"]
+
+User confirms: true
+
+Agent: Deploying to staging...
+ +

UI Presentation

+ +

How the prompt appears depends on the interface:

+ +
  • TUI: Displays an interactive dialog with appropriate input controls
  • CLI (exec mode): Prints the prompt and reads from stdin
  • API/MCP: Returns an elicitation request to the client
+ +
+
💡 Best Practice
+

Provide clear, concise messages. Include context about why you're asking and what the information will be used for. Use schemas with descriptions to guide users on expected input format.

+
+ +

Handling Responses

+ +

The agent should handle all possible actions:

+ +
  • accept: Process the content and continue
  • decline: Acknowledge and try an alternative approach, or explain what's needed
  • cancel: Stop the current operation gracefully
+ +
+
⚠️ Context Requirement
+

The user prompt tool requires an elicitation handler to be configured. It works in the TUI and CLI modes but may not be available in all contexts (e.g., some MCP client configurations).

+