docs(codex): add MCP transport troubleshooting and restart policy by Xangel0s · Pull Request #166 · Gentleman-Programming/engram

Xangel0s · 2026-04-08T23:17:57Z

Closes #167

Summary

Adds official troubleshooting guidance for Codex MCP Transport closed failures when using Engram memory tools.

Problem

In real workflows, users can hit:

tool call failed ... Caused by: Transport closed

This appears most often after:

replacing engram.exe
editing Codex MCP config or instruction files
force-stopping running engram processes

Root Cause

Codex may keep a stale MCP stdio channel after session-boundary operational changes. The HTTP server can be healthy while MCP transport in the active chat is stale.

Changes

Added Codex-focused troubleshooting section in docs/AGENT-SETUP.md
Added operator troubleshooting section in README_CONTEXT.md
Standardized:
- recovery sequence
- preventive restart rule after binary/config updates

Recovery Sequence (documented)

Start a new Codex chat or reload the VS Code window
If issue persists, restart VS Code
Validate with one Engram MCP call (mem_context or mem_save)

Why This Helps

Reduces recurring support friction around MCP transport errors
Clarifies distinction between HTTP health and MCP stdio health
Provides deterministic, low-friction operator playbook

Scope and Risk

Docs-only change
No runtime logic, schema, API, or behavior modified

Validation

Reproduced the failure pattern in Windows workflow
Confirmed recovery via session restart/new chat
Confirmed documentation now captures both reactive and preventive guidance

Copilot

Pull request overview

This PR adds documentation to help users recover from Codex MCP stdio failures (notably Transport closed) when using Engram memory tools, and proposes a restart policy after binary/config changes.

Changes:

Added a Codex-specific troubleshooting section for Transport closed in docs/AGENT-SETUP.md.
Added MCP transport troubleshooting guidance to README_CONTEXT.md (alongside a broader “Context Bus Manual” document).

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 7 comments.

File	Description
README_CONTEXT.md	Introduces a new operator/agent context manual and includes an MCP transport troubleshooting section.
docs/AGENT-SETUP.md	Adds Codex troubleshooting steps and a restart policy note for stale MCP stdio sessions.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-04-08T23:22:41Z

README_CONTEXT.md

+## API Endpoints
+
+### POST /remember
+
+Stores one context pill in `context_logs`.
+
+Required fields:
+- `agent_source`: must be one of `codex`, `gemini`, `system`.
+- `content`: the context pill text.
+


The HTTP API described here (POST /remember with agent_source/content + context_logs table) does not match Engram’s implemented HTTP server. The server routes are /sessions, /observations, /search, /timeline, /context, etc. (see internal/server/server.go routes), and there is no /remember endpoint or context_logs table in the current schema. Please update this section to reflect the actual API, or explicitly label it as a separate/legacy service so readers aren’t misled.

Copilot · 2026-04-08T23:22:42Z

README_CONTEXT.md

+### GET /recall
+
+Returns enriched recall payload with:
+- `focused_observation`: single focus context log.
+- `timeline`: chronological surrounding entries (up to 5 before + 5 after).
+
+Query params:
+- `q` optional: partial match search over `content`.
+- `agent_source` optional: filter to one source (`codex`, `gemini`, `system`).
+- `mode` optional: `compact` (default) or `full`.
+- `recall_profile` optional: `lean`, `balanced`, `deep` (native presets).
+- `timeline_limit` optional: number of items before/after focus (default 5, max 20).
+- `max_chars` optional: compact preview char budget (default 180, min 60, max 500).


GET /recall (and its query params like recall_profile, timeline_limit, max_chars) is documented as a core endpoint, but there is no /recall handler in the current HTTP server (internal/server/server.go). If the intended endpoint is /context, /search, or /timeline, the docs should be adjusted accordingly.

Copilot · 2026-04-08T23:22:42Z

README_CONTEXT.md

+Example request body:
+{
+  "agent_source": "codex",
+  "content": "Refactored recall query to return chronological timeline around focus.",
+  "metadata": {
+    "file_path": "internal/store/store.go",
+    "status": "in_progress",
+    "priority": "high"
+  }
+}


The JSON example request body is not inside a fenced code block, so it won’t render cleanly in Markdown and is easy to miscopy. Consider wrapping the example in a ```json fenced block (and likewise for other raw payload examples).

Copilot · 2026-04-08T23:22:42Z

README_CONTEXT.md

+| Content Type | Compression Rule |
+|---|---|
+| Go/Python-like code | Keep declaration line (`func ...`, `class ...`, or first structural line). |
+| System logs | Keep last non-empty line; if HTTP/server error code exists, emit `Error <code>`. |
+| Markdown | Keep only heading lines (`#`, `##`) and join compactly. |
+| Plain text / notes | Normalize repeated newlines and truncate to 180 chars with ellipsis. |
+


This Markdown table uses leading double pipes (e.g., "|| Content Type | Compression Rule |"), which doesn’t render as a table in standard Markdown. Please convert it to a normal pipe table format so it renders correctly in GitHub.

Copilot · 2026-04-08T23:22:42Z

README_CONTEXT.md

+### GET /
+
+Returns an embedded dark-mode monitoring dashboard (HTML) rendered directly by the Go server.
+
+Behavior:
+- Shows the latest 20 rows from `context_logs`.
+- Colors `agent_source` badges (`codex` in blue, `gemini` in purple, `system` in green).
+- Auto-refreshes every 5 seconds using a lightweight fetch call.
+
+### GET /dashboard/logs
+
+Returns JSON rows for dashboard live-refresh.
+
+Query params:
+- `limit` optional: defaults to 20, capped at 200.
+
+Response behavior:
+- Ordered newest-to-oldest.
+- Includes `timestamp_human` for display formatting.
+


The root/dashboard endpoints documented here (GET / returning an HTML dashboard and GET /dashboard/logs) don’t appear to exist in the current HTTP server routes (internal/server/server.go). As written, readers will expect these endpoints to work but they won’t. Please either document the actual endpoints (e.g., /health, /stats, etc.) or remove/relocate this section.

Copilot · 2026-04-08T23:22:42Z

README_CONTEXT.md

+## SSD Sidecar (.engram-ssd.md)
+
+The server keeps a small sidecar file in the project root: `.engram-ssd.md`.
+
+Behavior:
+- On `engram serve` startup: if the file does not exist, bootstrap it from the latest 15 context logs.
+- On every successful `POST /remember`: trigger async SSD refresh (non-blocking HTTP response).
+- File write is atomic (temp file + rename) to avoid partial reads by agents.
+


The SSD sidecar behavior (.engram-ssd.md creation/refresh, bootstrap from latest 15 “context logs”, refresh on POST /remember) is documented as implemented behavior, but there is no reference to .engram-ssd.md in the repo and no /remember endpoint. Please avoid stating this as a runtime guarantee unless the feature exists, or clearly mark it as a proposed/experimental contract with pointers to the implementation.

Copilot · 2026-04-08T23:22:43Z

README_CONTEXT.md

+# Context Bus Manual for AI Agents
+
+This document defines how AI agents should write and recall shared context in Engram.
+
+## Purpose
+
+Engram Context Bus enables multiple agents to share short, high-signal context logs through a local REST API backed by SQLite.
+
+Goals:
+- Preserve work progress across sessions.
+- Keep memory entries compact and searchable.
+- Allow agent-aware handoff between Codex, Gemini, and system processes.


PR title/description indicate this is primarily MCP transport troubleshooting guidance, but this file introduces a full “Context Bus Manual” (API contract, compression rules, SSD sidecar, etc.) well beyond the described scope. Either trim this file to the intended troubleshooting section or expand the PR description to explain why this larger, new API/manual belongs in this change.

docs(codex): add MCP transport troubleshooting and restart policy

7f99aea

Copilot AI review requested due to automatic review settings April 8, 2026 23:17

Copilot started reviewing on behalf of Xangel0s April 8, 2026 23:18 View session

Copilot AI reviewed Apr 8, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs(codex): add MCP transport troubleshooting and restart policy#166

docs(codex): add MCP transport troubleshooting and restart policy#166
Xangel0s wants to merge 1 commit intoGentleman-Programming:mainfrom
Xangel0s:docs/codex-mcp-transport-troubleshooting

Xangel0s commented Apr 8, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Apr 8, 2026

Uh oh!

Copilot AI Apr 8, 2026

Uh oh!

Copilot AI Apr 8, 2026

Uh oh!

Copilot AI Apr 8, 2026

Uh oh!

Copilot AI Apr 8, 2026

Uh oh!

Copilot AI Apr 8, 2026

Uh oh!

Copilot AI Apr 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Xangel0s commented Apr 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Problem

Root Cause

Changes

Recovery Sequence (documented)

Why This Helps

Scope and Risk

Validation

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Apr 8, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 8, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 8, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 8, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 8, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 8, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 8, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Xangel0s commented Apr 8, 2026 •

edited

Loading