From b0e56868176eee3832e15370b1e8f6d5ac43f8a4 Mon Sep 17 00:00:00 2001
From: aorlov
Date: Fri, 3 Apr 2026 00:44:02 +0200
Subject: [PATCH 01/19] feat: refactor MCP, fix Codex errors, reorganize AI agents documentation and add new content

- Removed outdated GDPval benchmark command from evals section in AGENTS.md.
- Updated the structure of the docs.json file to categorize AI agents under "MCP" and "Agents" groups, adding new pages for MCP and skills.
- Introduced new documentation files for best practices, debugging, eval results, integrations, and skills, providing comprehensive guidance on using SuperDoc tools with LLMs.
- Added detailed instructions on how to use the MCP server and its debugging features, enhancing the overall documentation for better user experience.
---
 AGENTS.md | 1 -
 apps/docs/ai/agents/best-practices.mdx | 171 +++++
 apps/docs/ai/agents/debugging.mdx | 142 ++++
 apps/docs/ai/agents/eval-results.mdx | 122 ++++
 apps/docs/ai/agents/integrations.mdx | 615 ++++++++++++++++++
 apps/docs/ai/agents/llm-tools.mdx | 388 +++++++++++
 apps/docs/ai/agents/skills.mdx | 18 +
 apps/docs/ai/mcp/debugging.mdx | 57 ++
 apps/docs/ai/mcp/how-to-use.mdx | 107 +++
 apps/docs/ai/mcp/overview.mdx | 121 ++++
 apps/docs/docs.json | 53 +-
 apps/mcp/src/index.ts | 79 +--
 apps/mcp/src/server.ts | 47 ++
 .../src/__tests__/contract-integrity.test.ts | 18 +-
 .../sdk/codegen/src/generate-intent-tools.mjs | 26 +-
 .../sdk/langs/browser/src/intent-dispatch.ts | 4 +
 .../sdk/langs/browser/src/system-prompt.ts | 243 ++++---
 packages/sdk/langs/node/src/index.ts | 9 +-
 packages/sdk/langs/node/src/tools.ts | 24 +-
 .../sdk/langs/python/superdoc/__init__.py | 2 +
 .../sdk/langs/python/superdoc/tools_api.py | 24 +-
 packages/sdk/scripts/sdk-generate.mjs | 4 +-
 packages/sdk/scripts/sdk-validate.mjs | 1 +
 .../prompt-templates/system-prompt-core.md | 205 ++++++
 .../system-prompt-mcp-header.md | 19 +
 .../system-prompt-sdk-header.md | 5 +
 packages/sdk/tools/system-prompt-mcp.md | 223 +++++++
 packages/sdk/tools/system-prompt.md | 21 +-
 28 files changed, 2584 insertions(+), 165 deletions(-)
 create mode 100644 apps/docs/ai/agents/best-practices.mdx
 create mode 100644 apps/docs/ai/agents/debugging.mdx
 create mode 100644 apps/docs/ai/agents/eval-results.mdx
 create mode 100644 apps/docs/ai/agents/integrations.mdx
 create mode 100644 apps/docs/ai/agents/llm-tools.mdx
 create mode 100644 apps/docs/ai/agents/skills.mdx
 create mode 100644 apps/docs/ai/mcp/debugging.mdx
 create mode 100644 apps/docs/ai/mcp/how-to-use.mdx
 create mode 100644 apps/docs/ai/mcp/overview.mdx
 create mode 100644 apps/mcp/src/server.ts
 create mode 100644 packages/sdk/tools/prompt-templates/system-prompt-core.md
 create mode 100644 packages/sdk/tools/prompt-templates/system-prompt-mcp-header.md
 create mode 100644 packages/sdk/tools/prompt-templates/system-prompt-sdk-header.md
 create mode 100644 packages/sdk/tools/system-prompt-mcp.md

diff --git a/AGENTS.md b/AGENTS.md
index 2b66d37665..5bb3508bbe 100644
--- a/AGENTS.md
+++ b/AGENTS.md
@@ -124,7 +124,6 @@ The `evals/` directory contains a Promptfoo-based evaluation suite for validatin
 |---------|-------------|------|
 | `pnpm --filter @superdoc-testing/evals run eval` | Run deterministic evals (reading + argument tests) | ~$0.30 |
 | `pnpm --filter @superdoc-testing/evals run eval:reading` | Run reading tool tests only | ~$0.15 |
-| `pnpm --filter @superdoc-testing/evals run eval:gdpval` | Run GDPval benchmark (Model+SuperDoc vs Model-Only) | ~$1-2 |
 | `pnpm --filter @superdoc-testing/evals run eval:view` | Open Promptfoo web UI with results | Free |
 | `pnpm --filter @superdoc-testing/evals run baseline:save