security: auth consolidation, vault singleton, cwd hardening, LLM prompt sanitization by thebtf · Pull Request #10 · thebtf/engram

thebtf · 2026-03-16T22:51:17Z

Summary

Addresses all findings from security audit .agent/reports/security-audit-report.md.

HIGH fixes

S8-001: Moved 15 admin endpoints into authenticated route group (/api/update/apply, /api/restart, /api/logs, /api/backfill/session, etc.)
S9-002: Startup warning when ENGRAM_API_TOKEN unset on non-loopback bind
S10-008: Auth failure logging with path + remote_addr

MEDIUM fixes

S5-004: Vault singleton via sync.Once on Service struct (no more per-call crypto.NewVault)
S9-003: Override client-provided cwd="" in HTTP handlers to prevent filesystem probing
AG06-005: Sanitize transcript XML boundary tags before LLM prompt embedding

Kept outside auth (public, read-only)

/health, /api/ready, /api/version, MCP endpoints, static files

Test plan

go build ./cmd/worker/
go test ./internal/worker/... ./internal/privacy/... ./internal/crypto/...
Deploy, verify endpoints return 401 without token
Verify vault credentials still work after singleton migration
Verify context injection works without cwd

Summary by CodeRabbit

Примечания к выпуску

Bug Fixes
- Отключено серверное обнаружение путей (cwd) для предотвращения проб и утечек файловой системы.
- Транскрипты теперь проходят санитизацию — XML-подобные теги экранируются.
Security Improvements
- Административные эндпоинты объединены под требование токена; добавлены предупреждения при отключённой или недействительной аутентификации.
- Доступ к секретам централизован; ошибки доступа логируются и не приводят к аварии.
Chores
- Ленивая, кэширующая инициализация хранилища секретов и сопутствующие логирования.
- Плагин обновлён до версии 1.2.1; наблюдатель файлов теперь запускается асинхронно (start возвращает Promise).

…9-003) S5-004: Add vault singleton to Service struct with sync.Once initialization. Replace per-call crypto.NewVault(config.Get()) in vaultStoreDetectedSecrets with s.getVault(), eliminating vault fragmentation and redundant key reads. S9-003: Override client-provided cwd to empty string in both HTTP handlers (handleSearchByPrompt, handleContextInject) before any filesystem operations. The server has no access to client filesystems; cwd-based mtime checks are client-side concerns only. Prevents filesystem probing via cwd parameter.

Strip XML boundary tags from transcript content before embedding in LLM prompts to prevent prompt injection via crafted session content.

coderabbitai · 2026-03-16T22:51:30Z

Caution

Review failed

The pull request is closed.

ℹ️ Recent review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: f44fb0af-1899-4a2b-bd72-16ad6dd123db

📥 Commits

Reviewing files that changed from the base of the PR and between ae1a5d5 and d78640d.

📒 Files selected for processing (2)

plugin/openclaw-engram/package.json
plugin/openclaw-engram/src/services/file-watcher.ts

Walkthrough

Добавлены санация XML-подобных тегов в сборке промпта для backfill, ленивый singleton-доступ к Vault через Service.getVault(), принудительная очистка cwd в контекстных обработчиках и улучшенное логирование/проверки токена; административные маршруты сгруппированы под защищённой группой tokenAuth.

Changes

Cohort / File(s)	Summary
Санитизация входа `internal/backfill/extract/extract.go`	Добавлено регулярное выражение для XML-подобных тегов и шаг санации транскрипта в `BuildUserPrompt`; шаблон теперь получает санитизированный текст вместо сырых угловых скобок.
Vault — ленивый singleton `internal/worker/service.go`, `internal/worker/handlers_backfill.go`	В `Service` добавлены поля `vault`, `vaultOnce`, `vaultErr` и метод `getVault()`; обработчики backfill теперь получают Vault через `s.getVault()` и корректно обрабатывают отсутствие Vault без прямых вызовов глобального конфига/crypto.
Контекст/безопасность файловой системы `internal/worker/handlers_context.go`	После разбора параметров `cwd` значение теперь принудительно сбрасывается в пустую строку, отключая серверную проверку FS/mtimes, зависящую от client-provided `cwd`.
Аутентификация и маршрутизация `internal/worker/middleware.go`, `internal/worker/service.go`	Добавлены предупреждения при отсутствии `ENGRAM_API_TOKEN`, улучшены логи при неверном токене (включая путь и `remote_addr`); административные маршруты сгруппированы под `tokenAuth` (реорганизация регистраций маршрутов).
Плагины / мелкие изменения `plugin/openclaw-engram/package.json`, `plugin/openclaw-engram/src/services/file-watcher.ts`	Мажорный/минимальный: версия плагина увеличена; `FileWatcherService.start` стал асинхронным с ленивым динамическим импортом `chokidar` (type-only статический импорт заменён на динамический).

Sequence Diagram(s)

sequenceDiagram
    participant Client
    participant Middleware
    participant Service
    participant Handlers
    participant Vault

    Client->>Middleware: HTTP запрос (с токеном/без)
    Middleware->>Service: проверка токена
    alt токен отсутствует/невалиден
        Middleware-->>Client: 401 / warning лог
    else токен валиден или аутентификация отключена
        Middleware->>Service: передать запрос
        Service->>Handlers: маршрутизация (admin/общий)
        Handlers->>Service: запрошен доступ к Vault
        Service->>Vault: s.getVault() (ленивая инициализация)
        alt Vault инициализирован
            Vault-->>Service: возвращает доступ
            Service-->>Handlers: отдаёт Vault/секреты
            Handlers-->>Client: ответ (с учётом секретов)
        else Vault недоступен
            Service-->>Handlers: возвращает ошибку/nil
            Handlers-->>Client: ответ без Vault (лог/предупреждение)
        end
    end

Estimated code review effort

🎯 3 (Умеренная) | ⏱️ ~25 минут

Possibly related PRs

security: auth consolidation, vault singleton, cwd hardening, LLM prompt sanitization #10: очень близко по изменениям — содержит те же правки по XML-санации в extract.go, ленивому getVault() и очистке cwd.
fix: post-merge cleanup — vault refactor + secret detection #4: тот же рефакторинг Vault — перенос состояния Vault в Service и добавление ленивого инициализатора getVault().
feat: self-learning utility signals #1: связанный PR с введением санации угловых скобок/HTML/XML в транскриптах для генерации промптов.

Poem

🐇 Я — кролик, тихо колышу усы,
Теги в скобки превращаю без спешки,
Секрет храню в синглтоне под замком,
Токен проверяю — путь в лог ложу,
Прыгаю в коде, где порядок и дружок.

🚥 Pre-merge checks | ✅ 3

✅ Passed checks (3 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title comprehensively summarizes the four main security improvements: auth consolidation, vault singleton pattern, cwd hardening, and LLM prompt sanitization.
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

📝 Generate docstrings (stacked PR)
📝 Generate docstrings (commit on current branch)

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch feat/security-hardening

📝 Coding Plan

Generate coding plan for human review comments

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

gemini-code-assist · 2026-03-16T22:51:33Z

Summary of Changes

Hello, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly enhances the application's security posture by addressing multiple findings from a recent security audit. The changes focus on centralizing authentication for sensitive API endpoints, improving the secure handling of cryptographic operations through a vault singleton, mitigating filesystem probing risks, and preventing prompt injection vulnerabilities in LLM interactions. Additionally, it introduces clearer warnings and logging for security-related events.

Highlights

Authentication Consolidation (S8-001): Moved 15 administrative and management endpoints, including update, restart, logging, and backfill functionalities, into a new authenticated route group, ensuring these critical operations require a valid API token.
Vault Singleton (S5-004): Implemented a singleton pattern for the crypto.Vault using sync.Once within the Service struct, ensuring that the vault is initialized only once and its state is consistently managed across the application.
CWD Hardening (S9-003): Overrode client-provided current working directory (CWD) values in HTTP handlers to an empty string, preventing potential filesystem probing vulnerabilities.
LLM Prompt Sanitization (AG06-005): Added sanitization for transcript content before embedding it into LLM prompts, specifically replacing XML-like boundary tags to prevent injection attacks.
Startup Warning for Unauthenticated Bind (S9-002): Introduced a warning log at startup if the server is listening on a non-loopback address without the ENGRAM_API_TOKEN being set, alerting users to an insecure configuration.
Auth Failure Logging (S10-008): Enhanced authentication failure logging to include the request path and remote address, providing better visibility into unauthorized access attempts.

Changelog

internal/backfill/extract/extract.go
- Added XML tag sanitization to the BuildUserPrompt function to prevent LLM prompt injection.
internal/worker/handlers_backfill.go
- Removed direct imports for config and crypto packages.
- Updated vaultStoreDetectedSecrets to use the service's new getVault method for vault access.
internal/worker/handlers_context.go
- Modified handleSearchByPrompt and handleContextInject to explicitly set cwd to an empty string, ignoring client-provided values.
internal/worker/middleware.go
- Imported the zerolog/log package.
- Added a warning log in NewTokenAuth if ENGRAM_API_TOKEN is not configured.
- Implemented logging for unauthorized access attempts in TokenAuth.Middleware, including request path and remote address.
internal/worker/service.go
- Imported the crypto package.
- Added fields (vault, vaultOnce, vaultErr) to the Service struct to support a crypto.Vault singleton.
- Implemented the getVault method to provide a thread-safe, cached crypto.Vault instance.
- Reorganized API routes by moving numerous administrative and management endpoints into an authenticated chi.Router group.
- Added a startup warning in the Start method if the server binds to a non-loopback address without authentication enabled.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for GitHub and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

thebtf · 2026-03-16T22:52:08Z

@coderabbitai review focus:security

thebtf · 2026-03-16T22:52:11Z

@gemini-code-assist review focus:security

thebtf · 2026-03-16T22:52:14Z

@codex review

coderabbitai · 2026-03-16T22:52:21Z

@thebtf Sure! I'll conduct a security-focused review of this PR right away.

Conduct a thorough security-focused review. Pay special attention to: (1) whether all admin/sensitive endpoints are properly protected behind authentication middleware, (2) correctness of the vault singleton implementation using sync.Once, (3) the cwd override logic to prevent filesystem probing, (4) LLM prompt sanitization against XML/prompt-injection attacks, (5) auth failure logging completeness (path + remote_addr), and (6) the startup warning when ENGRAM_API_TOKEN is unset on non-loopback binds. Flag any endpoints that may have been missed in the auth consolidation, any race conditions in the singleton, or any bypass vectors in the sanitization logic.

✅ Actions performed

Full review triggered.

gemini-code-assist

Code Review

This pull request introduces several important security enhancements and refactorings. The consolidation of admin endpoints under a single authentication middleware is a significant improvement, as is the introduction of a singleton pattern for the vault service, which improves both performance and resource management. The hardening of HTTP handlers by ignoring client-provided cwd and the sanitization of LLM prompts are also valuable security fixes. My review includes one suggestion to make the prompt sanitization logic more robust using a regular expression.

gemini-code-assist · 2026-03-16T22:53:30Z

+	// Strip XML-like tags that could break the prompt boundary
+	sanitized := strings.ReplaceAll(transcript, "</session_transcript>", "[/session_transcript]")
+	sanitized = strings.ReplaceAll(sanitized, "</observations>", "[/observations]")
+	sanitized = strings.ReplaceAll(sanitized, "<observation>", "[observation]")
+	return fmt.Sprintf(UserPromptTemplate, projectPath, gitBranch, durationMin, exchangeCount, chunkInfo, alreadyExtracted, sanitized)


The current sanitization logic is a good step towards preventing prompt injection, but it's a bit brittle as it only handles three specific hardcoded tags. A more robust approach would be to use a regular expression to replace any XML-like tags. This would handle variations in spacing, attributes, and other tags that might be present in the transcript, providing stronger protection against prompt boundary confusion.

For example, you could use a single regex to handle all cases. Note that you will need to add regexp to your imports.

// It's good practice to compile regexes once at the package level. var xmlTagRegex = regexp.MustCompile(`<([^>]+)>`) // ... in BuildUserPrompt ... sanitized := xmlTagRegex.ReplaceAllString(transcript, "[$1]")

This would replace <tag> with [tag] and </tag> with [/tag], which is more comprehensive and maintainable.

Suggested change

// Strip XML-like tags that could break the prompt boundary

sanitized := strings.ReplaceAll(transcript, "</session_transcript>", "[/session_transcript]")

sanitized = strings.ReplaceAll(sanitized, "</observations>", "[/observations]")

sanitized = strings.ReplaceAll(sanitized, "<observation>", "[observation]")

return fmt.Sprintf(UserPromptTemplate, projectPath, gitBranch, durationMin, exchangeCount, chunkInfo, alreadyExtracted, sanitized)

// Strip XML-like tags that could break the prompt boundary.

// A regex is more robust than replacing specific hardcoded tags.

re := regexp.MustCompile(`<([^>]+)>`)

sanitized := re.ReplaceAllString(transcript, "[$1]")

return fmt.Sprintf(UserPromptTemplate, projectPath, gitBranch, durationMin, exchangeCount, chunkInfo, alreadyExtracted, sanitized)

coderabbitai

Actionable comments posted: 2

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@internal/backfill/extract/extract.go`:
- Around line 179-183: Sanitization currently only replaces two closing tags and
misses critical tags and case variations; update the sanitization that assigns
sanitized from transcript (the sanitized variable used when formatting
UserPromptTemplate) to replace all of these tags: <session_transcript>,
</session_transcript>, <observations>, </observations>, <observation>, and
</observation>, and perform the replacements in a case-insensitive manner (e.g.,
use regexp with the (?i) flag or equivalent) so tags like </SESSION_TRANSCRIPT>
are handled consistently; ensure the code replaces the original tags with safe
bracketed forms before calling fmt.Sprintf(UserPromptTemplate, ...) and remove
the old incomplete ReplaceAll calls.

In `@internal/worker/service.go`:
- Around line 1568-1606: Remove the redundant middleware application inside the
admin routes group: delete the r.Use(s.tokenAuth.Middleware) call in the
s.router.Group block so the group no longer re-applies authentication already
set up globally in setupMiddleware(); leave the route registrations (e.g.,
s.handleRebuildStatus, s.handleTriggerVectorRebuild, s.handleVectorHealth,
s.handleVectorMetrics, s.handleGraphStats, s.handleUpdateCheck,
s.handleUpdateApply, s.handleUpdateStatus, s.handleUpdateRestart,
s.handleRestart, s.handleSelfCheck, s.sseBroadcaster.HandleSSE, s.handleGetLogs,
s.handleInstinctsImport, s.handleBackfillIngest, s.handleBackfillSession,
s.handleBackfillStatus) intact and rely on the global tokenAuth.Middleware and
its ExemptPaths mechanism (defined in middleware.go) for correct authentication
behavior.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 1ded6ade-db32-4c14-bb35-3ea7df2ed639

📥 Commits

Reviewing files that changed from the base of the PR and between 52ef629 and 04295bc.

📒 Files selected for processing (5)

internal/backfill/extract/extract.go
internal/worker/handlers_backfill.go
internal/worker/handlers_context.go
internal/worker/middleware.go
internal/worker/service.go

coderabbitai · 2026-03-16T22:55:58Z

+	// Strip XML-like tags that could break the prompt boundary
+	sanitized := strings.ReplaceAll(transcript, "</session_transcript>", "[/session_transcript]")
+	sanitized = strings.ReplaceAll(sanitized, "</observations>", "[/observations]")
+	sanitized = strings.ReplaceAll(sanitized, "<observation>", "[observation]")
+	return fmt.Sprintf(UserPromptTemplate, projectPath, gitBranch, durationMin, exchangeCount, chunkInfo, alreadyExtracted, sanitized)


⚠️ Potential issue | 🟠 Major

🧩 Analysis chain

🌐 Web query:

What XML tag injection techniques exist for LLM prompt injection attacks?

💡 Result:

Common XML tag–based prompt-injection techniques (seen when apps “structure” prompts with XML-like delimiters and then concatenate untrusted text into those sections) include:

Tag boundary break-out (early close + new role/tag): attacker input includes a closing tag for the “user/content” region and then opens an “assistant/system/instructions” region so the model treats attacker text as higher-priority instructions (the classic XML-tag boundary exploitation pattern). [1]

Tag spoofing / role impersonation: attacker wraps their content in tags that look like the application’s control tags (e.g., “instructions”, “system”, “assistant”), betting the model will follow them even if the app didn’t intend to grant that authority. This risk exists specifically because XML tags are recommended/used as separators in some LLM prompt formats. [2][3]

Parser/model mismatch: the application may parse <message> (or similar) tags into chat messages; if the app’s parsing/validation is lax, attacker-controlled tags can be interpreted as additional messages/roles (message-tag injection). [4]

Encoding & normalization bypasses around < / >: use HTML entities, Unicode lookalikes, or zero-width characters so input evades naïve “block angle brackets” filters, but is later decoded/normalized (or still “understood” by the model as tags). [1]

“Hidden instruction” zones inside XML constructs: put instructions in places humans/tools may overlook but models may still heed, such as comments (), CDATA sections, long attribute values, or deeply nested structures (often combined with “ignore previous instructions”–style directives). (General prompt-injection pattern; XML is just the carrier.) [5]

Tag collision with orchestration templates: when platforms/frameworks use fixed XML sections like <examples>, <instructions>, etc., attacker input that collides with those boundaries can cause the model to re-interpret where examples/instructions begin/end. [2][3]

A common mitigation specifically discussed for XML-structured prompts is “salted tags” (session-specific tag names) to reduce tag spoofing/collision. [6]

Sources: [1] dataset write-up of “Anthropic XML Tag Boundary Exploitation” pattern; [2] Anthropic XML-tag prompting guidance; [3] AWS Bedrock advanced prompt templates mentioning XML tags; [4] Microsoft Semantic Kernel note on parsing <message> tags and controlling injection; [5] OWASP Prompt Injection overview; [6] AWS prescriptive guidance PDF describing salted tags. [1][2][3][4][5][6]

🏁 Script executed:

# Find and examine the extract.go file find . -name "extract.go" -path "*/internal/backfill/extract/*" -type f

Repository: thebtf/engram

Length of output: 95

🏁 Script executed:

# Once located, examine the full BuildUserPrompt function and UserPromptTemplate cat -n internal/backfill/extract/extract.go | head -250 | tail -100

Repository: thebtf/engram

Length of output: 4261

🏁 Script executed:

# Search for UserPromptTemplate definition rg "UserPromptTemplate\s*=" internal/backfill/extract/

Repository: thebtf/engram

Length of output: 203

🏁 Script executed:

# Check if there are opening tags that need sanitization rg "<session_transcript>|<observations>|<observation>" internal/backfill/extract/extract.go

Repository: thebtf/engram

Length of output: 185

🏁 Script executed:

# Get the full UserPromptTemplate definition rg "const UserPromptTemplate" -A 100 internal/backfill/extract/extract.go

Repository: thebtf/engram

Length of output: 2962

🏁 Script executed:

# Check where and how the tags are used in the template rg "<session_transcript>|</session_transcript>|<observations>|</observations>|<observation>|</observation>" -n internal/backfill/extract/extract.go

Repository: thebtf/engram

Length of output: 447

🏁 Script executed:

# Search for tests related to BuildUserPrompt or sanitization fd "test" internal/backfill/extract/ --type f

Repository: thebtf/engram

Length of output: 39

🏁 Script executed:

# Search for AG06-005 reference in code or docs rg "AG06-005" -B 3 -A 3

Repository: thebtf/engram

Length of output: 757

🏁 Script executed:

# Check if there are any examples or comments about the threat model rg "prompt injection|sanitiz|transcript" internal/backfill/extract/extract.go -B 2 -A 2

Repository: thebtf/engram

Length of output: 1595

Неполная санитизация XML-тегов: отсутствуют критические открывающие и закрывающие теги.

Текущая реализация защищает от разрыва границ </session_transcript> и </observations>, но пропускает важные векторы атаки:

Отсутствует <session_transcript> — инъекция могла бы создать вложенную структуру и запутать парсер LLM

Отсутствует </observation> — инъекция могла бы преждевременно закрыть наблюдение

Регистронезависимость не реализована — </SESSION_TRANSCRIPT> или </Session_Transcript> не будут заменены

Кроме того, предложенный в коде подход с циклом неэффективен и может привести к проблемам с Юникодом.

Рекомендуется использовать простой и надёжный подход, например регулярные выражения с флагом регистронезависимости или тщательную обработку без циклов:

// Более надёжный вариант с использованием простых замен var tagsToReplace = [][2]string{ {"<session_transcript>", "[session_transcript]"}, {"</session_transcript>", "[/session_transcript]"}, {"<observations>", "[observations]"}, {"</observations>", "[/observations]"}, {"<observation>", "[observation]"}, {"</observation>", "[/observation]"}, } sanitized := transcript for _, pair := range tagsToReplace { sanitized = strings.ReplaceAll(sanitized, pair[0], pair[1]) }

Для обработки разных регистров требуется более тщательное решение (например, через регулярные выражения с флагом (?i)).

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@internal/backfill/extract/extract.go` around lines 179 - 183, Sanitization currently only replaces two closing tags and misses critical tags and case variations; update the sanitization that assigns sanitized from transcript (the sanitized variable used when formatting UserPromptTemplate) to replace all of these tags: <session_transcript>, </session_transcript>, <observations>, </observations>, <observation>, and </observation>, and perform the replacements in a case-insensitive manner (e.g., use regexp with the (?i) flag or equivalent) so tags like </SESSION_TRANSCRIPT> are handled consistently; ensure the code replaces the original tags with safe bracketed forms before calling fmt.Sprintf(UserPromptTemplate, ...) and remove the old incomplete ReplaceAll calls.

gemini-code-assist

Code Review

This pull request introduces several significant security enhancements. The consolidation of administrative endpoints under a new authenticated route group is a great step forward. The introduction of a vault singleton improves performance and resource management. Additionally, the hardening of CWD handling and LLM prompt sanitization effectively address potential vulnerabilities. My review includes a suggestion to improve the robustness of the prompt sanitization and points out a potential redundancy in the application of authentication middleware.

gemini-code-assist · 2026-03-16T22:56:09Z

+	sanitized := strings.ReplaceAll(transcript, "</session_transcript>", "[/session_transcript]")
+	sanitized = strings.ReplaceAll(sanitized, "</observations>", "[/observations]")
+	sanitized = strings.ReplaceAll(sanitized, "<observation>", "[observation]")


The current sanitization logic uses a denylist of specific tags, which is brittle and incomplete. For example, it misses the opening <observations> tag and the closing </observation> tag, which could still allow for prompt injection.

Using strings.Replacer is also more efficient and maintainable than chaining strings.ReplaceAll calls.

sanitized := strings.NewReplacer( "</session_transcript>", "[/session_transcript]", "<observations>", "[observations]", "</observations>", "[/observations]", "<observation>", "[observation]", "</observation>", "[/observation]", ).Replace(transcript)

gemini-code-assist · 2026-03-16T22:56:09Z

-	s.router.Post("/api/update/restart", s.handleUpdateRestart)
+	// Admin/management routes — require authentication, work before DB is ready
+	s.router.Group(func(r chi.Router) {
+		r.Use(s.tokenAuth.Middleware)


The s.tokenAuth.Middleware appears to be already applied globally to the router in the setupMiddleware function. Applying it again within this group is redundant. If the intention is to only authenticate these admin routes, the global middleware should be removed. If the global middleware is intended, then this line is unnecessary.

coderabbitai

Actionable comments posted: 1

♻️ Duplicate comments (1)

internal/backfill/extract/extract.go (1)

177-183: ⚠️ Potential issue | 🟠 Major

Неполная санация boundary-тегов оставляет обход prompt-инъекции.

Сейчас фильтруются только </session_transcript>, </observations> и <observation>. Остаются <session_transcript>, <observations>, </observation> и варианты регистра (например, </SESSION_TRANSCRIPT>), поэтому разрыв XML-границ всё ещё возможен.

💡 Предложение правки

 import (
 	"encoding/xml"
 	"fmt"
+	"regexp"
 	"strings"
@@
 )
+
+var promptBoundaryTagRE = regexp.MustCompile(`(?i)</?(session_transcript|observations|observation)>`)
@@
 func BuildUserPrompt(projectPath, gitBranch string, durationMin, exchangeCount int, chunkInfo, alreadyExtracted, transcript string) string {
-	// Strip XML-like tags that could break the prompt boundary
-	sanitized := strings.ReplaceAll(transcript, "</session_transcript>", "[/session_transcript]")
-	sanitized = strings.ReplaceAll(sanitized, "</observations>", "[/observations]")
-	sanitized = strings.ReplaceAll(sanitized, "<observation>", "[observation]")
+	// Strip XML-like tags that could break the prompt boundary
+	sanitized := promptBoundaryTagRE.ReplaceAllStringFunc(transcript, func(tag string) string {
+		switch strings.ToLower(tag) {
+		case "<session_transcript>":
+			return "[session_transcript]"
+		case "</session_transcript>":
+			return "[/session_transcript]"
+		case "<observations>":
+			return "[observations]"
+		case "</observations>":
+			return "[/observations]"
+		case "<observation>":
+			return "[observation]"
+		case "</observation>":
+			return "[/observation]"
+		default:
+			return tag
+		}
+	})
 	return fmt.Sprintf(UserPromptTemplate, projectPath, gitBranch, durationMin, exchangeCount, chunkInfo, alreadyExtracted, sanitized)
 }

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@internal/backfill/extract/extract.go` around lines 177 - 183,
BuildUserPrompt's current sanitization only replaces a few exact-tag strings and
misses opening tags, closing </observation>, and different casing, leaving a
prompt-injection vector; update the sanitization of the transcript variable in
BuildUserPrompt to normalize/escape all relevant XML-like tags
case-insensitively (at minimum both opening and closing variants of
session_transcript, observations, and observation) — e.g., perform
case-insensitive replacements or use a regex that matches
(?i)</?session_transcript>, (?i)</?observations>, (?i)</?observation> and
replace them with safe bracketed or escaped equivalents so no angle-bracket
boundaries remain in the final formatted prompt.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@internal/worker/handlers_context.go`:
- Around line 780-784: После присвоения cwd = "" функция handleContextInject всё
ещё вызывает sdk.GetFileMtimes(...) и выполняет проверки mtime на сервере;
оградите все вызовы sdk.GetFileMtimes в handleContextInject условием, например
if cwd != "" { ... }, чтобы эти блоки выполнялись только на клиенте; обновите
связанные ветви (вызовы sdk.GetFileMtimes и последующую логику обработки
результатов) чтобы полностью пропускать mtime-проверки и обработку
ошибок/логирования на сервере; проверьте все вхождения sdk.GetFileMtimes в
handleContextInject и внесите одинаковую защиту.

---

Duplicate comments:
In `@internal/backfill/extract/extract.go`:
- Around line 177-183: BuildUserPrompt's current sanitization only replaces a
few exact-tag strings and misses opening tags, closing </observation>, and
different casing, leaving a prompt-injection vector; update the sanitization of
the transcript variable in BuildUserPrompt to normalize/escape all relevant
XML-like tags case-insensitively (at minimum both opening and closing variants
of session_transcript, observations, and observation) — e.g., perform
case-insensitive replacements or use a regex that matches
(?i)</?session_transcript>, (?i)</?observations>, (?i)</?observation> and
replace them with safe bracketed or escaped equivalents so no angle-bracket
boundaries remain in the final formatted prompt.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 7426233b-bbab-4d63-bcfb-07275411e746

📥 Commits

Reviewing files that changed from the base of the PR and between 52ef629 and 04295bc.

📒 Files selected for processing (5)

internal/backfill/extract/extract.go
internal/worker/handlers_backfill.go
internal/worker/handlers_context.go
internal/worker/middleware.go
internal/worker/service.go

coderabbitai · 2026-03-16T23:00:32Z

+	// Server-side: ignore client-provided cwd to prevent filesystem probing (S9-003).
+	// File mtime staleness checks are only meaningful on the client; the server has no
+	// access to client filesystems.
+	cwd = ""



⚠️ Potential issue | 🟠 Major

После cwd = "" сервер всё ещё выполняет mtime-проверки файлов.

Несмотря на новый комментарий, в handleContextInject вызовы sdk.GetFileMtimes(...) остаются активными (Line 828, Line 916, Line 947) без условия cwd != "". Это противоречит заявленной модели и оставляет нежелательный server-side probing путь.

💡 Предложение правки

- if len(obs.FileMtimes) > 0 { + if len(obs.FileMtimes) > 0 && cwd != "" { @@ - if len(obs.FileMtimes) > 0 { + if len(obs.FileMtimes) > 0 && cwd != "" { @@ - if len(obs.FileMtimes) > 0 { + if len(obs.FileMtimes) > 0 && cwd != "" {

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@internal/worker/handlers_context.go` around lines 780 - 784, После присвоения cwd = "" функция handleContextInject всё ещё вызывает sdk.GetFileMtimes(...) и выполняет проверки mtime на сервере; оградите все вызовы sdk.GetFileMtimes в handleContextInject условием, например if cwd != "" { ... }, чтобы эти блоки выполнялись только на клиенте; обновите связанные ветви (вызовы sdk.GetFileMtimes и последующую логику обработки результатов) чтобы полностью пропускать mtime-проверки и обработку ошибок/логирования на сервере; проверьте все вхождения sdk.GetFileMtimes в handleContextInject и внесите одинаковую защиту.

…ndant auth middleware - AG06-005: replace brittle denylist with regex matching all prompt-boundary XML tags - Remove duplicate tokenAuth.Middleware in admin route group (already applied globally)

coderabbitai

Actionable comments posted: 1

♻️ Duplicate comments (1)

internal/backfill/extract/extract.go (1)

15-15: ⚠️ Potential issue | 🟠 Major

Санитизация остаётся обходной из-за чувствительности regex к регистру.

На Line 15 паттерн не включает (?i), поэтому </SESSION_TRANSCRIPT> / <Observations> не будут заменены и обойдут защиту при сборке prompt (Line 184-187).

Предлагаемое исправление

-var xmlTagRe = regexp.MustCompile(`</?(?:session_transcript|observations?|metadata|no_observations_found)[^>]*>`)
+var xmlTagRe = regexp.MustCompile(`(?i)</?(?:session_transcript|observations?|metadata|no_observations_found)\b[^>]*>`)

#!/bin/bash
set -euo pipefail

echo "1) Текущий regex:"
rg -n 'xmlTagRe\s*=\s*regexp\.MustCompile' internal/backfill/extract/extract.go -A1

echo
echo "2) Проверка наличия case-insensitive флага (?i):"
rg -n '\(\?i\)' internal/backfill/extract/extract.go || true

echo
echo "3) Проверка тестов на mixed/upper case теги в backfill/extract:"
rg -n 'SESSION_TRANSCRIPT|Observations|BuildUserPrompt|xmlTagRe' internal/backfill/extract -g '*_test.go' || true

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@internal/backfill/extract/extract.go` at line 15, The regex assigned to
xmlTagRe (regexp.MustCompile) is case-sensitive and misses mixed/uppercase XML
tags; update the pattern to be case-insensitive by adding the (?i) inline flag
(e.g.,
regexp.MustCompile(`(?i)</?(?:session_transcript|observations?|metadata|no_observations_found)[^>]*>`))
so tags like </SESSION_TRANSCRIPT> or <Observations> are matched and sanitized
before prompt assembly (see BuildUserPrompt usage around prompt construction).
Ensure xmlTagRe is the only change and run/adjust existing tests that assert tag
stripping to include mixed/upper-case examples.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@internal/worker/service.go`:
- Around line 2007-2008: The warning condition in Start() can panic if
s.tokenAuth is nil and misclassifies IPv6 loopback (::1) as non-loopback; update
the check around s.tokenAuth.IsEnabled() to first guard against nil (e.g., if
s.tokenAuth == nil || !s.tokenAuth.IsEnabled()) and replace the simple host
string checks with proper IP loopback detection using net.ParseIP(host) and
ip.IsLoopback() so that log.Warn().Str(...).Msg(...) only runs when
authentication is disabled (or tokenAuth nil) and the parsed host is not a
loopback address.

---

Duplicate comments:
In `@internal/backfill/extract/extract.go`:
- Line 15: The regex assigned to xmlTagRe (regexp.MustCompile) is case-sensitive
and misses mixed/uppercase XML tags; update the pattern to be case-insensitive
by adding the (?i) inline flag (e.g.,
regexp.MustCompile(`(?i)</?(?:session_transcript|observations?|metadata|no_observations_found)[^>]*>`))
so tags like </SESSION_TRANSCRIPT> or <Observations> are matched and sanitized
before prompt assembly (see BuildUserPrompt usage around prompt construction).
Ensure xmlTagRe is the only change and run/adjust existing tests that assert tag
stripping to include mixed/upper-case examples.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: cd4c8573-95bd-4076-99a1-9988fb77cca6

📥 Commits

Reviewing files that changed from the base of the PR and between 04295bc and ae1a5d5.

📒 Files selected for processing (2)

internal/backfill/extract/extract.go
internal/worker/service.go

coderabbitai · 2026-03-16T23:46:55Z

+	if !s.tokenAuth.IsEnabled() && host != "127.0.0.1" && host != "localhost" {
+		log.Warn().Str("bind", fmt.Sprintf("%s:%d", host, port)).Msg("auth: server listening without authentication on non-loopback address — set ENGRAM_API_TOKEN to secure")


⚠️ Potential issue | 🟡 Minor

Условие предупреждения в Start() стоит усилить (nil + IPv6 loopback).

На Line 2007 возможен panic при s.tokenAuth == nil, а ::1 сейчас ошибочно считается non-loopback. Лучше добавить nil-guard и нормальную проверку loopback-адреса.

💡 Предлагаемое исправление

import ( "context" "fmt" + "net" "net/http" "os" "sync" @@ - // Warn if auth is disabled and server is reachable beyond loopback - if !s.tokenAuth.IsEnabled() && host != "127.0.0.1" && host != "localhost" { + // Warn if auth is disabled and server is reachable beyond loopback + isLoopback := host == "localhost" + if ip := net.ParseIP(host); ip != nil && ip.IsLoopback() { + isLoopback = true + } + if s.tokenAuth != nil && !s.tokenAuth.IsEnabled() && !isLoopback { log.Warn().Str("bind", fmt.Sprintf("%s:%d", host, port)).Msg("auth: server listening without authentication on non-loopback address — set ENGRAM_API_TOKEN to secure") }

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

if !s.tokenAuth.IsEnabled() && host != "127.0.0.1" && host != "localhost" {

log.Warn().Str("bind", fmt.Sprintf("%s:%d", host, port)).Msg("auth: server listening without authentication on non-loopback address — set ENGRAM_API_TOKEN to secure")

import (

"context"

"fmt"

"net"

"net/http"

"os"

"sync"

// ... rest of imports

)

// ... other code ...

// Warn if auth is disabled and server is reachable beyond loopback

isLoopback := host == "localhost"

if ip := net.ParseIP(host); ip != nil && ip.IsLoopback() {

isLoopback = true

}

if s.tokenAuth != nil && !s.tokenAuth.IsEnabled() && !isLoopback {

log.Warn().Str("bind", fmt.Sprintf("%s:%d", host, port)).Msg("auth: server listening without authentication on non-loopback address — set ENGRAM_API_TOKEN to secure")

}

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@internal/worker/service.go` around lines 2007 - 2008, The warning condition in Start() can panic if s.tokenAuth is nil and misclassifies IPv6 loopback (::1) as non-loopback; update the check around s.tokenAuth.IsEnabled() to first guard against nil (e.g., if s.tokenAuth == nil || !s.tokenAuth.IsEnabled()) and replace the simple host string checks with proper IP loopback detection using net.ParseIP(host) and ip.IsLoopback() so that log.Warn().Str(...).Msg(...) only runs when authentication is disabled (or tokenAuth nil) and the parsed host is not a loopback address.

Chokidar (native file watcher) was imported at module top level, causing jiti to resolve the entire native dependency tree during plugin discovery. Dynamic import defers this to service.start(), after gateway port is already listening.

thebtf added 2 commits March 17, 2026 01:48

security: sanitize transcript in LLM extraction prompts (AG06-005)

04295bc

Strip XML boundary tags from transcript content before embedding in LLM prompts to prevent prompt injection via crafted session content.

gemini-code-assist Bot reviewed Mar 16, 2026

View reviewed changes

coderabbitai Bot requested changes Mar 16, 2026

View reviewed changes

gemini-code-assist Bot reviewed Mar 16, 2026

View reviewed changes

coderabbitai Bot requested changes Mar 16, 2026

View reviewed changes

fix: address PR review — regex-based prompt sanitization, remove redu…

ae1a5d5

…ndant auth middleware - AG06-005: replace brittle denylist with regex matching all prompt-boundary XML tags - Remove duplicate tokenAuth.Middleware in admin route group (already applied globally)

coderabbitai Bot requested changes Mar 16, 2026

View reviewed changes

thebtf merged commit d78640d into main Mar 17, 2026
1 of 2 checks passed

thebtf temporarily deployed to npm March 17, 2026 07:40 — with GitHub Actions Inactive

This was referenced Apr 13, 2026

review: post-v4.0.0 hotfixes (cmux, constraints, logging, plugin auto-binary) #150

Closed

chore(cleanup): remove legacy MCP/CLI entry points + 6 orphaned backfill funcs (Bucket D / Option A) #176

Merged

coderabbitai Bot mentioned this pull request Apr 24, 2026

refactor: CR-002 post-release cleanup — dead features, env vars, auth-skip-local #203

Merged

thebtf deleted the feat/security-hardening branch May 7, 2026 06:34

coderabbitai Bot mentioned this pull request May 7, 2026

chore: add brand assets + sync infrastructure #212

Merged

		if !s.tokenAuth.IsEnabled() && host != "127.0.0.1" && host != "localhost" {
		log.Warn().Str("bind", fmt.Sprintf("%s:%d", host, port)).Msg("auth: server listening without authentication on non-loopback address — set ENGRAM_API_TOKEN to secure")

-	if !s.tokenAuth.IsEnabled() && host != "127.0.0.1" && host != "localhost" {
-		log.Warn().Str("bind", fmt.Sprintf("%s:%d", host, port)).Msg("auth: server listening without authentication on non-loopback address — set ENGRAM_API_TOKEN to secure")
+import (
+	"context"
+	"fmt"
+	"net"
+	"net/http"
+	"os"
+	"sync"
+	// ... rest of imports
+)
+// ... other code ...
+	// Warn if auth is disabled and server is reachable beyond loopback
+	isLoopback := host == "localhost"
+	if ip := net.ParseIP(host); ip != nil && ip.IsLoopback() {
+		isLoopback = true
+	}
+	if s.tokenAuth != nil && !s.tokenAuth.IsEnabled() && !isLoopback {
+		log.Warn().Str("bind", fmt.Sprintf("%s:%d", host, port)).Msg("auth: server listening without authentication on non-loopback address — set ENGRAM_API_TOKEN to secure")
+	}

Conversation

thebtf commented Mar 16, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

HIGH fixes

MEDIUM fixes

Kept outside auth (public, read-only)

Test plan

Summary by CodeRabbit

Примечания к выпуску

Uh oh!

coderabbitai Bot commented Mar 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review failed

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Poem

Uh oh!

gemini-code-assist Bot commented Mar 16, 2026

Summary of Changes

Highlights

Footnotes

Uh oh!

thebtf commented Mar 16, 2026

Uh oh!

thebtf commented Mar 16, 2026

Uh oh!

thebtf commented Mar 16, 2026

Uh oh!

coderabbitai Bot commented Mar 16, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Mar 16, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Mar 16, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Mar 16, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot Mar 16, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Mar 16, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Mar 16, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

thebtf commented Mar 16, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Mar 16, 2026 •

edited

Loading