fix: post-merge cleanup — vault refactor + secret detection by thebtf · Pull Request #4 · thebtf/engram

thebtf · 2026-03-12T00:20:04Z

Summary

C2 tech debt: Move vault from global sync.Once to Server struct field for test isolation
Secret detection: Activate privacy.RedactSecrets() at store_memory and hook event ingestion boundaries (warn-and-redact pattern)
TECHNICAL_DEBT.md: Mark P1 secrets item as RESOLVED

Changes

internal/mcp/server.go: Add vault, vaultInitErr, vaultOnce fields to Server
internal/mcp/tools_credential.go: Convert getVault() from free function to Server method, remove globals
internal/mcp/tools_memory.go: Add secret redaction before store
internal/worker/handlers_ingest.go: Add secret redaction for tool input/result strings
TECHNICAL_DEBT.md: Updated

Test plan

go build ./internal/mcp/ ./internal/worker/ ./cmd/worker/ passes
go vet clean on modified packages
Existing credential tool behavior unchanged (lazy vault init preserved)
Secrets in store_memory content are redacted, not rejected

Summary by CodeRabbit

Примечания к выпуску

Улучшения безопасности
- Реализована автоматическая обнаружение и маскирование конфиденциальных данных перед сохранением и обработкой.
- Система теперь выдает предупреждение вместо ошибки при попытке сохранить конфиденциальные данные, автоматически очищая содержимое.
- Добавлена предварительная очистка конфиденциальных данных на этапе приема перед основной обработкой конвейера.
Управление учетными данными
- Расширена поддержка управления хранилищем учетных данных с улучшенным отслеживанием источников ключей.

Replace global vaultOnce/sharedVault/vaultInitErr with Server struct fields. Vault is still lazily initialized on first credential tool use, but now scoped to the server instance for better test isolation.

Wire existing privacy.RedactSecrets() into store_memory handler and hook event ingestion (handlers_ingest.go). Warn-and-redact pattern: log detection at WARN level, redact secrets before storage. Resolves TECHNICAL_DEBT.md P1: privacy/secrets.go dead code activation.

coderabbitai · 2026-03-12T00:20:17Z

Caution

Review failed

Pull request was closed or merged during review

Walkthrough

Добавлена поддержка хранилища (vault) к серверу MCP путем переноса состояния из глобальных переменных в поля структуры Server. Реализована обнаружение и удаление секретных данных в нескольких точках обработки: при сохранении памяти и при приеме событий.

Changes

Cohort / File(s)	Summary
Документация технического долга `TECHNICAL_DEBT.md`	Добавлены разделы о разрешении проблем обнаружения приватных данных с реализацией функций ContainsSecrets(), RedactSecrets(), SanitizeObservation(), а также расширены примечания о хранилище учетных данных с документированием источника ключей и проблем шифрования.
Интеграция хранилища в структуру Server `internal/mcp/server.go`	Добавлены три новых поля для поддержки хранилища: `vault`, `vaultInitErr`, `vaultOnce`.
Рефакторинг доступа к хранилищу `internal/mcp/tools_credential.go`	Преобразована функция доступа к хранилищу из глобальной функции в метод Server; сигнатура изменена на `(s *Server) getVault()`, все места вызова обновлены для использования метода получателя.
Обработка секретов при сохранении памяти `internal/mcp/tools_memory.go`	Изменено поведение при обнаружении секретов с отклонения запроса на предупреждение и удаление секретов перед сохранением.
Удаление секретов в конвейере приема `internal/worker/handlers_ingest.go`	Добавлен шаг предварительной обработки для проверки и удаления секретов из строк tool_input и tool_result перед обработкой конвейером.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Poem

🐰 Хранилище теперь в структуре живёт,
Секреты везде редактируются, поёт!
От памяти к ивентам, везде стража стоит,
Глобалы уходят, метод мне служит—
Безопаснее станет, куда ни глядишь! 🔐

🚥 Pre-merge checks | ✅ 3

✅ Passed checks (3 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The pull request title accurately describes the main changes: vault refactoring (moving from global state to Server struct) and secret detection activation (warn-and-redact pattern in memory storage and event ingestion).
Docstring Coverage	✅ Passed	No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

📝 Generate docstrings (stacked PR)
📝 Generate docstrings (commit on current branch)

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch feat/post-merge-cleanup

Warning

There were issues while running some tools. Please review the errors and either fix the tool's configuration or disable the tool if it's a critical failure.

🔧 golangci-lint (2.5.0)

Error: can't load config: unsupported version of the configuration: "" See https://golangci-lint.run/docs/product/migration-guide for migration instructions
The command is terminated due to an error: can't load config: unsupported version of the configuration: "" See https://golangci-lint.run/docs/product/migration-guide for migration instructions

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

gemini-code-assist · 2026-03-12T00:20:29Z

Summary of Changes

Hello, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly enhances the system's robustness and security posture. It improves the testability of the vault component by isolating its state within the server instance and strengthens data privacy by implementing proactive secret redaction at critical data entry points. These changes ensure that sensitive information is handled more securely and that the system's architecture is more maintainable.

Highlights

Vault Refactor: The vault initialization mechanism has been refactored from a global sync.Once to a field within the Server struct, improving test isolation and reducing global state.
Secret Detection Activated: Secret detection and redaction have been activated at key event ingestion boundaries and within the store_memory handler. Content identified as containing secrets will now be redacted with a warning, rather than being rejected outright.
Technical Debt Resolution: The TECHNICAL_DEBT.md file has been updated to mark a P1 item related to unused privacy/secret detection functions as resolved, reflecting their integration into production code paths.

Changelog

TECHNICAL_DEBT.md
- Added a new section titled "Privacy / Secret Detection" to document the resolution of a P1 technical debt item.
- Marked the P1 item "privacy/secrets.go functions never called from production code" as RESOLVED.
- Provided details on the resolution, including changes in internal/mcp/tools_memory.go and internal/worker/handlers_ingest.go.
internal/mcp/server.go
- Imported the sync and crypto packages.
- Added vault *crypto.Vault, vaultInitErr error, and vaultOnce sync.Once fields to the Server struct.
internal/mcp/tools_credential.go
- Removed the global sharedVault, vaultInitErr, and vaultOnce variables.
- Converted the getVault() function into a method (s *Server) getVault().
- Updated calls to getVault() within handleStoreCredential, handleGetCredential, and handleVaultStatus to use s.getVault().
internal/mcp/tools_memory.go
- Modified the logic in handleStoreMemory to redact secrets using privacy.RedactSecrets() and log a warning, instead of returning an error when privacy.ContainsSecrets() is true.
internal/worker/handlers_ingest.go
- Imported the privacy package.
- Added logic to check for and redact secrets in toolInputStr and toolResultStr using privacy.ContainsSecrets() and privacy.RedactSecrets() before further pipeline processing.

Activity

No human activity has been recorded on this pull request yet.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for GitHub and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request introduces two significant improvements. First, it refactors the vault initialization by moving it from a global singleton to a field within the Server struct. This is an excellent change that improves encapsulation and test isolation. Second, it activates secret detection and redaction at key data ingestion points (store_memory and ingestEvent), changing the behavior from rejection to a more robust warn-and-redact pattern. This enhances data privacy without causing data loss. The changes are well-implemented. I have one suggestion to reduce code duplication in the secret redaction logic.

gemini-code-assist · 2026-03-12T00:22:05Z

+	// Redact secrets from tool input/result before any pipeline processing.
+	if privacy.ContainsSecrets(toolInputStr) {
+		log.Warn().Str("tool", req.ToolName).Msg("ingest: tool_input contains secrets — redacting before pipeline processing")
+		toolInputStr = privacy.RedactSecrets(toolInputStr)
+	}
+	if privacy.ContainsSecrets(toolResultStr) {
+		log.Warn().Str("tool", req.ToolName).Msg("ingest: tool_result contains secrets — redacting before pipeline processing")
+		toolResultStr = privacy.RedactSecrets(toolResultStr)
+	}


The logic for redacting secrets in toolInputStr and toolResultStr is duplicated. You can refactor this into a local helper function to improve readability and maintainability. This would also make it easier to add redaction for more fields in the future.

Suggested change

// Redact secrets from tool input/result before any pipeline processing.

if privacy.ContainsSecrets(toolInputStr) {

log.Warn().Str("tool", req.ToolName).Msg("ingest: tool_input contains secrets — redacting before pipeline processing")

toolInputStr = privacy.RedactSecrets(toolInputStr)

}

if privacy.ContainsSecrets(toolResultStr) {

log.Warn().Str("tool", req.ToolName).Msg("ingest: tool_result contains secrets — redacting before pipeline processing")

toolResultStr = privacy.RedactSecrets(toolResultStr)

}

// Redact secrets from tool input/result before any pipeline processing.

redactAndLog := func(s, fieldName string) string {

if privacy.ContainsSecrets(s) {

log.Warn().Str("tool", req.ToolName).Msgf("ingest: %s contains secrets — redacting before pipeline processing", fieldName)

return privacy.RedactSecrets(s)

}

return s

}

toolInputStr = redactAndLog(toolInputStr, "tool_input")

toolResultStr = redactAndLog(toolResultStr, "tool_result")

…ncation Extract splitIntoChunks, inferType, loadMarker, saveMarker, discoverMemoryFiles, safeReadFile, fileExists into utils/memory-files.ts for reuse by both memory-migrate tool and file-watcher service. Remove artificial 900-char content truncation on import — engram server stores full TEXT without limits. Fix BUG #4: dry run output now shows summary + first 5 chunks instead of flooding with all chunks.

* feat: accept agent_id param in context endpoints Allow OpenClaw agents (which lack filesystem context) to use agent_id as project scope in handleSearchByPrompt and handleContextInject. Backward-compatible: existing Claude Code hooks using project param are unaffected. * feat: OpenClaw engram plugin — persistent memory for AI gateway agents TypeScript plugin connecting OpenClaw's AI gateway to engram via REST API. Core features: - Dual-tier context injection (session-level static + per-turn dynamic) - Automatic self-learning via tool event ingestion and transcript backfill - 3-strike passive availability detection with 60s cooldown - agentId-first identity with git remote fallback - Zod-validated config with JSON Schema export Hooks: session_start, before_prompt_build, after_tool_call, before_compaction, session_end Tools: engram_search, engram_remember, engram_decisions Commands: /memory (status), /remember (quick store) Context formatter ported from plugin/engram/hooks/user-prompt.js: credential filter, Jaccard title dedup, type grouping, token budget, XML-tag rendering. * refactor: align OpenClaw engram plugin with real SDK types - Rewrite types/openclaw.ts with verified SDK shapes: register(api), AnyAgentTool, ToolFactory, PluginLogger, 24 hook names, CliProgram - Rewrite index.ts: id-based plugin, api.on() hooks, tool factory pattern, registerCli for 'memory' CLI subcommands - Refactor 3 tool files to factory pattern with TypeBox schemas - Add memory_search (alias), memory_store (text/content compat), memory_forget (bulk delete), memory_get (dual-mode: fs + engram) - Update 5 hooks: optional event fields, PluginLogger injection - Update 2 commands: OpenClawPluginCommandDefinition, name without slash - Add @sinclair/typebox dep, move zod to devDependencies - Add bulkDelete method to EngramRestClient - Version bump to 0.2.0, manifest updated with 7 tools + CLI section * feat: add memory_migrate tool for importing local memory files Adds memory_migrate tool, /migrate slash command, and `openclaw memory migrate` CLI subcommand. Discovers MEMORY.md + memory/**/*.md, splits by ## headers, bulk-imports into engram with SHA256-based idempotency marker. * fix: address code review findings in memory_migrate - Path traversal: validate resolved path stays within workspaceDir - Marker idempotency: only save marker on full success, not partial failure - Symlink safety: skip symlinks + max depth guard in findMdFiles - Code fence awareness: track fenced blocks to avoid false ## splits - Truncation visibility: warn about content truncation in dry-run and report * fix: address remaining low-priority review findings - Schema drift guard: compile-time check that Zod and TypeBox field names match - Marker race condition: atomic write via temp file + rename - CLI tool caching: reuse migrate tool instance instead of recreating per call * fix: correct plugin discovery — rename to "engram" and add openclaw extensions entry OpenClaw discovery ignores package.json "main" field. It uses: 1. package.json "openclaw.extensions" array for entry point resolution 2. Fallback to index.ts/index.js in root Added "openclaw": { "extensions": ["dist/index.js"] } and renamed package name from "openclaw-engram-plugin" to "engram" so idHint matches manifest id and config key. * fix: make register() synchronous and use SDK command handler signature Bug 1: register() was async — OpenClaw ignores Promise returns from register(), logging "async registration is ignored". Made synchronous. Bug 2: Commands used { execute(args, ctx) } shape but SDK expects { handler(ctx) } where handler is typeof "function". SDK checks `typeof command.handler !== "function"` and rejects with "Command handler must be a function". Also updated PluginCommandContext and PluginCommandResult types to match real SDK (ctx.args is raw string, result uses { text } not { output }). * fix: resolve workspaceDir for migrate command via api.resolvePath * fix: use config.workspaceDir with ~/.openclaw/workspace fallback for migrate command * refactor: extract shared memory file utilities and remove content truncation Extract splitIntoChunks, inferType, loadMarker, saveMarker, discoverMemoryFiles, safeReadFile, fileExists into utils/memory-files.ts for reuse by both memory-migrate tool and file-watcher service. Remove artificial 900-char content truncation on import — engram server stores full TEXT without limits. Fix BUG #4: dry run output now shows summary + first 5 chunks instead of flooding with all chunks. * feat: add file watcher service for live memory sync Add chokidar-based FileWatcherService that watches MEMORY.md and memory/**/*.md in the agent workspace. On file changes, debounces 1500ms, computes SHA256 hash, and bulk-imports new/changed chunks into engram. Reuses shared chunking and marker logic from utils/memory-files.ts. Replaces memory-core's built-in chokidar watcher, enabling engram to fully occupy the exclusive memory plugin slot. Review fixes applied: - stopped flag prevents callbacks after stop() - inFlight collision reschedules instead of dropping events - projectId cached in constructor (avoids git execSync per sync) * fix: use service.id instead of service.name to match SDK expectation * fix: wrap bulkImport body in {project, observations} to match server API * fix: replace invalid observation type 'context' with 'change' Engram API only accepts: bugfix, feature, refactor, discovery, decision, change. The plugin was using 'context' as default type which caused bulk-import to reject observations with "invalid type 'context'". * fix: address code review findings from CodeRabbit and Gemini - extractOrigin preserves URL subpath (client.ts) - Replace hardcoded 'cli' project fallback with workspace-derived ID - force flag no longer wipes migration marker state (memory-migrate.ts) - Path traversal protection via relative() instead of startsWith - Use event.sessionId instead of agentId for session tracking - Enforce search limit parameter (engram-search.ts) - Restrict memory_get to .md files only (memory-get.ts) - Safe serialization + .catch() for fire-and-forget calls - Fix tool name references in formatter reminder text - Validate URL scheme is http/https (config.ts) - Log warning on marker save failure (memory-files.ts) * fix: address code review findings from CodeRabbit and Gemini (round 2) - package.json: align @types/node to ^18.19.0 to match engines.node >=18 - client.ts: return null instead of {} for empty response body; fix extractOrigin fallback to preserve path segments - identity.ts: add module-level memoization cache for getGitRemoteID to avoid blocking execSync on hot paths; derive projectId from remote URL (not local dir name) for stable cross-checkout IDs - formatter.ts: fix token budget bypass — oversized observations now skipped with continue instead of always inserting the first entry - before-prompt-build.ts: wrap searchContext in try/catch to prevent hook lifecycle disruption on network errors - session-end.ts: use event.sessionId only (no agentId fallback) to avoid conflating backfill across sessions - availability.ts: fix recordSuccess wasUnavailable detection (use unavailableSince directly, not isAvailable); fix recordFailure to re-trip circuit in half-open state - openclaw.plugin.json: add uiHints sensitive:true to token field --------- Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com>

thebtf added 2 commits March 12, 2026 03:19

refactor: move vault from global state to Server struct (C2 tech debt)

fe77649

Replace global vaultOnce/sharedVault/vaultInitErr with Server struct fields. Vault is still lazily initialized on first credential tool use, but now scoped to the server instance for better test isolation.

gemini-code-assist Bot reviewed Mar 12, 2026

View reviewed changes

thebtf merged commit dcb9fe0 into main Mar 12, 2026
1 of 2 checks passed

This was referenced Mar 16, 2026

v0.5.0: vault integration, legacy cleanup, chuck features #9

Merged

security: auth consolidation, vault singleton, cwd hardening, LLM prompt sanitization #10

Merged

feat: include SHA-256 hash prefix in redacted secrets (FR-9) #12

Merged

thebtf deleted the feat/post-merge-cleanup branch May 7, 2026 06:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: post-merge cleanup — vault refactor + secret detection#4

fix: post-merge cleanup — vault refactor + secret detection#4
thebtf merged 2 commits into
mainfrom
feat/post-merge-cleanup

thebtf commented Mar 12, 2026 •

edited by coderabbitai Bot

Loading

Uh oh!

coderabbitai Bot commented Mar 12, 2026 •

edited

Loading

Review failed

Uh oh!

gemini-code-assist Bot commented Mar 12, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot Mar 12, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

thebtf commented Mar 12, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Test plan

Summary by CodeRabbit

Примечания к выпуску

Uh oh!

coderabbitai Bot commented Mar 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review failed

Walkthrough

Changes

Estimated code review effort

Poem

Uh oh!

gemini-code-assist Bot commented Mar 12, 2026

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

thebtf commented Mar 12, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Mar 12, 2026 •

edited

Loading