feat: plugin tool consolidation — all phases (FR-1 through FR-6) by thebtf · Pull Request #112 · thebtf/engram

thebtf · 2026-03-28T00:18:01Z

Summary

Phase 1: Server — FR-1 (Remove 7 Redundant Tools)

Remove 7 tools from tools/list (68 → 61). Dispatch aliases retained for backward compat.
Removed: get_context_timeline, get_timeline_by_query, get_recent_context, find_by_file_context, get_observation_relationships, get_graph_neighbors, doc_update

Phase 2: OpenClaw — FR-2 (Bug Fixes)

engram_decisions now uses /api/decisions/search (was searchContext + client-side filter)
memory_forget defaults to suppress (reversible) instead of archive. permanent=true for archive.
Server: add suppress action to bulk-status endpoint
Bump openclaw-engram to 2.0.9

Phase 3: OpenClaw — FR-3 (Expand Tools: 8 → 17)

9 new tools: engram_rate, engram_suppress, engram_outcome, engram_find_by_file, engram_timeline, engram_changes, engram_how_it_works, engram_vault_store, engram_vault_get
All tool descriptions include trigger conditions (NFR-3)
New client methods: rateObservation, setSessionOutcome, getFileContext, getTimeline, storeCredential, getCredential
Bump openclaw-engram to 2.0.10

Phase 5: CC Plugin — FR-5+FR-6

stop.js: Replace injected-observations API with single retrospective API call
statusline.js: Add effectiveness metric with 60s cache (shows "eff:72%" or "eff:--")

Deferred: Phase 4 (OpenClaw lifecycle hooks — FR-4)

session_end outcome detection, utility tracking, before_tool_call file context
Requires more complex heuristic design — separate PR

Test plan

go build ./... passes
go test ./internal/mcp/... passes
npx tsc --noEmit passes for openclaw-engram
tools/list returns 61 tools (was 68)
Removed tool names still work via dispatch
New OpenClaw tools compile and are registered (17 total)
Statusline shows effectiveness metric

Summary by CodeRabbit

Примечания к выпуску

Новые функции
- Добавлены инструменты: подавление наблюдений, рейтинг, поиск контекста по файлу, таймлайн, запись исхода сессии и хранилище учётных данных; пресеты «changes» и «how_it_works».
Улучшения
- Хук before-tool-call подставляет контекст файла перед редактирующими операциями.
- Статусная строка показывает метку эффективности с 60‑секундным кэшем.
- Фиксированное уменьшение видимого списка MCP-инструментов с сохранением обратной совместимости.
Исправления
- Добавлено массовое действие "suppress" для подавления наблюдений и инвалидация счётчиков/видимости при подавлении.
Документация
- Добавлены отчёты, спецификации и план консолидации инструментов.

- Gap audit report: plugin vs API analysis (68 MCP tools, 130 REST endpoints) - New spec: plugin-tool-consolidation (6 FR, 4 NFR, 6 US) - Plan: 5 phases, 34 tasks, analyze remediation applied - Closed old mcp-tools-refactoring spec (FR7/FR8 → TECHNICAL_DEBT)

Remove from tools/list: get_context_timeline, get_timeline_by_query, get_recent_context, find_by_file_context, get_observation_relationships, get_graph_neighbors, doc_update. All 7 tools remain callable via dispatch aliases in handleCallTool for backward compatibility. Reduces tool count from 68 to 61. Updates tests to match new tool set.

- engram_decisions now uses /api/decisions/search instead of searchContext + client-side type filter (B1 from audit) - memory_forget defaults to suppress (reversible) instead of archive. Add permanent=true parameter for permanent archival (B2 from audit) - Add suppressObservation() client method using bulk-status suppress action - Add "suppress" action to server bulk-status handler - Bump openclaw-engram to 2.0.9

gemini-code-assist · 2026-03-28T00:18:05Z

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

coderabbitai · 2026-03-28T00:18:16Z

Caution

Review failed

Pull request was closed or merged during review

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

@coderabbitai resume to resume automatic reviews.
@coderabbitai review to trigger a single review.

Use the checkboxes below for quick actions:

▶️ Resume reviews
🔍 Trigger review

Walkthrough

Добавлены спецификации и отчёты по консолидации MCP‑инструментов; сервер сократил рекламируемый список инструментов, сохранив алиасы; OpenClaw клиент расширен новыми REST‑методами и инструментами; добавлено bulk‑действие suppress в воркере; плагины получили before_tool_call, session‑end outcome запись и statusline эффективность.

Changes

Cohort / File(s)	Summary
Документация и планы `/.agent/reports/plugin-api-gap-audit-2026-03-28.md`, `/.agent/specs/mcp-tools-refactoring.md`, `/.agent/specs/plugin-tool-consolidation/plan.md`, `/.agent/specs/plugin-tool-consolidation/spec.md`, `/.agent/specs/plugin-tool-consolidation/tasks.md`, `TECHNICAL_DEBT.md`	Добавлены аудиты, спецификации, план и задачи по консолидации MCP‑инструментов, правила поведения adoption и namespace‑префиксы.
MCP регистрация и тесты `internal/mcp/server.go`, `internal/mcp/server_test.go`	Удалены из выдачи `tools/list` несколько инструментов; dispatch‑алиасы сохранены; тесты обновлены для новых ожиданий.
Worker: bulk suppress `internal/worker/handlers_import_export.go`	Добавлено действие `"suppress"` в `handleBulkStatusUpdate`, выполняющее UPDATE `observations.is_suppressed = TRUE` по id, собирающее успехи/ошибки и инвалидацию кэша видимости.
OpenClaw клиент: API методы и версия `plugin/openclaw-engram/src/client.ts`, `plugin/openclaw-engram/package.json`	Клиент расширен методами (`suppressObservation`, `rateObservation`, `setSessionOutcome`, `getFileContext`, `getTimeline`, `storeCredential`, `getCredential`) и поддержкой `preset` в `searchContext`; пакетная версия обновлена.
OpenClaw: новые инструменты и hooks `plugin/openclaw-engram/src/index.ts`, `plugin/openclaw-engram/src/tools/...`, `plugin/openclaw-engram/src/hooks/before-tool-call.ts`, `plugin/openclaw-engram/src/hooks/session-end.ts`	Добавлены и зарегистрированы новые инструменты (find‑by‑file, timeline, rate, suppress, outcome, presets, vault store/get и др.); реализован `before_tool_call` для подстановки контекста по файлу; `session-end` всегда вычисляет sessionId и делает fire‑and‑forget запись исхода.
OpenClaw: decisions и memory_forget `plugin/openclaw-engram/src/tools/engram-decisions.ts`, `plugin/openclaw-engram/src/tools/memory-forget.ts`	`engram-decisions` переключён на `searchDecisions` с изменённым форматированием; `memory_forget` по умолчанию выполняет suppress и поддерживает `permanent=true` для архивирования/удаления; входной id теперь строго положительный integer.
CC плагин: statusline и stop‑hook `plugin/engram/hooks/statusline.js`, `plugin/engram/hooks/stop.js`	Statusline добавляет получение `eff:` метки с 60s кэшем; stop‑handler использует `/api/sessions/{id}/injections` и отправляет utility на `/api/observations/{id}/utility` с более гибкой валидацией id.
DB: session outcome persistence `internal/db/gorm/session_store.go`	`UpdateSessionOutcome` теперь записывает outcome только если поле ранее пусто (условие `outcome IS NULL OR outcome = ''`).

Sequence Diagram(s)

sequenceDiagram
    participant Agent as OpenClaw Tool
    participant Client as EngramRestClient
    participant API as Server /api
    participant Worker as internal worker
    participant DB as Database

    Agent->>Client: suppressObservation(id)
    Client->>API: POST /api/observations/bulk-status { action: "suppress", ids: [id] }
    API->>Worker: route to handleBulkStatusUpdate
    Worker->>DB: UPDATE observations SET is_suppressed=TRUE WHERE id=?
    DB-->>Worker: result
    Worker-->>API: { updated, failed }
    API-->>Client: response
    Client-->>Agent: boolean success

sequenceDiagram
    participant Plugin as OpenClaw Hook
    participant Client as EngramRestClient
    participant API as Server /api
    participant Sessions as Sessions handler
    participant DB as Database

    Plugin->>Client: setSessionOutcome(sessionId, outcome, reason)
    Client->>API: POST /api/sessions/{id}/outcome
    API->>Sessions: record outcome
    Sessions->>DB: insert/update outcome
    DB-->>Sessions: result
    Sessions-->>API: success/failure
    API-->>Client: response
    Client-->>Plugin: boolean success

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Possibly related PRs

feat: causal chain linking — follows + prompted_by (FR-4, FR-5) #56 — сильно связан: добавление /api/context/by-file, getFileContext и before‑tool‑call hook (перекрытие функционала).
feat: closed-loop learning Phase 1 — outcome tracking + injection binding #83 — перекрытие по записи исхода сессии (/api/sessions/{id}/outcome) и связанному поведению плагинов.
feat: Phase 2+4 — Dashboard endpoints + MCP tool aliases #58 — пересечение по изменениям в регистрации/алиасах MCP‑инструментов в internal/mcp/server.go.

Поэма

🐰 В норке код шуршит, я строки считаю,
Инструменты сложил, алиасы оставил,
Тихо подавляю — в базе ставлю флаг,
Сессии шепчут: исход записал — так та‑дак!
Морковку укушу — и коммит готов. 🥕

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 15.63% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title accurately summarizes the main change: a multi-phase plugin tool consolidation feature spanning phases 1-6 (FR-1 through FR-6), which is the primary scope of this PR.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

📝 Generate docstrings

Create stacked PR
Commit on current branch

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch feat/plugin-tool-consolidation

Warning

There were issues while running some tools. Please review the errors and either fix the tool's configuration or disable the tool if it's a critical failure.

🔧 golangci-lint (2.11.4)

Error: can't load config: unsupported version of the configuration: "" See https://golangci-lint.run/docs/product/migration-guide for migration instructions
The command is terminated due to an error: can't load config: unsupported version of the configuration: "" See https://golangci-lint.run/docs/product/migration-guide for migration instructions

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

thebtf · 2026-03-28T00:18:35Z

@coderabbitai review

thebtf · 2026-03-28T00:18:38Z

@gemini-code-assist review

coderabbitai · 2026-03-28T00:18:40Z

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

thebtf · 2026-03-28T00:18:41Z

@codex review

gemini-code-assist · 2026-03-28T00:18:41Z

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

…, vault (FR-3) Add 9 new tools to openclaw-engram: - engram_rate: rate observations as useful/not useful - engram_suppress: reversible soft-hide from search - engram_outcome: record session outcome for closed-loop learning - engram_find_by_file: check what engram knows BEFORE modifying a file - engram_timeline: fetch temporal observation context - engram_changes: search preset for recent code changes - engram_how_it_works: search preset for architecture/design - engram_vault_store: securely store encrypted credentials - engram_vault_get: retrieve and decrypt credentials All tool descriptions include trigger conditions (NFR-3). Add client methods: rateObservation, setSessionOutcome, getFileContext, getTimeline, storeCredential, getCredential. Add preset param to searchContext type. Bump openclaw-engram to 2.0.10. Total tools: 17 (was 8).

…R-5, FR-6) - stop.js: Replace /api/sessions/{id}/injected-observations + individual utility calls with single /api/sessions/{id}/injections (retrospective API). Fewer HTTP calls, enriched response with effectiveness data. - statusline.js: Add learning effectiveness indicator with 60s client cache. Shows "eff:72%" (high tier percentage) or "eff:--" when no data. Fetches /api/learning/effectiveness-distribution in parallel with stats.

coderabbitai

Actionable comments posted: 5

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

internal/worker/handlers_import_export.go (1)
646-650: ⚠️ Potential issue | 🟠 Major

Инвалидацию кэша нужно выполнять и для suppress.

is_suppressed влияет на видимость данных в выборках, но сейчас кэш сбрасывается только при archive. Это оставляет риск устаревших счётчиков/списков.
💡 Предлагаемая правка
-// Invalidate cache for archive action (affects observation counts)
-if req.Action == "archive" && updated > 0 {
+// Invalidate cache for actions that affect visibility/counts.
+if (req.Action == "archive" || req.Action == "suppress") && updated > 0 {
 	// No project info available, invalidate all caches
 	s.invalidateAllObsCountCache()
}
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@internal/worker/handlers_import_export.go` around lines 646 - 650, The cache
invalidation currently only triggers when req.Action == "archive" and updated >
0; extend this to also invalidate when req.Action == "suppress" (since changing
is_suppressed affects visibility and counts). Update the conditional around
s.invalidateAllObsCountCache() to check for req.Action == "archive" OR
req.Action == "suppress" (and still require updated > 0) so both archive and
suppress changes clear the observation count cache.

🧹 Nitpick comments (4)

plugin/openclaw-engram/src/tools/engram-decisions.ts (1)
65-80: Форматирование решений: реализация корректна, но отсутствует Markdown-экранирование.

Функция formatDecisions корректно обрабатывает все опциональные поля (title, narrative, concepts, rejected) с защитными проверками. Однако, если d.title или элементы concepts/rejected содержат спецсимволы Markdown (например, #, *, |), вывод может быть искажён.
♻️ Опционально: экранирование спецсимволов в заголовках
 function formatDecisions(decisions: DecisionSearchObservation[]): string {
   let out = '# Relevant Architectural Decisions\n\n';
   decisions.forEach((d, i) => {
-    out += `## ${i + 1}. ${d.title ?? 'Untitled'}\n`;
+    const title = (d.title ?? 'Untitled').replace(/[#*`]/g, '\\$&');
+    out += `## ${i + 1}. ${title}\n`;
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@plugin/openclaw-engram/src/tools/engram-decisions.ts` around lines 65 - 80,
The output from formatDecisions can be corrupted by unescaped Markdown
characters in fields like d.title, d.narrative and the arrays
d.concepts/d.rejected; add a small helper (e.g., escapeMarkdown) that
replaces/escapes Markdown metacharacters (such as # * _ ` | []() - > etc.) and
apply it to d.title, each element of concepts and rejected, and to d.narrative
(or at least sanitize line-leading characters) before building the string in
formatDecisions so the generated Markdown remains literal and stable.
plugin/openclaw-engram/src/tools/memory-forget.ts (1)
51-69: Непоследовательная валидация ID между путями suppress и archive.

Путь suppress (линии 61-64) явно проверяет, что ID — положительное число, и возвращает понятное сообщение Invalid observation ID. Путь archive (линия 52) передаёт строку в bulkDelete, которая молча отфильтровывает невалидные ID и возвращает deleted: 0.

Результат: для одного и того же невалидного ID ("abc") пользователь получает разные сообщения:

permanent=false → Invalid observation ID: abc

permanent=true → Observation not found or already archived: abc
♻️ Унификация валидации ID
     if (parsed.data.permanent) {
+      const numericId = Number(parsed.data.id);
+      if (Number.isNaN(numericId) || numericId <= 0) {
+        return `Invalid observation ID: ${parsed.data.id}`;
+      }
       const response = await client.bulkDelete([parsed.data.id]);
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@plugin/openclaw-engram/src/tools/memory-forget.ts` around lines 51 - 69,
Сделайте валидацию ID единой для обоих путей: перед проверкой
parsed.data.permanent распарсьте parsed.data.id в numericId и если
Number.isNaN(numericId) или numericId <= 0 возвращайте `Invalid observation ID:
${parsed.data.id}`; затем используйте numericId при вызове
client.suppressObservation и приведите аргументы client.bulkDelete к единому
виду (передавать строку или числовой ID последовательно — лучше передавать
numericId как число или как строку, но только после успешной валидации), чтобы
для одного и того же невалидного ID оба пути возвращали одно и то же сообщение и
корректно обрабатывали response.deleted в случае archive.
internal/mcp/server.go (1)
623-625: Синхронизируйте initialize-инструкции с новой витриной инструментов.

Вы убрали legacy-инструменты из tools/list, но в engramInstructions они по-прежнему рекламируются как основные. Это сохраняет когнитивный шум и частично нивелирует консолидацию.
♻️ Что лучше поменять в тексте инструкций
-- `get_recent_context`, `get_context_timeline`, `get_timeline_by_query`
+- `timeline` (mode: `recent` / `anchor` / `query`)

-- `get_observation_relationships`, `get_graph_neighbors`
+- `graph_query` (mode: `relationships` / `neighbors` / `related`)
Also applies to: 943-944
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@internal/mcp/server.go` around lines 623 - 625, Синхронизируйте текст
инструкций инициализации с новой витриной инструментов: в блоке initialize и в
переменной engramInstructions уберите упоминания legacy-инструментов, которые
удалены из tools/list (например ссылки/описания get_recent_context,
get_context_timeline, get_timeline_by_query), обновите описание так, чтобы
объявлялись только текущие инструменты и их алиасы/консолидации (timeline), и
примените те же правки для аналогичного текста в том же файле (также
затрагивается участок, отмеченный 943-944).
internal/mcp/server_test.go (1)
424-427: Добавьте отдельный позитивный тест для legacy alias dispatch.

После удаления старых имён из tools/list полезно явно проверить, что callTool с валидными аргументами для get_recent_context / get_context_timeline / get_timeline_by_query всё ещё работает по алиасам, как требует цель PR.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@internal/mcp/server_test.go` around lines 424 - 427, Add a new positive unit
test that verifies legacy alias dispatch still resolves to the current timeline
tools: callTool should accept the legacy names for get_recent_context,
get_context_timeline and get_timeline_by_query and succeed; implement a test in
server_test.go that constructs valid arguments for each legacy alias, invokes
callTool (same helper used by other tests) and asserts the returned result/no
error, referencing callTool and the target functions get_recent_context,
get_context_timeline, get_timeline_by_query so the test explicitly verifies
aliases still work after removing old names from tools/list.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In @.agent/specs/plugin-tool-consolidation/plan.md:
- Line 152: Обновите путь к спецификации на строке, где указан
'plugin-tool-consolidation/spec.md', чтобы он соответствовал общему формату и не
ломался при открытии из корня репозитория: замените его на
'.agent/specs/plugin-tool-consolidation/spec.md' в файле plan.md (строка с
упоминанием пути 'plugin-tool-consolidation/spec.md'), проверив, что ссылка
корректно открывается из корня проекта.
- Around line 215-221: The versioning statement under the "Version bumps
(Constitution `#7`, `#15` — unified versioning)" block is inconsistent: the line
"All versions stay in 2.0.x patch range" conflicts with the later note that the
release moves to 2.1.0 after Phase 4; reconcile these by choosing one canonical
strategy and updating the document accordingly—either change the “All versions
stay in 2.0.x” sentence to describe the planned bump to 2.1.0 after Phase 4, or
change the Phase 4/Phase 5 notes that mention 2.1.0 to remain within 2.0.x;
ensure you update both the Version bumps heading and the Phase 4/Phase 5 entries
(references: the "Version bumps (Constitution `#7`, `#15` — unified versioning)"
section and the Phase 1..Phase 5 lines mentioning openclaw-engram/server tags)
so the plan states a single, unambiguous versioning policy.

In @.agent/specs/plugin-tool-consolidation/spec.md:
- Around line 184-187: Update the two incorrect boundary-case bullets: change
the first bullet that mentions "suppress=true and permanent=true" to reference
only the existing memory_forget parameter "permanent=true" (matching FR-2b and
the memory_forget API), and fix the second bullet which incorrectly names the
hook "after_tool_call" to use the correct hook name "before_tool_call" (as
defined in FR-4c); ensure the bullets mention the symbols memory_forget and
before_tool_call so the reviewer can locate and verify the corrections.

In `@internal/worker/handlers_import_export.go`:
- Around line 630-639: В ветке "suppress" не учитывается значение RowsAffected:
вы всегда инкрементируете updated даже когда UPDATE успешно выполняется но не
затронуло строку (id не найден). Исправьте вызов
s.observationStore.GetDB().WithContext(ctx).Exec(...).Error на получение
результата в переменную (result := ...Exec(...)), проверяйте сначала
result.Error, а затем result.RowsAffected; увеличивайте updated только если
RowsAffected > 0, иначе увеличивайте failed и добавляйте в errors сообщение вида
"id X: not found" или аналогичное; см. использование req.IDs, updated, failed,
errors и метода Exec для поиска места правки.

In `@TECHNICAL_DEBT.md`:
- Line 152: Update the spec link in TECHNICAL_DEBT.md: find the occurrence of
"plugin-tool-consolidation/spec.md" (near the
".agent/reports/plugin-api-gap-audit-2026-03-28.md" reference) and replace it
with the rooted path ".agent/specs/plugin-tool-consolidation/spec.md" so the
link resolves consistently from the repository root; ensure any other
occurrences of the same spec reference in the file are updated to the same
rooted form for consistency.

---

Outside diff comments:
In `@internal/worker/handlers_import_export.go`:
- Around line 646-650: The cache invalidation currently only triggers when
req.Action == "archive" and updated > 0; extend this to also invalidate when
req.Action == "suppress" (since changing is_suppressed affects visibility and
counts). Update the conditional around s.invalidateAllObsCountCache() to check
for req.Action == "archive" OR req.Action == "suppress" (and still require
updated > 0) so both archive and suppress changes clear the observation count
cache.

---

Nitpick comments:
In `@internal/mcp/server_test.go`:
- Around line 424-427: Add a new positive unit test that verifies legacy alias
dispatch still resolves to the current timeline tools: callTool should accept
the legacy names for get_recent_context, get_context_timeline and
get_timeline_by_query and succeed; implement a test in server_test.go that
constructs valid arguments for each legacy alias, invokes callTool (same helper
used by other tests) and asserts the returned result/no error, referencing
callTool and the target functions get_recent_context, get_context_timeline,
get_timeline_by_query so the test explicitly verifies aliases still work after
removing old names from tools/list.

In `@internal/mcp/server.go`:
- Around line 623-625: Синхронизируйте текст инструкций инициализации с новой
витриной инструментов: в блоке initialize и в переменной engramInstructions
уберите упоминания legacy-инструментов, которые удалены из tools/list (например
ссылки/описания get_recent_context, get_context_timeline,
get_timeline_by_query), обновите описание так, чтобы объявлялись только текущие
инструменты и их алиасы/консолидации (timeline), и примените те же правки для
аналогичного текста в том же файле (также затрагивается участок, отмеченный
943-944).

In `@plugin/openclaw-engram/src/tools/engram-decisions.ts`:
- Around line 65-80: The output from formatDecisions can be corrupted by
unescaped Markdown characters in fields like d.title, d.narrative and the arrays
d.concepts/d.rejected; add a small helper (e.g., escapeMarkdown) that
replaces/escapes Markdown metacharacters (such as # * _ ` | []() - > etc.) and
apply it to d.title, each element of concepts and rejected, and to d.narrative
(or at least sanitize line-leading characters) before building the string in
formatDecisions so the generated Markdown remains literal and stable.

In `@plugin/openclaw-engram/src/tools/memory-forget.ts`:
- Around line 51-69: Сделайте валидацию ID единой для обоих путей: перед
проверкой parsed.data.permanent распарсьте parsed.data.id в numericId и если
Number.isNaN(numericId) или numericId <= 0 возвращайте `Invalid observation ID:
${parsed.data.id}`; затем используйте numericId при вызове
client.suppressObservation и приведите аргументы client.bulkDelete к единому
виду (передавать строку или числовой ID последовательно — лучше передавать
numericId как число или как строку, но только после успешной валидации), чтобы
для одного и того же невалидного ID оба пути возвращали одно и то же сообщение и
корректно обрабатывали response.deleted в случае archive.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: fb16f160-cd55-473d-bf7a-1cfdfdfb1135

📥 Commits

Reviewing files that changed from the base of the PR and between 4faae60 and c937563.

📒 Files selected for processing (13)

.agent/reports/plugin-api-gap-audit-2026-03-28.md
.agent/specs/mcp-tools-refactoring.md
.agent/specs/plugin-tool-consolidation/plan.md
.agent/specs/plugin-tool-consolidation/spec.md
.agent/specs/plugin-tool-consolidation/tasks.md
TECHNICAL_DEBT.md
internal/mcp/server.go
internal/mcp/server_test.go
internal/worker/handlers_import_export.go
plugin/openclaw-engram/package.json
plugin/openclaw-engram/src/client.ts
plugin/openclaw-engram/src/tools/engram-decisions.ts
plugin/openclaw-engram/src/tools/memory-forget.ts

coderabbitai · 2026-03-28T00:25:18Z

+- Test: timeout scenario → no error propagated
+
+**PR:** `feat: openclaw lifecycle hooks (outcome, utility, file context)`
+


⚠️ Potential issue | 🟡 Minor

Уточните путь до spec, чтобы избежать битой ссылки.

В Line 152 путь plugin-tool-consolidation/spec.md выбивается из остального формата (.agent/specs/...) и легко ломается при открытии из корня репозитория.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In @.agent/specs/plugin-tool-consolidation/plan.md at line 152, Обновите путь к спецификации на строке, где указан 'plugin-tool-consolidation/spec.md', чтобы он соответствовал общему формату и не ломался при открытии из корня репозитория: замените его на '.agent/specs/plugin-tool-consolidation/spec.md' в файле plan.md (строка с упоминанием пути 'plugin-tool-consolidation/spec.md'), проверив, что ссылка корректно открывается из корня проекта.

coderabbitai · 2026-03-28T00:25:18Z

+- `memory_forget` with both `suppress=true` and `permanent=true` — `permanent` wins
+- Statusline learning endpoint may return empty data (no sessions with outcomes yet) — show "eff: --"
+- Removed tools called via MCP with invalid parameters — same error handling as before
+- OpenClaw `after_tool_call` for Write on non-existent file path — skip file context injection


⚠️ Potential issue | 🟡 Minor

Несоответствия в описании граничных случаев.

Линия 184: Упоминается suppress=true and permanent=true, но memory_forget имеет только параметр permanent. Спецификация в FR-2b (линии 59-60) корректна — там только permanent: true. Граничный случай следует переформулировать.

Линия 187: Упоминается after_tool_call, но FR-4c (линии 90-92) определяет хук как before_tool_call. Это опечатка.

📝 Исправление граничных случаев

-- `memory_forget` with both `suppress=true` and `permanent=true` — `permanent` wins +- `memory_forget` with invalid ID and `permanent=true` — returns "Invalid observation ID" error - Statusline learning endpoint may return empty data (no sessions with outcomes yet) — show "eff: --" - Removed tools called via MCP with invalid parameters — same error handling as before -- OpenClaw `after_tool_call` for Write on non-existent file path — skip file context injection +- OpenClaw `before_tool_call` for Write on non-existent file path — skip file context injection

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In @.agent/specs/plugin-tool-consolidation/spec.md around lines 184 - 187, Update the two incorrect boundary-case bullets: change the first bullet that mentions "suppress=true and permanent=true" to reference only the existing memory_forget parameter "permanent=true" (matching FR-2b and the memory_forget API), and fix the second bullet which incorrectly names the hook "after_tool_call" to use the correct hook name "before_tool_call" (as defined in FR-4c); ensure the bullets mention the symbols memory_forget and before_tool_call so the reviewer can locate and verify the corrections.

coderabbitai · 2026-03-28T00:25:18Z

+**What:** Agents use 2 of 68 MCP tools. Need always_inject behavioral rules that trigger tool usage at the right moments: `find_by_file` before Edit/Write, `decisions` before architectural choices, `rate_memory` after session.
+**Why deferred:** Plugin tool consolidation (FR-1 through FR-6) must land first so rules reference stable tool names.
+**Impact:** High — this is the actual adoption fix. Tool consolidation reduces noise, behavioral rules drive usage.
+**Context:** `.agent/reports/plugin-api-gap-audit-2026-03-28.md`, `plugin-tool-consolidation/spec.md`


⚠️ Potential issue | 🟡 Minor

Сделайте путь до spec единообразным и полным.

В Line 152 лучше указывать .agent/specs/plugin-tool-consolidation/spec.md, чтобы ссылка открывалась стабильно из корня репозитория.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@TECHNICAL_DEBT.md` at line 152, Update the spec link in TECHNICAL_DEBT.md: find the occurrence of "plugin-tool-consolidation/spec.md" (near the ".agent/reports/plugin-api-gap-audit-2026-03-28.md" reference) and replace it with the rooted path ".agent/specs/plugin-tool-consolidation/spec.md" so the link resolves consistently from the repository root; ensure any other occurrences of the same spec reference in the file are updated to the same rooted form for consistency.

- session_end: detect session outcome (success/partial/failure/abandoned) from conversation signals, record via /api/sessions/{id}/outcome. Handles gracefully when no DB session ID exists. - before_tool_call: inject file-context observations before Write/Edit tools using /api/context/by-file. 200ms timeout, non-blocking. - Register before_tool_call hook in index.ts. - Bump openclaw-engram to 2.0.11.

- Fix edge case: memory_forget has only permanent param, not suppress - Fix edge case: before_tool_call not after_tool_call - Fix plan: version tracking says 2.0.x not 2.1.0

coderabbitai

Actionable comments posted: 7

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

plugin/openclaw-engram/src/index.ts (1)
125-133: ⚠️ Potential issue | 🟠 Major

Добавьте новые tool names в registerTool().

Фабрика на Line 114-122 создаёт все 9 новых инструментов, но registerTool() всё ещё публикует только старые 8 имён. В таком виде FR-3 остаётся частично недоступной: engram_rate, engram_suppress, engram_outcome, engram_find_by_file, engram_timeline, engram_changes, engram_how_it_works, engram_vault_store и engram_vault_get не входят в заявленный surface плагина.
💡 Предлагаемое исправление
     api.registerTool(toolFactory, {
       names: [
         'engram_search', 'memory_search',
         'engram_remember', 'memory_store',
         'engram_decisions',
         'memory_forget',
         'memory_get',
         'memory_migrate',
+        'engram_rate',
+        'engram_suppress',
+        'engram_outcome',
+        'engram_find_by_file',
+        'engram_timeline',
+        'engram_changes',
+        'engram_how_it_works',
+        'engram_vault_store',
+        'engram_vault_get',
       ],
     });
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@plugin/openclaw-engram/src/index.ts` around lines 125 - 133, The
api.registerTool call is missing the newly created tool names from the
toolFactory; update the names array passed to api.registerTool to include the
additional tools created by the factory (engram_rate, engram_suppress,
engram_outcome, engram_find_by_file, engram_timeline, engram_changes,
engram_how_it_works, engram_vault_store, engram_vault_get) alongside the
existing names so the plugin surface accurately exposes every tool produced by
toolFactory.

🧹 Nitpick comments (1)

plugin/openclaw-engram/src/tools/engram-presets.ts (1)
62-62: Захардкоженный заголовок ограничивает расширяемость.

Тернарный оператор preset === 'changes' ? 'Recent Changes' : 'How It Works' предполагает только два пресета. Если в будущем добавятся другие пресеты (например, decisions), заголовок будет некорректным.
♻️ Предложение: передавать заголовок как параметр
 function createPresetTool(
   name: string,
   description: string,
   preset: string,
+  header: string,
   ctx: OpenClawPluginToolContext,
   client: EngramRestClient,
   config: PluginConfig,
 ): AnyAgentTool {
   // ...
-      let out = `# ${preset === 'changes' ? 'Recent Changes' : 'How It Works'}\n\n`;
+      let out = `# ${header}\n\n`;
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@plugin/openclaw-engram/src/tools/engram-presets.ts` at line 62, The current
generation of the header uses a hardcoded ternary on the variable preset to
choose between "Recent Changes" and "How It Works" (see variable out and preset
in engram-presets.ts), which breaks when new presets are added; change the
implementation to accept a title parameter (or use a preset-to-title map) so the
caller supplies the header text instead of the file relying on preset ===
'changes' ? 'Recent Changes' : 'How It Works', update the code that calls this
generator to pass the proper title (or populate the map with keys like
'changes', 'decisions', etc.), and ensure out is built from that supplied title.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@plugin/engram/hooks/stop.js`:
- Around line 339-343: The call to
lib.requestGet(`/api/sessions/${sessionID}/injections`) returns an object with a
field injections, but the code currently checks Array.isArray(injections)
causing injectedObs to be empty; update the logic where injections is used
(variables injections and injectedObs in plugin/engram/hooks/stop.js) to read
the nested array (e.g., use injections?.injections) and set injectedObs =
Array.isArray(injections.injections) ? injections.injections : [] (guarding
against null/undefined) so the downstream send to '/utility' uses the real
injection list.

In `@plugin/openclaw-engram/src/client.ts`:
- Around line 354-364: Метод getTimeline вызывает
this.post('/api/context/search') и из-за отсутствия явного таймаута падает до
default 5s, что в режиме 'query' вызывает ложные timeouts; исправьте вызов в
getTimeline так, чтобы при обращении к '/api/context/search' передавался
увеличенный таймаут (например 15000 ms) в опциях вызова this.post (или
реиспользуйте существующую константу/функцию таймаута в кодовой базе), оставив
остальные параметры (project, mode, query, anchor_id, limit) без изменений.
- Around line 320-330: The setSessionOutcome method currently takes a numeric DB
id (sessionId: number) and uses it in the path
`/api/sessions/${sessionId}/outcome`, but the server expects the string Claude
session ID; change the signature of setSessionOutcome to accept a string
sessionId (or rename to sessionExternalId) and update all callers to pass the
string Claude session ID returned by initSession (not the numeric DB id), then
use that string in the post call to `/api/sessions/${sessionId}/outcome` so
outcomes are written to the correct session.

In `@plugin/openclaw-engram/src/tools/engram-outcome.ts`:
- Around line 55-63: В коде вы берёте sessionResp.id, но ответ от
client.initSession возвращает поле sessionDbId, поэтому sessionId всегда
становится 0; измените логику в обработке ответа от client.initSession (в месте
где используются переменные sessionResp и sessionId) чтобы читать
sessionResp.sessionDbId (проверить наличие свойства через 'sessionDbId' in
sessionResp или typeof sessionResp === 'object') и привести его к Number с
корректным fallback (например 0) — это гарантирует корректное значение sessionId
перед дальнейшими вызовами.

In `@plugin/openclaw-engram/src/tools/engram-vault.ts`:
- Around line 99-104: The current code calls
client.getCredential(parsed.data.name) and returns a combined message on null or
returns the plaintext secret (`${cred.value}`) which both masks the real failure
and leaks secrets; change the handling in engram-vault.ts so you first
distinguish the two failure modes returned by client.getCredential (treat "not
found" and "decryption failed" separately—use the explicit status or extend
getCredential to return a discriminated result), return a clear, non-sensitive
status message for each failure (e.g., "credential not found" vs "credential
decryption failed"), and avoid returning the raw credential value to the
caller/logs (return only masked value or an acknowledgement that the credential
exists).
- Around line 63-72: client.storeCredential currently returns only boolean so
network/auth errors surface as a generic false; update the implementation and
this call site to propagate error detail: change client.storeCredential to
return a discriminated result (e.g. StoreResult: { success: true } | { success:
false; reason: 'network' | 'auth' | 'unknown'; message?: string }) or throw
typed errors, then in engram-vault.ts handle the returned result (or catch
specific errors) and return distinct messages for auth (401/403), network, and
unknown failures instead of the single "Failed to store credential" string;
refer to client.storeCredential and the success variable/return block here to
implement the conditional branching and message mapping.
- Around line 60-61: Check for a missing project after calling resolveIdentity
and ensure we don't proceed with an empty project: if config.project is
undefined and resolveIdentity(ctx.agentId ?? '', ctx.workspaceDir) yields an
empty or falsy projectId, either throw a clear error or fallback to ctx.agentId
(if allowed) before using the project value; update the assignment around
resolveIdentity and project to validate the resolved identity.projectId and
handle the empty case (or document that ctx.agentId must be provided) so
downstream callers (e.g., code that relies on project-scoped credentials) never
receive an empty project.

---

Outside diff comments:
In `@plugin/openclaw-engram/src/index.ts`:
- Around line 125-133: The api.registerTool call is missing the newly created
tool names from the toolFactory; update the names array passed to
api.registerTool to include the additional tools created by the factory
(engram_rate, engram_suppress, engram_outcome, engram_find_by_file,
engram_timeline, engram_changes, engram_how_it_works, engram_vault_store,
engram_vault_get) alongside the existing names so the plugin surface accurately
exposes every tool produced by toolFactory.

---

Nitpick comments:
In `@plugin/openclaw-engram/src/tools/engram-presets.ts`:
- Line 62: The current generation of the header uses a hardcoded ternary on the
variable preset to choose between "Recent Changes" and "How It Works" (see
variable out and preset in engram-presets.ts), which breaks when new presets are
added; change the implementation to accept a title parameter (or use a
preset-to-title map) so the caller supplies the header text instead of the file
relying on preset === 'changes' ? 'Recent Changes' : 'How It Works', update the
code that calls this generator to pass the proper title (or populate the map
with keys like 'changes', 'decisions', etc.), and ensure out is built from that
supplied title.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 12f9473a-3bf0-483f-87de-1ae7b735f277

📥 Commits

Reviewing files that changed from the base of the PR and between c937563 and 3a8e14e.

📒 Files selected for processing (12)

plugin/engram/hooks/statusline.js
plugin/engram/hooks/stop.js
plugin/openclaw-engram/package.json
plugin/openclaw-engram/src/client.ts
plugin/openclaw-engram/src/index.ts
plugin/openclaw-engram/src/tools/engram-find-by-file.ts
plugin/openclaw-engram/src/tools/engram-outcome.ts
plugin/openclaw-engram/src/tools/engram-presets.ts
plugin/openclaw-engram/src/tools/engram-rate.ts
plugin/openclaw-engram/src/tools/engram-suppress.ts
plugin/openclaw-engram/src/tools/engram-timeline.ts
plugin/openclaw-engram/src/tools/engram-vault.ts

✅ Files skipped from review due to trivial changes (1)

plugin/openclaw-engram/package.json

coderabbitai · 2026-03-28T00:36:28Z

+      const identity = resolveIdentity(ctx.agentId ?? '', ctx.workspaceDir);
+      const project = config.project ?? identity.projectId;


⚠️ Potential issue | 🟡 Minor

🧩 Analysis chain

🏁 Script executed:

#!/bin/bash # Поиск обработки пустого project в серверном коде vault rg -n -C3 'project.*==""' --type=go rg -n -C3 'project\s*==\s*""' --type=go ast-grep --pattern 'if project == "" { $$$ }'

Repository: thebtf/engram

Length of output: 8780

Обработайте потенциально пустой project при отсутствии контекста.

Если ctx.agentId пуст и ctx.workspaceDir не задан, resolveIdentity может вернуть пустую строку в качестве projectId. Хотя сервер содержит валидацию на уровне handlers_vault.go:268 (отклонение пустого project для project-scoped credentials) и использует резервный механизм с использованием agentID (см. handlers_context.go:173-174), рекомендуется добавить явную проверку на клиентской стороне или документировать, что ctx.agentId должен быть установлен при использовании credentials without workspaceDir.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@plugin/openclaw-engram/src/tools/engram-vault.ts` around lines 60 - 61, Check for a missing project after calling resolveIdentity and ensure we don't proceed with an empty project: if config.project is undefined and resolveIdentity(ctx.agentId ?? '', ctx.workspaceDir) yields an empty or falsy projectId, either throw a clear error or fallback to ctx.agentId (if allowed) before using the project value; update the assignment around resolveIdentity and project to validate the resolved identity.projectId and handle the empty case (or document that ctx.agentId must be provided) so downstream callers (e.g., code that relies on project-scoped credentials) never receive an empty project.

coderabbitai · 2026-03-28T00:36:28Z

+      const success = await client.storeCredential(
+        parsed.data.name,
+        parsed.data.value,
+        parsed.data.scope,
+        project,
+      );
+
+      return success
+        ? `Credential "${parsed.data.name}" stored securely (scope: ${parsed.data.scope})`
+        : `Failed to store credential "${parsed.data.name}"`;


⚠️ Potential issue | 🟡 Minor

Невозможно различить типы ошибок при сохранении учётных данных.

Метод client.storeCredential() возвращает boolean, основанный на resp != null (см. client.ts:371-384). Если POST-запрос вернёт ошибку аутентификации (401/403) или сетевую ошибку после прохождения проверки isAvailable(), результат будет false, и пользователь увидит "Failed to store credential" вместо более точного сообщения об ошибке.

💡 Рекомендация по улучшению

Рассмотрите возможность расширения возвращаемого типа storeCredential для различения успеха, ошибки сети и ошибки авторизации:

// В будущем: вернуть объект с детализацией ошибки type StoreResult = { success: true } | { success: false; reason: 'network' | 'auth' | 'unknown' };

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@plugin/openclaw-engram/src/tools/engram-vault.ts` around lines 63 - 72, client.storeCredential currently returns only boolean so network/auth errors surface as a generic false; update the implementation and this call site to propagate error detail: change client.storeCredential to return a discriminated result (e.g. StoreResult: { success: true } | { success: false; reason: 'network' | 'auth' | 'unknown'; message?: string }) or throw typed errors, then in engram-vault.ts handle the returned result (or catch specific errors) and return distinct messages for auth (401/403), network, and unknown failures instead of the single "Failed to store credential" string; refer to client.storeCredential and the success variable/return block here to implement the conditional branching and message mapping.

coderabbitai · 2026-03-28T00:36:28Z

+      const cred = await client.getCredential(parsed.data.name);
+      if (!cred) {
+        return `Credential "${parsed.data.name}" not found or decryption failed`;
+      }
+
+      return `${cred.name}: ${cred.value}`;


⚠️ Potential issue | 🟡 Minor

Объединённое сообщение об ошибке скрывает истинную причину.

Метод client.getCredential() возвращает null как для случая "учётные данные не найдены", так и для "ошибка расшифровки" (см. client.ts:390-392). Сообщение на строке 101 упоминает оба варианта, но пользователь не может понять, какой именно произошёл.

Также обратите внимание: возврат расшифрованного значения в виде строки (строка 104) может привести к его попаданию в логи агента или историю чата.

💡 Рекомендация

В будущем рассмотрите возможность возврата из getCredential объекта с явным статусом:

type GetResult = | { found: true; name: string; value: string } | { found: false; reason: 'not_found' | 'decryption_failed' };

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@plugin/openclaw-engram/src/tools/engram-vault.ts` around lines 99 - 104, The current code calls client.getCredential(parsed.data.name) and returns a combined message on null or returns the plaintext secret (`${cred.value}`) which both masks the real failure and leaks secrets; change the handling in engram-vault.ts so you first distinguish the two failure modes returned by client.getCredential (treat "not found" and "decryption failed" separately—use the explicit status or extend getCredential to return a discriminated result), return a clear, non-sensitive status message for each failure (e.g., "credential not found" vs "credential decryption failed"), and avoid returning the raw credential value to the caller/logs (return only masked value or an acknowledgement that the credential exists).

- Suppress action now checks RowsAffected (not found = failed) - Cache invalidation extended to suppress action (was archive-only) - Unified ID validation in memory_forget: validate before branching

coderabbitai

Actionable comments posted: 4

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@plugin/openclaw-engram/src/hooks/before-tool-call.ts`:
- Line 7: handleBeforeToolCall synchronously awaits client.getFileContext(...)
which uses a 3000ms timeout, causing before_tool_call to block file Write/Edit;
change it to use a short, per-call deadline (e.g., 200ms) or race the
getFileContext Promise against a timeout/fallback so the hook returns quickly
with an empty context on timeout/failure; update the implementation in
handleBeforeToolCall to use a Promise.race or an AbortController/short timeout
wrapper around client.getFileContext and ensure before_tool_call always proceeds
with a default empty context when the short deadline elapses.

In `@plugin/openclaw-engram/src/hooks/session-end.ts`:
- Around line 23-47: The current heuristic builds textContent from all messages
and uses successPatterns/failurePatterns to set hasSuccess/hasFailure, which
causes earlier user-specified or historical errors to incorrectly mark a
completed session as "partial" or "failure"; modify the logic in session-end.ts
so that textContent is derived only from assistant/tool messages (e.g., filter
messages by m.role !== 'user' or m.author === 'assistant'/'tool') or by
prioritizing the last N messages (e.g., take the final assistant/tool turn(s))
before applying successPatterns and failurePatterns, then recompute
hasSuccess/hasFailure from that filtered text to decide outcome (references:
textContent, successPatterns, failurePatterns, hasSuccess, hasFailure).
- Around line 104-126: The session-end block calls initSession and
setSessionOutcome unconditionally and can overwrite an explicit engram_outcome
already written by engram-outcome.ts; change the logic to skip computing/setting
the outcome if the session already has an outcome (e.g., check the initSession
response for an existing engram_outcome field or equivalent on sessionResp) and
only call setSessionOutcome when no outcome exists, using detectOutcome() as a
fallback; reference the functions initSession, setSessionOutcome and
detectOutcome and ensure the path is guarded so no duplicate write occurs.

In `@plugin/openclaw-engram/src/tools/memory-forget.ts`:
- Around line 14-24: The id validation currently allows non-integer formats;
update the static schemas ForgetParamsSchema and forgetParameters to require
/^[1-9]\d*$/ (positive decimal integers, no leading zeros), replace the runtime
Number() parsing with parseInt/Number.isSafeInteger checks (e.g.,
parseInt(parsed.data.id, 10) then Number.isSafeInteger on the result) to ensure
it's a safe integer, and pass the normalized integer (e.g., parsedInt.toString()
or the integer itself consistently) into bulkDelete instead of reusing
parsed.data.id so bulkDelete receives a canonical, validated id.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 4b811b9d-2ba3-4dfb-83e9-e3736447e278

📥 Commits

Reviewing files that changed from the base of the PR and between 3a8e14e and a65df54.

📒 Files selected for processing (8)

.agent/specs/plugin-tool-consolidation/plan.md
.agent/specs/plugin-tool-consolidation/spec.md
internal/worker/handlers_import_export.go
plugin/openclaw-engram/package.json
plugin/openclaw-engram/src/hooks/before-tool-call.ts
plugin/openclaw-engram/src/hooks/session-end.ts
plugin/openclaw-engram/src/index.ts
plugin/openclaw-engram/src/tools/memory-forget.ts

✅ Files skipped from review due to trivial changes (3)

plugin/openclaw-engram/package.json
.agent/specs/plugin-tool-consolidation/plan.md
.agent/specs/plugin-tool-consolidation/spec.md

🚧 Files skipped from review as they are similar to previous changes (1)

plugin/openclaw-engram/src/index.ts

coderabbitai · 2026-03-28T00:48:01Z

+  const textContent = messages
+    .map((m) => (typeof m.content === 'string' ? m.content : '').toLowerCase())
+    .join('\n');
+
+  // Success signals: explicit completion markers
+  const successPatterns = [
+    'task complete', 'done', 'finished', 'implemented', 'fixed',
+    'merged', 'deployed', 'resolved', 'committed', 'created pr',
+  ];
+  const hasSuccess = successPatterns.some((p) => textContent.includes(p));
+
+  // Failure signals: explicit error/failure markers
+  const failurePatterns = [
+    'failed', 'error', 'cannot', 'unable to', 'broke', 'regression',
+  ];
+  const hasFailure = failurePatterns.some((p) => textContent.includes(p));
+
+  if (hasSuccess && !hasFailure) {
+    return { outcome: 'success', reason: 'completion signals detected' };
+  }
+  if (hasSuccess && hasFailure) {
+    return { outcome: 'partial', reason: 'mixed success and failure signals' };
+  }
+  if (hasFailure) {
+    return { outcome: 'failure', reason: 'failure signals detected' };


⚠️ Potential issue | 🟠 Major

Эта эвристика слишком охотно помечает успешные сессии как partial или failure.

textContent собирается из всего диалога, включая пользовательские сообщения и ранние ошибки. Сценарий вроде «исправь error X» → «done» здесь уже даёт hasFailure=true, поэтому итог смещается к partial, хотя задача могла завершиться успешно. Лучше анализировать только последние assistant/tool-сообщения или хотя бы давать приоритет финальным ходам.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@plugin/openclaw-engram/src/hooks/session-end.ts` around lines 23 - 47, The current heuristic builds textContent from all messages and uses successPatterns/failurePatterns to set hasSuccess/hasFailure, which causes earlier user-specified or historical errors to incorrectly mark a completed session as "partial" or "failure"; modify the logic in session-end.ts so that textContent is derived only from assistant/tool messages (e.g., filter messages by m.role !== 'user' or m.author === 'assistant'/'tool') or by prioritizing the last N messages (e.g., take the final assistant/tool turn(s)) before applying successPatterns and failurePatterns, then recompute hasSuccess/hasFailure from that filtered text to decide outcome (references: textContent, successPatterns, failurePatterns, hasSuccess, hasFailure).

coderabbitai · 2026-03-28T00:48:01Z

+    // 2. Record session outcome (fire-and-forget)
+    // Resolve DB session ID — may not exist if session was never initialized
+    void (async () => {
+      try {
+        const sessionResp = await client.initSession({
+          claudeSessionId: sessionId,
+          project,
+          prompt: '',
+        });
+
+        const dbSessionId =
+          sessionResp && typeof sessionResp === 'object' && 'id' in sessionResp
+            ? Number(sessionResp.id)
+            : 0;

-    // Fire-and-forget — do not await
-    void client.backfillSession({
-      session_id: sessionId,
-      project,
-      content: truncated,
-    });
-
-    (logger ?? console).warn(
-      `[engram] session-end: submitted ${recent.length} messages for backfill` +
-        ` (project ${project}, reason: ${event.reason ?? 'unknown'})`,
-    );
+        if (dbSessionId <= 0) {
+          (logger ?? console).warn('[engram] session-end: no DB session ID — skipping outcome');
+          return;
+        }
+
+        const { outcome, reason } = detectOutcome(messages);
+        await client.setSessionOutcome(dbSessionId, outcome, reason);
+        (logger ?? console).warn(`[engram] session-end: outcome=${outcome} (${reason})`);


⚠️ Potential issue | 🟠 Major

Не дублируйте запись outcome поверх явного engram_outcome.

Этот блок безусловно делает второй initSession + setSessionOutcome на session_end, хотя plugin/openclaw-engram/src/tools/engram-outcome.ts:41-78 уже пишет outcome для той же сессии. Это создаёт non-idempotent write hazard: эвристика из detectOutcome() может перебить явно заданный результат либо, в лучшем случае, просто добавляет лишний write на закрытии сессии. Нужен guard на "outcome already set" или одноразовая/idempotent семантика на сервере.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@plugin/openclaw-engram/src/hooks/session-end.ts` around lines 104 - 126, The session-end block calls initSession and setSessionOutcome unconditionally and can overwrite an explicit engram_outcome already written by engram-outcome.ts; change the logic to skip computing/setting the outcome if the session already has an outcome (e.g., check the initSession response for an existing engram_outcome field or equivalent on sessionResp) and only call setSessionOutcome when no outcome exists, using detectOutcome() as a fallback; reference the functions initSession, setSessionOutcome and detectOutcome and ensure the path is guarded so no duplicate write occurs.

thebtf · 2026-03-28T00:48:28Z

@coderabbitai review incremental

coderabbitai · 2026-03-28T00:48:41Z

@thebtf Sure! Triggering an incremental review of the PR now.

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

CRIT: engram_outcome uses sessionDbId (not .id) from initSession response MAJOR: - stop.js: read injectionsResp.injections (wrapped response, not root array) - before-tool-call: 500ms timeout (was 3s — too slow for pre-tool hook) - session-end: use sessionDbId, soften heuristic (multi-word patterns), conservative default (partial, not abandoned) - client.ts: timeline uses 15s timeout (matches searchContext), getFileContext accepts configurable timeoutMs

thebtf · 2026-03-28T00:53:27Z

@coderabbitai review incremental

coderabbitai · 2026-03-28T00:53:37Z

@thebtf Sure! Triggering an incremental review of the PR now.

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

Sonnet lite review found: server UpdateSessionOutcome takes claude_session_id string, not numeric DB ID. All outcome calls (engram_outcome tool + session_end hook) now pass ctx.sessionId directly — no initSession lookup needed. - client.ts: setSessionOutcome accepts string, URL-encodes it - engram-outcome.ts: removed initSession, pass claudeSessionId directly - session-end.ts: simplified — no DB ID resolution needed

coderabbitai

Actionable comments posted: 1

♻️ Duplicate comments (2)

plugin/openclaw-engram/src/hooks/session-end.ts (2)
102-112: ⚠️ Potential issue | 🟠 Major

Дублирование записи outcome: session-end может перезаписать явный engram_outcome.

Хук безусловно вызывает setSessionOutcome, даже если агент явно вызвал engram_outcome ранее. Поскольку setSessionOutcome на сервере не имеет идемпотентности (см. client.ts:321-332), эвристический outcome может перезаписать явно заданный пользователем.

Рекомендуется добавить проверку наличия существующего outcome на сервере или флаг в контексте сессии.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@plugin/openclaw-engram/src/hooks/session-end.ts` around lines 102 - 112, The
hook currently always calls detectOutcome(...) and
client.setSessionOutcome(...), which can overwrite an explicit engram_outcome;
update the session-end logic to first check whether an outcome already exists or
a session flag indicates an explicit outcome before calling setSessionOutcome:
query the server for an existing outcome (e.g., via a
getSessionOutcome/getSession or new client method) and only call
client.setSessionOutcome(sessionId, outcome, reason) if no existing
outcome/explicit flag is present, or alternatively consult a session context
flag set when engram_outcome is used and skip the heuristic setSessionOutcome
when that flag is true; adjust detectOutcome usage accordingly so it only runs
when you will actually persist the result.
23-52: ⚠️ Potential issue | 🟠 Major

Эвристика detectOutcome анализирует весь диалог, включая ранние сообщения пользователя.

textContent собирается из всех сообщений, включая пользовательские. Если пользователь написал «исправь build failed» → агент успешно исправил → итог будет partial из-за раннего упоминания ошибки. Рекомендуется анализировать только последние N assistant/tool-сообщений.
♻️ Вариант: фильтровать только assistant/tool сообщения
   const textContent = messages
+    .filter((m) => m.role === 'assistant' || m.role === 'tool')
+    .slice(-10) // последние 10 релевантных сообщений
     .map((m) => (typeof m.content === 'string' ? m.content : '').toLowerCase())
     .join('\n');
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@plugin/openclaw-engram/src/hooks/session-end.ts` around lines 23 - 52, The
detectOutcome heuristic currently builds textContent from all messages
(including early user messages) which can misclassify outcomes; modify the logic
that produces textContent (based on messages) to first filter to only
assistant/tool messages and limit to the last N entries (e.g., last 5–10) before
mapping and joining, so detectOutcome (and related variables
hasSuccess/hasFailure) inspects only recent assistant/tool outputs rather than
the entire dialog.

🧹 Nitpick comments (1)

plugin/openclaw-engram/src/tools/engram-outcome.ts (1)
51-54: Рассмотрите добавление fallback на agentId для консистентности с session-end.

В session-end.ts:77 используется трёхступенчатый fallback: ctx.sessionId ?? ctx.sessionKey ?? agentId. Здесь же только двухступенчатый. Это может привести к ситуации, когда хук запишет outcome, а инструмент вернёт ошибку «no session ID available».
♻️ Опционально: добавить fallback на ctx.agentId
-      const claudeSessionId = ctx.sessionId ?? ctx.sessionKey ?? '';
+      const claudeSessionId = ctx.sessionId ?? ctx.sessionKey ?? ctx.agentId ?? '';
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@plugin/openclaw-engram/src/tools/engram-outcome.ts` around lines 51 - 54,
Update the session-id fallback used when recording outcomes to match
session-end.ts by including ctx.agentId as the third fallback; specifically
change the construction of claudeSessionId (currently using ctx.sessionId ??
ctx.sessionKey ?? '') to use ctx.sessionId ?? ctx.sessionKey ?? ctx.agentId so
the engram outcome writer and session-end logic remain consistent and avoid "no
session ID available" mismatches.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@plugin/openclaw-engram/src/hooks/before-tool-call.ts`:
- Line 7: The comment in before-tool-call.ts states "Non-blocking: 200ms
timeout..." but the actual timeout passed in the code is 500ms; make them
consistent by either changing the comment to 500ms or changing the timeout
argument (the 500ms literal passed where the non-blocking timeout is set) to
200ms so the documentation matches the implemented value in before-tool-call.ts.

---

Duplicate comments:
In `@plugin/openclaw-engram/src/hooks/session-end.ts`:
- Around line 102-112: The hook currently always calls detectOutcome(...) and
client.setSessionOutcome(...), which can overwrite an explicit engram_outcome;
update the session-end logic to first check whether an outcome already exists or
a session flag indicates an explicit outcome before calling setSessionOutcome:
query the server for an existing outcome (e.g., via a
getSessionOutcome/getSession or new client method) and only call
client.setSessionOutcome(sessionId, outcome, reason) if no existing
outcome/explicit flag is present, or alternatively consult a session context
flag set when engram_outcome is used and skip the heuristic setSessionOutcome
when that flag is true; adjust detectOutcome usage accordingly so it only runs
when you will actually persist the result.
- Around line 23-52: The detectOutcome heuristic currently builds textContent
from all messages (including early user messages) which can misclassify
outcomes; modify the logic that produces textContent (based on messages) to
first filter to only assistant/tool messages and limit to the last N entries
(e.g., last 5–10) before mapping and joining, so detectOutcome (and related
variables hasSuccess/hasFailure) inspects only recent assistant/tool outputs
rather than the entire dialog.

---

Nitpick comments:
In `@plugin/openclaw-engram/src/tools/engram-outcome.ts`:
- Around line 51-54: Update the session-id fallback used when recording outcomes
to match session-end.ts by including ctx.agentId as the third fallback;
specifically change the construction of claudeSessionId (currently using
ctx.sessionId ?? ctx.sessionKey ?? '') to use ctx.sessionId ?? ctx.sessionKey ??
ctx.agentId so the engram outcome writer and session-end logic remain consistent
and avoid "no session ID available" mismatches.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 0940eeba-eb73-4fd7-8f69-f1901f39bba0

📥 Commits

Reviewing files that changed from the base of the PR and between a65df54 and 9bc6718.

📒 Files selected for processing (5)

plugin/engram/hooks/stop.js
plugin/openclaw-engram/src/client.ts
plugin/openclaw-engram/src/hooks/before-tool-call.ts
plugin/openclaw-engram/src/hooks/session-end.ts
plugin/openclaw-engram/src/tools/engram-outcome.ts

🚧 Files skipped from review as they are similar to previous changes (1)

plugin/openclaw-engram/src/client.ts

MAJOR: - session_store.go: UpdateSessionOutcome only sets if outcome IS NULL — explicit engram_outcome tool takes priority over heuristic - memory-forget.ts: strict integer regex + parseInt + isSafeInteger validation MINOR: - vault.ts: descriptive error messages for store/get failures - vault.ts: comment about credential value in tool output - before-tool-call.ts: doc says 500ms (matches code) - TECHNICAL_DEBT.md: full spec path

* refactor: move max_tokens from hardcoded 4096 to ENGRAM_LLM_MAX_TOKENS (#49) Configurable via env var ENGRAM_LLM_MAX_TOKENS (default: 4096). Stored in config.Config.LLMMaxTokens and OpenAIConfig.MaxTokens. Removes magic number from LLM client. Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * docs: add pre-commit guardrails + re-benchmark tech debt items * fix: scoring formulas — guidance type weight/decay + meaningful total_results (#50) - Add type=guidance to typeWeights (1.8, highest) and typeHalfLife (365 days) - Behavioral rules no longer decay in 7 days or get default weight 1.0 - sourceBoost 1.3 for LLM-extracted guidance (live user_behavior detection) - total_results now counts observations with composite score > 0.05 (was raw DB count — in high-dim space all observations passed threshold, showing "33 matches" for every query regardless of relevance) Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * fix: exclude behavioral rules from contradiction detection (#51) Imported feedback rules (type=decision, concept=user-preference, title starts with "Rule:") were all classified as contradicting each other because classifyRelation marks any two decisions with different titles and similarity > 0.7 as contradicts. 57 rules × 56 peers = 76 false contradiction edges in the knowledge graph. Added hasGuidanceConcept() check: skips contradiction detection for observations that are behavioral rules (type=guidance, or concept user-preference, or title prefix "Rule:"). Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: update marketplace for v1.6.4 * chore: update marketplace for v1.6.5 * fix: filter heartbeat and Telegram metadata from user prompts (#52) Skip HEARTBEAT.md polling (openclaw every 30min) and Telegram conversation/sender metadata from being stored as user prompts. These are system-generated, not real user interactions. Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * feat: register PreCompact hook and add discovery logging PreCompact hook was created but never registered in hooks.json. Now registered with 10s timeout. Hook writes discovery data to .agent/pre-compact-discovery.json for empirical testing of available input fields (transcript_path verification for FR-2). * feat: always-inject tier for behavioral rules (FR-1, FR-6) Three-tier injection system: observations tagged with concept "always-inject" are now fetched independently of similarity matching and included in every session (session-start) and every prompt (user-prompt) context. Server changes: - GetAlwaysInjectObservations query (concepts @> GIN index) - GIN indexes on concepts, files_modified, files_read columns - Migration 048 for all new indexes - handleContextInject + handleSearchByPrompt return always_inject array - AlwaysInjectLimit (default 20) and ProjectInjectLimit (default 15) config Hook changes: - session-start.js renders <user-behavior-rules> block before <engram-context> - user-prompt.js merges always-inject + similarity-matched rules with dedup - Plugin version bumped to 0.6.0 Also adds GetObservationsByFile and GetPreviousObservationInSession queries for Phase 3 and Phase 4 (no callers yet). * feat: PreCompact hook sends full transcript to backfill (FR-2) Reads transcript JSONL at compaction time, parses all user/assistant messages, and sends to /api/backfill/session in chunks of 50 messages. Fire-and-forget with 5s timeout per chunk (Constitution Principle 3). Fallback: if input.transcript_path is missing, derives path by searching ~/.claude/projects/<hash>/<session>.jsonl. Also writes discovery report to .agent/pre-compact-discovery.json for empirical verification of available hook input fields. * feat: PreToolUse file-context injection (FR-3) New hook and endpoint for automatic file-specific knowledge injection before Edit/Write operations. Server: - GET /api/context/by-file endpoint (handlers_context_file.go) - Returns observations matching files_modified/files_read - Graceful degradation: empty response on error (NFR-3) Hook: - pre-tool-use.js matches Edit/Write tools only - Extracts file_path, queries /api/context/by-file - Returns <file-context> XML block as systemMessage - 200ms timeout with empty fallback - Registered in hooks.json with "Edit|Write" matcher * feat: causal chain linking — follows + prompted_by relations (FR-4, FR-5) Observations within the same session are now automatically linked: - "follows" relation: connects consecutive observations by prompt_number - "prompted_by" relation: links observation to the user prompt that triggered it Both relations are created via pure DB queries (< 10ms overhead per observation, NFR-4) during the existing relation detection pipeline. Changes: - relation/detector.go: add temporal + prompt linking before similarity search - prompt_store.go: add GetPromptForObservation query - service.go: pass promptStore to NewDetector constructor * refactor: extract shared normalizeEngramContent helper and normalize write-tool check - Create plugin/openclaw-engram/src/hooks/content.ts with normalizeEngramContent() centralizing stripEngramContext + CONTENT_MAX_CHARS truncation used by both before-compaction and session-end hooks (eliminates duplicate implementations) - Update before-compaction.ts and session-end.ts to import and use the shared helper - Simplify WRITE_TOOLS Set to lowercase-only entries and normalize via toolName.toLowerCase() in isWriteOrEdit() for reliable case-insensitive matching * fix: convert text columns to jsonb before GIN index creation Migration 048 failed because concepts, files_modified, files_read were stored as text type. PostgreSQL GIN indexes require jsonb. Fix: ALTER COLUMN TYPE jsonb USING COALESCE(col::jsonb, '[]'::jsonb) before CREATE INDEX. Also update GORM model tags from type:text to type:jsonb for consistency. * fix: Phase 1 — Security & Reliability (P0) (#57) * fix: security and reliability improvements (Phase 1 T001-T005) - T001/T002: Apply privacy.RedactSecrets to LLM extraction output before parsing observations (Constitution P9 fix). Both live extraction (processor.go) and backfill (handlers_backfill.go). - T003: Expand CSP headers from `default-src 'self'` to full directive set with script/style/connect/img/font/frame rules. - T004: Add truncated args (200 chars) to MCP tool call error log. - T005: Add diagnostic state (llmClient configured status) to callLLM error messages for debugging. * feat: MCP health monitoring, bounded semaphore, fire-and-forget vault (T006-T010) - T006: New internal/mcp/health.go — atomic request/error counters with 5-minute sliding window for MCP endpoint monitoring - T007: GET /api/mcp/health public endpoint registered - T008: Streamable HTTP handler wired to health counters - T009: Removed nil-semaphore unbounded goroutine fallback — always use bounded semaphore, drop on overflow with warning - T010: vaultStoreDetectedSecrets now fire-and-forget with 3s timeout goroutine (Constitution P3 compliance) * fix: address PR review findings — CSP hardening, args redaction, race fix - CSP: add object-src 'none' + base-uri 'self' per Gemini review - Redact args in error log before logging (prevent secret leakage) - Fix TOCTOU race in MCPHealth.rotateWindowIfNeeded with CompareAndSwap - TODO: migrate unsafe-inline to nonce/hash-based CSP --------- Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * feat: dashboard REST endpoints + MCP tool aliases (Phase 2+4) (#58) Dashboard backend (Phase 2): - POST /api/observations/batch-tag — bulk tag add/remove - DELETE /api/observations/bulk — bulk delete observations - PATCH /api/observations/bulk-scope — bulk scope change - GET /api/observations/tag-cloud — top tags with counts - GET /api/auth/tokens/:id/stats — per-token usage stats - auth_disabled field in /api/auth/me response MCP tools (Phase 4): - find_by_file_context — wraps GetObservationsByFile - include_all parameter for tools/list (+ cursor: "all" compat) - Vault aliases: vault_store, vault_get, vault_list, vault_delete - Document aliases: doc_list_collections, doc_get, doc_ingest, etc. Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * feat: Phases 2-5 + 8a/8b/8c/8d — dashboard, self-learning, consistency, documents (#59) * feat: dedup threshold, manual search signal, install.sh client-only, docs fix - T029: Raise DedupSimilarityThreshold from 0.55 to 0.7 (pre-test confirmed safe) - T031: Add manual search feedback signal in stop.js — detects engram tool usage during session, sends insufficient_injection signal - T038: install.sh defaults to --client-only (skips engram-server binary) - T039: Fix cmplus-server naming in DEPLOYMENT.md to engram-server * feat: intentional links + file→observation graph edges (FR-36, FR-37) - T055: Parse [[obs:1234]] syntax in narratives → create bidirectional references/referenced_by graph edges - T056: files_modified/files_read entries → modifies/reads graph edges using FNV-1a hash of file path as stable node ID - Both integrated into existing Detect() pipeline (event-driven async) * feat: add GetCluster to GraphStore interface (FR-38) - GraphStore interface: GetCluster(nodeID, maxNodes) returns cluster IDs - FalkorDB: BFS traversal up to 3 hops with LIMIT - NoopGraphStore: returns empty slice * feat: LLM causal classifier for error→fix and correction linking (FR-44/45) - New causal_classifier.go: LLM prompt classifies observation pairs as fixed_by, corrects, or unrelated - Wired into Detect() pipeline: triggers for bugfix/guidance types on top-3 similarity candidates only (~1 LLM call per 5 observations) - SetCausalClassifier() method on Detector (opt-in, nil = disabled) - ShouldClassify() filter: only bugfix and guidance types * feat: migration 051 — documents + document_comments tables (FR-46) Foundation for AI agent collaboration platform: - documents: versioned, typed (markdown/task/review/decision), JSONB metadata (assignee/status/priority), author attribution - document_comments: inline and general comments with line ranges - Indexes: project+path+version, doc_type, document_id * feat: Phase 2 frontend + Phase 3 self-learning + Phase 8a consistency + document store Phase 2 Frontend (T017-T021): - Bulk action dropdown (delete/scope/tag) in ObservationsView - Tag cloud sidebar with clickable filters - Per-token stats (request count, last used) in TokensView - Auth-disabled warning badge in AppSidebar - Vault encryption setup helper in SystemView Phase 3 Self-Learning (T023-T028): - Injection floor: always inject at least N observations (default 3) - Cross-session priming: 1.3x boost for recent sessions - Adaptive per-project relevance threshold (project_settings table) - Feedback-driven threshold adjustment (used→lower, ignored→raise) Phase 8a Consistency Engine (T050-T054): - Orphan vector cleanup (vectors without observations) - Missing vector detection (observations without embeddings) - Stale relation cleanup (broken source/target references) - FalkorDB↔PostgreSQL drift detection + auto re-sync - Embedding model change detection via system_config table Phase 8d Document Store (T061): - VersionedDocumentStore with Create/ReadLatest/ReadVersion/List/ GetHistory/AddComment/GetComments GORM methods - SHA-256 content hashing, version tracking, DISTINCT ON for latest * fix: address 12 PR review findings on Phase 2-8 implementation - project_settings: explicit error on DB failure (not silent 0.3) - falkordb GetCluster: ParameterizedQuery instead of string interpolation - migration 051: renamed to versioned_documents (avoid conflict with m017) - versioned_document_store: time.Time fields, transaction on Create, table names - detector: negative file node IDs, corrected causal classification pair order - maintenance: fix SQL column names for stale relation cleanup - install.sh: proper flag parsing (--full doesn't corrupt version arg) - SystemView: vault copy error feedback - AppSidebar: deduplicate auth/me fetch via useAuth composable - ObservationsView: project-aware tag cloud + refresh after mutations - observation_store: deduplicate GetTopImportanceObservations --------- Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * feat: document MCP tools + OpenClaw message classification (T062-T064, T047-T049) (#60) Document MCP tools (T062): - 6 new tools: doc_create, doc_read, doc_update, doc_list, doc_history, doc_comment - VersionedDocumentStore wired into MCP server and service - T063 skipped (memory_get not an MCP tool) - T064: embedding integration point marked with TODO OpenClaw message classification (T047-T049): - New message-classifier.ts with allowlist approach for heartbeat/system detection - before-prompt-build.ts + after-tool-call.ts updated to use classifier - source: "openclaw" added to observation storage calls - always_inject rendering verified in context injection Bump openclaw-engram version 1.3.1 → 1.4.0 Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: update marketplace for v1.7.0 * fix: write pre-compact discovery JSON to project dir (#61) * fix: write pre-compact discovery JSON to project dir, not plugin dir The PreCompact hook used __dirname-relative path to write pre-compact-discovery.json, which resolved to the plugin install cache instead of the project .agent/ directory. Use ctx.CWD instead. * fix: simplify projectDir fallback in pre-compact hook ctx.CWD is already derived from input.cwd in lib.js with type safety, making the intermediate input.cwd check redundant and potentially unsafe (truthy non-string values would bypass ctx.CWD's type guarantee). --------- Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * fix: dashboard type filter, tag cloud, SSE auth (#62) * fix: server-side type filter, add guidance type, fix tag cloud SQL - Add `type` query param to GET /api/observations for server-side filtering - Add obsType param to GetAllRecentObservationsPaginated and GetObservationsByProjectStrictPaginated with optional WHERE clause - Frontend: pass type to API, remove client-side filteredObservations filter - Add `guidance` to ObservationType union, OBSERVATION_TYPES, TYPE_CONFIG - Fix tag cloud SQL: COALESCE(is_superseded, 0) = 0 (bigint, not boolean) * fix: support query param token auth for SSE EventSource EventSource API cannot set custom headers. Add ?token= query param fallback in auth middleware so SSE /api/events can authenticate. * fix: address review findings — DRY query builder, restrict token query param - Refactor observation store: extract buildBaseQuery helper to reduce duplication between paginated functions - Restrict query param token auth to SSE-only endpoints (/api/events, /sse, /api/logs) instead of all routes --------- Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * fix: sidebar metrics use snake_case to match API response (#63) RetrievalStats interface used PascalCase (TotalRequests) but API returns snake_case (total_requests). Sidebar showed 0 for all retrieval metrics. Fixed in api.ts, Sidebar.vue, AppSidebar.vue. Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * docs: update TECHNICAL_DEBT.md with dashboard findings (#64) Add entries for type filter (resolved), SDK extraction types, and dashboard memories view feature request. Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: update marketplace for v1.7.1 * fix: reuse existing session in bulk import instead of creating phantom (#65) - Add optional session_id to BulkImportRequest — if provided, uses CreateSDKSession with that ID (idempotent: INSERT OR IGNORE + fetch) - If not provided, falls back to bulk-import-{timestamp} (backward compat) - OpenClaw engram-remember tool now passes ctx.sessionId to bulkImport - Fixes 403+ phantom bulk-import-* sessions in openclaw project - Bump openclaw-engram 1.4.0 → 1.4.1 Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: update marketplace for v1.7.2 * fix: migration 052 — delete phantom bulk-import sessions (#66) Cleanup 403+ phantom sdk_sessions created by bulk-import before PR #65. Deletes sessions matching 'bulk-import-%' with prompt_counter=0. Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: update marketplace for v1.7.3 * fix: LLM extraction now produces diverse observation types (#67) Previously all extracted learnings were hardcoded to type=guidance. Now: - Prompt asks LLM to classify as guidance/decision/bugfix/discovery/ feature/refactor/change - learningToObsType() maps LLM type to observation type - Legacy signal field still supported as fallback Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * fix: migration 053 — delete vault credentials with lost encryption key (#68) All 15 credentials encrypted with auto-generated key that was lost when Docker container was recreated. AES-256-GCM = unrecoverable. Vault status confirmed mismatch_count=15 = credential_count=15. Users will re-create credentials with current ENGRAM_VAULT_KEY. Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: update marketplace for v1.7.4 * feat: observation status lifecycle (v1.8 Phase 1) (#69) * feat: observation status lifecycle — migration, model, API, MCP (Phase 1 backend) - Migration 054: status TEXT DEFAULT 'active' + status_reason TEXT + index - Observation model: Status + StatusReason fields in GORM and shared models - ObservationUpdate: Status + StatusReason fields for edit_observation - Paginated queries: status filter param (backward compat, "" = all) - Context injection: COALESCE(status, 'active') = 'active' on all query paths - handleGetObservations: ?status= query param - edit_observation MCP tool: status (enum active/resolved) + status_reason * feat: observation status lifecycle UI (Phase 1 frontend) - TypeScript: status + status_reason fields on Observation interface - API client: status param in fetchObservationsPaginated, updateObservationStatus() - ObservationsView: status pill toggle (All/Active/Resolved), resolve button with reason modal, resolved card styling (opacity-50 + line-through), reopen button (green), bulk resolve, status_reason tooltip --------- Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * feat: dashboard Memories view — filter, card variant, inline edit, delete (Phase 2) (#70) - Backend: memory_type filter in paginated queries ("any" = all memories, specific value = exact match) - handleGetObservations: ?memory_type= query param - Frontend: All/Memories toggle, memory cards with purple accent + brain icon, scope badges (project/global), inline edit (title + narrative), delete with confirm Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * feat: pattern insight — LLM summary + source observations (Phase 3) (#71) - New: GET /api/patterns/{id}/observations — resolve observation_ids - New: POST /api/patterns/{id}/insight — LLM summary with cache - New: internal/learning/pattern_insight.go — GeneratePatternInsight - Frontend: inline expand on pattern card (replaces useless modal), skeleton loading, LLM summary + source observation list, cache indicator, unavailable fallback with retry Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * feat: pattern cleanup — orphan detection, confidence recalc, bulk archive (Phase 4) (#72) - Orphan pattern detection: verify observation_ids against existing observations - Batch confidence recalculation using existing formula - POST /api/maintenance/patterns/cleanup with dry_run + threshold params - Frontend: cleanup section with preview (dry_run) + confirm button Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: mark v1.8 items resolved in TECHNICAL_DEBT.md (#73) Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * fix: remove unused TypeScript imports in usePatterns (CI build fix) (#74) fetchPatternInsight and legacyInsights were declared but never read. vue-tsc strict mode treats these as errors. Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: update marketplace for v1.8.0 * fix: 3 post-v1.8.0 bugs — memories empty, insight timeout, filter mess (#75) 1. store_memory now sets memory_type via ClassifyMemoryType() — was never populated, causing Memories tab to show "No observations found" 2. Pattern insight: 5s context timeout + nil LLM guard — was hanging indefinitely when LLM model loading was slow 3. ObservationsView filter bar restructured: 2-row layout with project + view mode on top, type filters + status pills below with divider. Type filters hidden in Memories view. Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: update marketplace for v1.8.0 * fix: migration 055 — backfill memory_type for existing store_memory observations (#76) Existing observations from store_memory (source_type='manual') had empty memory_type. Classifies using same logic as ClassifyMemoryType(): type=guidance→guidance, concepts keywords→decision/pattern/preference/etc, default→context. Enables Memories tab to show historical memories. Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * fix: increase pattern insight LLM timeout from 5s to 30s (#77) Ollama cold start can take 10-30s for model loading. 5s was too aggressive for interactive (non-hot-path) insight generation. Extraction pipeline works because it has no timeout constraint. Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * fix: backfill NULL status to 'active', COALESCE in paginated queries (#78) A1 anomaly: migration 054 ADD COLUMN DEFAULT only sets value for new rows. 708 existing observations had status=NULL. Dashboard "Active" filter matched 0 because WHERE status='active' skips NULLs. - Migration 055: UPDATE SET status='active' WHERE NULL - Paginated queries: COALESCE(status, 'active') = ? for safety - Renumbered memory_type backfill to migration 056 Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * fix: LLM API key falls back to embedding key when not set (#79) ENGRAM_LLM_API_KEY was empty while ENGRAM_EMBEDDING_API_KEY was set. Both point to same LiteLLM proxy but LLM chat completions sent requests without auth → 401 → context deadline exceeded. Now DefaultOpenAIConfig falls back to ENGRAM_EMBEDDING_API_KEY, matching existing URL fallback pattern. Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * fix: pattern insight timeout 30s→120s for Ollama cold start (#80) Ollama loads 9B model from disk in 30-60s on cold start. qwen3-8b took 58s to respond with 20 tokens. 30s was not enough — increased to 120s for interactive insight. Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: update marketplace for v1.8.0 * fix: increase OpenClaw inject/search timeout 5s→15s to prevent cooldown (#81) Root cause: OpenClaw client default timeout = 5s. Inject endpoint returns 80KB+ with vector search, taking 0.7-2s normally but longer under load. 3 consecutive AbortController timeouts → AvailabilityTracker cooldown 60s → ALL engram tools disabled (search, decisions, store_memory, recall). Fix: explicit 15s timeout for getContextInject + searchContext. Other endpoints (health=3s, selfcheck=5s, mark-injected=3s) keep shorter timeouts. Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * fix: add elapsed time + abort reason to OpenClaw client error logs (#82) Inject/search requests abort with "This operation was aborted" but no timing data — impossible to tell if it's timeout (5s), connection refused (immediate), or slow response (2-4s). Now logs: "[engram] POST /api/context/inject failed after 5003ms (timeout=5000ms)" Also includes elapsed time for HTTP errors and success path. Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * feat: closed-loop learning Phase 1 — outcome tracking + injection binding (#83) * feat: closed-loop learning Phase 1 — outcome tracking + injection binding Foundations for closed-loop self-learning (Agent Lightning integration): Schema: - Migration 057: sdk_sessions outcome/outcome_reason/outcome_recorded_at/injection_strategy - Migration 058: observation_injections junction table with session + observation indexes Backend: - InjectionStore: batch record, query by session, TTL cleanup - DetermineSessionOutcome heuristic: success (bugfix/feature obs), partial, abandoned - POST /api/sessions/{id}/outcome endpoint - set_session_outcome MCP tool - handleContextInject records injections to junction table (fire-and-forget) - handleSessionMarkInjected also writes to junction table New files: injection_store.go, outcome.go, handlers_learning.go, tools_learning.go * feat: stop hook records session outcome for closed-loop learning (T010) Heuristic: bugfix/feature observations = success, any obs = partial, none = abandoned. Calls POST /api/sessions/{id}/outcome. No transcript content parsing (NFR-4 compliant). --------- Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * feat: closed-loop learning Phase 2 — score propagation + effectiveness (#84) Closes the feedback loop: session outcomes flow back to observation scores. Schema: - Migration 059: effectiveness_score, effectiveness_injections, effectiveness_successes on observations Backend: - PropagateOutcome: position-weighted utility_score adjustment (always_inject=1.0x, recent=0.8x, relevant=0.5x), ±0.05 per-session cap, [0,1] clamp - ComputeEffectiveness: successes/injections with min_data threshold (10 sessions) - GET /api/observations/{id}/effectiveness endpoint - Scoring calculator: EffectivenessContrib blended into ImportanceScore (weight 0.3) - Maintenance: periodic effectiveness recalc from junction table + 90-day TTL cleanup New files: propagator.go, effectiveness.go Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: update marketplace for v1.9.0 * feat: closed-loop learning Phase 3 — injection strategy A/B testing (#85) 4 strategies: baseline, effectiveness-weighted, recency-boosted, diverse. Round-robin assignment per session (configurable: fixed mode available). - Config: InjectionStrategies, InjectionStrategyMode, DefaultStrategy - StrategySelector: atomic round-robin for thread safety - applyStrategy(): re-sorts/filters observations per strategy - Strategy recorded on sdk_sessions.injection_strategy - GET /api/learning/strategies: per-strategy outcome rate comparison - session-start.js: logs assigned strategy Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * feat: closed-loop learning Phase 4 — agent-specific learning (#86) Per-agent effectiveness tracking: each agent gets its own effectiveness scores for observations, enabling personalized injection. - Migration 060: agent_observation_stats table (agent_id, observation_id PK) - AgentStatsStore: upsert (atomic ON CONFLICT), batch lookup, single lookup - PropagateAgentStats: updates per-agent counters alongside global - handleContextInject: uses agent-specific effectiveness when agent has 10+ injections - Effectiveness API: ?agent_id=X returns agent-specific stats Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * feat: closed-loop learning Phase 5 — APO-lite (automatic prompt optimization) (#87) Low-effectiveness guidance auto-rewritten by LLM, A/B tested, condensed. Schema: - Migration 061: observation_versions table (versioned narratives) Backend: - VersionStore: create/get/set active version - RewriteGuidance: LLM-based APO with effectiveness-aware prompt - POST /api/maintenance/apo/rewrite endpoint (dry_run + apply modes) - Maintenance: detect APO candidates (effectiveness < 0.4 after 15+ injections) - applyActiveVersions: inject uses versioned narrative when available - 3 format variants: bullet-only, concise, structured - CondenseObservation: standalone utility for future auto-condensation Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * feat: closed-loop learning Phase 6 — auto signals + learning dashboard (#88) Final implementation phase: hook-based reward signals + frontend visualization. Hooks: - post-tool-use.js: detect git commits, PRs, error streaks from tool metadata (NFR-4) - stop.js: enhanced outcome with signal counts, upgrade partial→success on commits - lib.js: cross-process signal store via temp files Backend: - Signal weights config (git_commit=1.0, pr_created=2.0, etc.) - GET /api/learning/curve: daily outcome rates for learning curve chart Frontend: - Effectiveness badge on observation cards (green/yellow/red/gray dot) - LearningView.vue: effectiveness distribution, learning curve, strategy comparison - API: fetchLearningCurve, fetchStrategies, fetchEffectiveness - Sidebar: Learning nav item + router route Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: update marketplace for v2.0.0 * fix: split multi-statement migrations 058, 061 for PostgreSQL (#89) PostgreSQL prepared statements reject multiple commands in a single Exec(). Migrations 058 (observation_injections) and 061 (observation_versions) had CREATE TABLE + CREATE INDEX in one call. Split into separate Exec() calls per statement. Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: update marketplace for v2.0.0 * chore: update marketplace for v2.0.0 (#90) Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: sync plugin versions to server v2.0.0 (#91) All plugin versions now match server version: - engram (Claude Code): 0.6.0 → 2.0.0 - openclaw-engram (npm): 1.4.3 → 2.0.0 - marketplace.json: already 2.0.0 Going forward: plugins bump to server version on every release (Constitution #15). Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * fix: escape Windows backslashes in file-context JSONB query (#92) GetObservationsByFile used fmt.Sprintf to build JSON array for @> operator. Windows paths like D:\Dev\... contain backslashes which are invalid JSON escape sequences → SQLSTATE 22P02. Fix: use json.Marshal([]string{filePath}) for proper escaping. Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: bump plugin versions to 2.0.1 (#93) Sync with server v2.0.1 (Constitution #15). - engram plugin: 2.0.0 → 2.0.1 - openclaw-engram: 2.0.0 → 2.0.1 Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: update marketplace for v2.0.1 * test: fix 6 test failures — USERPROFILE, CSP, trivial filter, obs types (#94) - config: set USERPROFILE alongside HOME for Windows (os.UserHomeDir reads USERPROFILE) - worker: update CSP assertion to match stricter security headers - sdk: change test tool names from Bash to Edit to bypass trivial filter - sdk: add "guidance" to valid observation types map - sdk: update Read/Grep expected skip behavior (whitelist approach) Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * feat: periodic outcome recording — server-side closed-loop trigger (#95) Users don't close sessions (always continue) and PreCompact is rare with 1M context. Stop hook never fires → outcome never recorded → closed loop never closes. Fix: server-side periodic job (every 15 minutes, configurable) finds sessions with injection records but no outcome, determines outcome from observation types, records + propagates. - GetSessionsWithPendingOutcome: sessions with injections >10min old, no outcome - runOutcomeRecorder: separate goroutine from heavy maintenance - Config: ENGRAM_OUTCOME_RECORDER_INTERVAL_MINUTES (default 15) Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: bump plugins to v2.0.2 (periodic outcome recorder) (#96) Constitution #15: plugin versions track server. Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: update marketplace for v2.0.2 * fix: add diagnostic logging to stop hook for investigation (#97) Stop hook has zero traces in server logs — unclear if it's: 1. Not called by CC harness 2. Called but silently failing (catch returns '') 3. Called but session lookup fails Added: health check marker (proves hook ran), session lookup error logging, invalid session ID logging. Will reveal root cause on next session exit. Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * feat: store_memory accepts always_inject param for behavioral rules (#98) When always_inject=true, adds "always-inject" concept to observation. Observations with this concept are injected into every agent context regardless of query relevance (user-prompt.js hook filters on it). Closes gap: store_memory previously couldn't create behavioral rules because it didn't set the always-inject concept marker. Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * fix: migration 062 — cleanup remaining phantom bulk-import sessions (#99) 6 remaining bulk-import-* sessions from before PR #65 fix. Migration 052 cleaned most; this catches the rest. Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: mark 2 tech debt items resolved (#100) - Phantom bulk-import sessions: cleaned by migration 062 (PR #99) - T027 post-deploy verification: composite scoring active in v2.0.2 Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: bump plugins to v2.0.3 + stop hook diagnostics (#101) - engram plugin: 2.0.2 → 2.0.3 - openclaw-engram: 2.0.2 → 2.0.3 - stop.js: diagnostic file marker + error logging (PR #97 changes) Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: update marketplace for v2.0.3 * fix: dedup skips suppressed observations + edit_observation always_inject (#102) Two fixes for behavioral rules workflow: 1. store_memory dedup: suppressed/archived observations no longer block re-creation. Vector index doesn't exclude suppressed obs, so dedup now checks DB for is_suppressed/is_archived before rejecting. 2. edit_observation: accepts always_inject boolean. When true, adds "always-inject" concept to existing concepts. When false, removes it. Enables converting existing observations to behavioral rules. Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: bump plugins to v2.0.4 (dedup fix + always_inject edit) (#103) Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: update marketplace for v2.0.4 * fix: add effectiveness + status fields to ObservationJSON serialization (#104) ObservationJSON struct was missing effectiveness_score, effectiveness_injections, effectiveness_successes, status, and status_reason fields. Observations list API returned these as undefined → Learning Dashboard showed 100% "Insufficient data". Fields existed on Observation struct but were never copied to ObservationJSON in MarshalJSON. Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: bump plugins to v2.0.5 (effectiveness JSON fix) (#105) Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: update marketplace for v2.0.5 * fix: server-side effectiveness distribution for Learning Dashboard (#106) Replaced client-side counting (fetch 500 obs, count tiers) with server-side SQL aggregation endpoint. - GET /api/learning/effectiveness-distribution: COUNT FILTER by tier - GetEffectivenessDistribution: single SQL query, excludes archived/suppressed - LearningView: uses server endpoint, no more fetchObservationsPaginated - Removes 500-observation limit and 80KB+ unnecessary payload Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: bump plugins to v2.0.6 (server-side effectiveness) (#107) Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: update marketplace for v2.0.6 * fix: session outcome/strategy fields in API + session_id in inject (#108) Three root causes for Learning Dashboard empty data: 1. GORM SDKSession model missing outcome/strategy fields — DB has data but GORM never reads it. Added 4 fields to both GORM and shared models. 2. session-start.js inject call missing session_id param — inject handler fell back to empty string → UpdateInjectionStrategy matched 0 rows. Now passes ctx.SessionID in inject URL. 3. toModelSDKSession mapping missing outcome/strategy fields. Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: bump plugins to v2.0.7 (session fields + inject session_id) (#109) Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: update marketplace for v2.0.7 * fix: effectiveness distribution excludes never-injected observations (#110) "Insufficient data" showed 797 observations including those never injected. Now only counts observations with effectiveness_injections > 0: - Participated but <10 sessions → "Insufficient data" (will resolve) - Never injected → excluded (dead weight, not actionable) Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * feat: session injection retrospective API (#111) GET /api/sessions/{id}/injections — returns all observations injected into a session with effectiveness metrics and summary stats. Enables retrospective analysis: what was injected, noise vs signal, effectiveness breakdown per section (always_inject/recent/relevant). Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: update marketplace for v2.0.8 * chore: plugin-tool-consolidation spec + gap audit - Gap audit report: plugin vs API analysis (68 MCP tools, 130 REST endpoints) - New spec: plugin-tool-consolidation (6 FR, 4 NFR, 6 US) - Plan: 5 phases, 34 tasks, analyze remediation applied - Closed old mcp-tools-refactoring spec (FR7/FR8 → TECHNICAL_DEBT) * chore: update task progress (Phases 1-5 complete) * feat: plugin tool consolidation — all phases (FR-1 through FR-6) (#112) * chore: plugin-tool-consolidation spec + gap audit - Gap audit report: plugin vs API analysis (68 MCP tools, 130 REST endpoints) - New spec: plugin-tool-consolidation (6 FR, 4 NFR, 6 US) - Plan: 5 phases, 34 tasks, analyze remediation applied - Closed old mcp-tools-refactoring spec (FR7/FR8 → TECHNICAL_DEBT) * chore: remove 7 redundant MCP tool registrations (FR-1) Remove from tools/list: get_context_timeline, get_timeline_by_query, get_recent_context, find_by_file_context, get_observation_relationships, get_graph_neighbors, doc_update. All 7 tools remain callable via dispatch aliases in handleCallTool for backward compatibility. Reduces tool count from 68 to 61. Updates tests to match new tool set. * fix: openclaw decisions endpoint + memory_forget suppress default (FR-2) - engram_decisions now uses /api/decisions/search instead of searchContext + client-side type filter (B1 from audit) - memory_forget defaults to suppress (reversible) instead of archive. Add permanent=true parameter for permanent archival (B2 from audit) - Add suppressObservation() client method using bulk-status suppress action - Add "suppress" action to server bulk-status handler - Bump openclaw-engram to 2.0.9 * feat: expand openclaw tools — rate, suppress, outcome, file, timeline, vault (FR-3) Add 9 new tools to openclaw-engram: - engram_rate: rate observations as useful/not useful - engram_suppress: reversible soft-hide from search - engram_outcome: record session outcome for closed-loop learning - engram_find_by_file: check what engram knows BEFORE modifying a file - engram_timeline: fetch temporal observation context - engram_changes: search preset for recent code changes - engram_how_it_works: search preset for architecture/design - engram_vault_store: securely store encrypted credentials - engram_vault_get: retrieve and decrypt credentials All tool descriptions include trigger conditions (NFR-3). Add client methods: rateObservation, setSessionOutcome, getFileContext, getTimeline, storeCredential, getCredential. Add preset param to searchContext type. Bump openclaw-engram to 2.0.10. Total tools: 17 (was 8). * feat: cc stop hook retrospective API + statusline learning metrics (FR-5, FR-6) - stop.js: Replace /api/sessions/{id}/injected-observations + individual utility calls with single /api/sessions/{id}/injections (retrospective API). Fewer HTTP calls, enriched response with effectiveness data. - statusline.js: Add learning effectiveness indicator with 60s client cache. Shows "eff:72%" (high tier percentage) or "eff:--" when no data. Fetches /api/learning/effectiveness-distribution in parallel with stats. * feat: openclaw lifecycle hooks — outcome, utility, file context (FR-4) - session_end: detect session outcome (success/partial/failure/abandoned) from conversation signals, record via /api/sessions/{id}/outcome. Handles gracefully when no DB session ID exists. - before_tool_call: inject file-context observations before Write/Edit tools using /api/context/by-file. 200ms timeout, non-blocking. - Register before_tool_call hook in index.ts. - Bump openclaw-engram to 2.0.11. * fix: address CodeRabbit review findings in spec/plan docs - Fix edge case: memory_forget has only permanent param, not suppress - Fix edge case: before_tool_call not after_tool_call - Fix plan: version tracking says 2.0.x not 2.1.0 * fix: address CodeRabbit review — suppress cache + ID validation - Suppress action now checks RowsAffected (not found = failed) - Cache invalidation extended to suppress action (was archive-only) - Unified ID validation in memory_forget: validate before branching * fix: address CodeRabbit re-review findings (round 2) CRIT: engram_outcome uses sessionDbId (not .id) from initSession response MAJOR: - stop.js: read injectionsResp.injections (wrapped response, not root array) - before-tool-call: 500ms timeout (was 3s — too slow for pre-tool hook) - session-end: use sessionDbId, soften heuristic (multi-word patterns), conservative default (partial, not abandoned) - client.ts: timeline uses 15s timeout (matches searchContext), getFileContext accepts configurable timeoutMs * fix: session outcome uses claude session ID string, not numeric DB ID Sonnet lite review found: server UpdateSessionOutcome takes claude_session_id string, not numeric DB ID. All outcome calls (engram_outcome tool + session_end hook) now pass ctx.sessionId directly — no initSession lookup needed. - client.ts: setSessionOutcome accepts string, URL-encodes it - engram-outcome.ts: removed initSession, pass claudeSessionId directly - session-end.ts: simplified — no DB ID resolution needed * fix: address all remaining CodeRabbit findings (round 3) MAJOR: - session_store.go: UpdateSessionOutcome only sets if outcome IS NULL — explicit engram_outcome tool takes priority over heuristic - memory-forget.ts: strict integer regex + parseInt + isSafeInteger validation MINOR: - vault.ts: descriptive error messages for store/get failures - vault.ts: comment about credential value in tool output - before-tool-call.ts: doc says 500ms (matches code) - TECHNICAL_DEBT.md: full spec path --------- Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: resolve stash conflicts * chore: update marketplace for v2.0.9 * chore: mcp-tool-api-consolidation spec (61→7 tools) Full SpecKit pipeline: specify → clarify → plan → tasks → analyze. Consolidates 61 MCP tools into 6 primary (recall/store/feedback/vault/docs/admin) + check_system_health. Backward-compatible dispatch aliases for all old names. Target: >80% context window reduction (~6100 → ~900 tokens). Also: 3 new dashboard bugs recorded in inbox. * feat: MCP tool API consolidation — 61 tools → 7 primary (#113) * chore: mcp-tool-api-consolidation spec (61→7 tools) Full SpecKit pipeline: specify → clarify → plan → tasks → analyze. Consolidates 61 MCP tools into 6 primary (recall/store/feedback/vault/docs/admin) + check_system_health. Backward-compatible dispatch aliases for all old names. Target: >80% context window reduction (~6100 → ~900 tokens). Also: 3 new dashboard bugs recorded in inbox. * feat: create 6 primary tool routers (Phase 1 — FR-1 through FR-6) New handler files that route consolidated tool actions to existing handlers: - tools_recall.go: 12 actions (search, preset, by_file, by_concept, etc.) - tools_store_consolidated.go: 4 actions (create, edit, merge, import) - tools_feedback.go: 3 actions (rate, suppress, outcome) - tools_vault_consolidated.go: 5 actions (store, get, list, delete, status) - tools_docs_consolidated.go: 11 actions (create, read, list, history, etc.) - tools_admin.go: 21 actions (bulk ops, tags, graph, maintenance, etc.) Each is a thin routing layer — NO new business logic. All delegate to existing handler functions via action parameter dispatch. * feat: register 7 primary tools + alias dispatch (Phase 2 — FR-7, FR-8) - Add primaryTools() returning 6 consolidated tools with flat schemas - Default tools/list returns 7 tools (6 primary + check_system_health) - cursor=all returns primary + 61 legacy alias tools - callTool dispatch: primary names → consolidated handlers first, then fallthrough to legacy alias handlers - All 61 original tool names continue to work via alias dispatch * test: update MCP tests for 7 primary tools (Phase 3) - TestHandleToolsList: expect 7 primary tools by default, legacy in cursor=all - TestCallTool_ToolNameRecognition: verify primary + alias names in cursor=all - Account for conditional tools (store_memory etc.) not present with nil stores * docs: update engramInstructions for 7 consolidated tools (T018) Replace 61 legacy tool references with 7 primary tools in the MCP server instructions string. Shows action-based API: recall(action=...), store(action=...), feedback(action=...), vault(action=...), docs(action=...), admin(action=...), check_system_health(). Includes backward compat note. --------- Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: mark all mcp-tool-api-consolidation tasks complete * chore: update marketplace for v2.1.0 * chore: dashboard-bugfixes-v2 spec + TD cleanup - New spec: 4 dashboard bugs (concept filter, type filter, 50/50 counts, summaries) - Marked 3 TD items resolved (phantom sessions, vault lost key, MCP stubs) * fix: dashboard concept filter, type filter, count display (#114) (#114) * fix: dashboard concept filter, type filter, and count display FR-1: Concept filter — add server-side concept param to handleGetObservations and both paginated store methods. LIKE query on concepts JSON column. Frontend passes concept from FilterTabs to fetchObservations. FR-2: Type filter on HomeView — fetchObservations now passes type param. Server already supported type filtering (obsType), was just not wired on home. FR-3: Real counts — fetchObservations returns { observations, total }. useTimeline tracks observationTotal from API response instead of array length. "50 obs / 50 prompts" replaced with real counts. Backend: handlers_data.go, observation_store.go (concept WHERE clause) Frontend: api.ts (fetchObservations params), useTimeline.ts (server filter + totals) Callers updated: handlers_import_export.go, detector.go (pass "" for concept) * fix: use JSONB @> operator for concept filter instead of LIKE --------- Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: update marketplace for v2.1.1 * fix: SDK extraction prompt bias + mark dashboard type filter resolved - Reorder extraction prompt types: specific first (decision, feature, bugfix), general last (guidance). Add explicit note: "prefer specific over general" - Mark "Dashboard Type Filter" TD as resolved v2.1.1 (PR #114) * chore: mark 3 more TD items resolved (extraction types, type filter, namespace prefixes) * chore: behavioral rules created (3 always_inject observations), mark TD resolved * chore: mark 2 inbox bugs fixed (concept filter, counts) from PR #114 * chore: triage TD + inbox — mark DEFERRED/IMPLEMENTED items TD: GPU contention and re-benchmark marked DEFERRED (external/infra) Inbox: 5 ideas marked DEFERRED (future FR), 1 bug DEFERRED (external), user commands marked IMPLEMENTED (PR #115), 2 bugs marked FIXED (PR #114) Spec: engram-user-commands pipeline artifacts * feat: add 4 engram user commands (retro, stats, cleanup, export) (#115) - /engram:retro — session retrospective (injection analysis, effectiveness, recommendations) - /engram:stats — memory health dashboard (counts, types, effectiveness, learning curve, search analytics) - /engram:cleanup — interactive observation curation (review, suppress, edit, merge low-quality items) - /engram:export — export observations as markdown/JSON/JSONL with project/type filters All commands use consolidated tool API (recall/store/feedback/admin). Commands are markdown instruction files — no compilation needed. Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: update marketplace for v2.1.2 * feat: pre-edit guardrails + session summarization on start (#116) Wave 2: Pre-edit guardrails — pre-tool-use.js now separates warnings (bugfix, guidance, anti-pattern, gotcha, security) from general context. Warnings appear first with clear header so agent reviews them before editing. Wave 3: Session summarization — session-start.js triggers summarization of the most recent unsummarized session (fire-and-forget, 1 per start). Workaround for CC bug #19225 (stop hook doesn't fire) so summaries accumulate and appear on the Dashboard Summaries tab. Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: mark pre-commit guardrails TD resolved v2.1.3 * chore: update marketplace for v2.1.3 * feat: config hot-reload without process restart (#117) * feat: config hot-reload without process restart Replace os.Exit(0) in reloadConfig with atomic config swap via config.Reload(). Services calling config.Get() per-request pick up new values automatically. - config.go: add Reload() function — re-reads from disk, swaps global, returns list of changed fields - service.go: reloadConfig() uses Reload() instead of os.Exit(0), broadcasts changed fields to dashboard via SSE Port/token changes log a warning (still need manual restart). All other config changes (model, embedding, context limits, reranking, HyDE, maintenance) take effect immediately. * fix: detect WorkerToken changes in hot-reload (requires restart) --------- Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: mark config reload TD resolved v2.1.4 * chore: update marketplace for v2.1.4 * chore: mark GPU contention TD resolved (transient queue issue) * feat: inbox features — session counter, consistency check, memory import (#118) 1. Dashboard: "Sessions Today" instead of "Active Sessions" (was always 0) — uses sessionsToday from stats API, not in-memory count 2. Consistency check endpoint: GET /api/maintenance/consistency — read-only orphan detection (vectors, relations, observations) — returns { orphan_vectors, observations_without_vectors, stale_relations, healthy } 3. memory_get store flag: memory_get(path="file.md", store=true) — reads .md file AND imports content into engram as observation — bridges local markdown files → engram persistent memory 4. Version bump: openclaw-engram 2.1.5 Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: mark 4 inbox items resolved/implemented (notes, consistency, indexes, bridge) * chore: mark session tracking, CC bug, summaries as resolved/mitigated * chore: audit specs (4 marked Implemented), close audit inbox item * chore: mark OpenClaw architecture as external dependency, audit complete * chore: update marketplace for v2.1.5 * feat: graph UX polish — local mode, search, visual styling (#119) Phase 1: Local graph mode - Route /graph/:observationId? with optional param - Fetches /api/observations/{id}/graph?depth=N - Anchor node: larger (25px), green border (#10b981) - Depth selector (1/2/3) in toolbar - "View in Graph" link on ObservationCard Phase 2: Node search - Search input in toolbar with match count - Matching nodes: yellow border highlight - Non-matching: dimmed (0.3 opacity) - Enter key: focus camera on first match Phase 3: Visual styling - Node shadows, hover glow (white border) - Edge colors mapped to relation types - Curved edges (curvedCCW), dashed for low confidence - Dot grid background - Edge color legend sidebar Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: mark graph UX polish implemented, all inbox items complete * chore: update marketplace for v2.1.6 * chore: update benchmark to max_tokens:4096, 13 current models * chore: mark re-benchmark TD resolved (script updated, ready to run) * fix: benchmark parallel=1 default (avoid multi-model GPU overload) * chore: bump openclaw-engram to 2.1.6 (match server version per Constitution #15) * chore: remove legacy alias tools from tools/list entirely Legacy tool names (search, store_memory, find_by_file, etc.) no longer appear in tools/list at all — not even with cursor=all. Only 7 primary tools shown. Dispatch aliases still work in callTool for backward compat (zero runtime cost, zero context cost). * fix: summaries — build content from session observations when no transcript (#120) ProcessSummary now fetches session observations from DB when called without lastAssistantMsg (e.g., from session-start summarizer). Previously: empty msg → hasMeaningfulContent=false → skip always. Now: empty msg → query observations by sdk_session_id → build summary input from observation titles+narratives → generate LLM summary. Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: dashboard-quality-v3 spec + 3 inbox bugs * feat: dashboard quality v3 — search misses, sessions, pattern insights (#121) Phase 1: Fix search misses display — unwrap miss_stats envelope, map miss_count→frequency Phase 2: Sessions backend — add min_prompts, from, to filters to ListSDKSessions Phase 3: Sessions frontend — pass min_prompts=1 (hide empty), wire date filters, clickable sessions with detail view (SessionDetailView.vue: metadata, injections, outcome) Phase 4: Pattern insight background — maintenance Task 18 generates LLM insights for patterns with generic descriptions (5 per cycle), caches in pattern.description Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: update marketplace for v2.1.7 * chore: save session state (v2.1.7, 10 PRs, all TD resolved) * feat: dashboard UX polish — tooltips, cursor-pointer, hover transitions, color coding (#122) - Tooltips: all action buttons have descriptive title attributes explaining what they do and whether reversible (Resolve, Suppress, Archive, Rate, Graph) - Cursor-pointer: 32 additions across 3 files — all interactive elements - Hover transitions: 27 duration-200 additions for consistent 200ms timing - Color coding: destructive=red, resolve=green, reopen=blue, info=gray - Existing ConfirmDialog already handles destructive action confirmation 3 files: ObservationsView.vue, ObservationCard.vue, ObservationDetailView.vue Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: update marketplace for v2.1.8 * docs: summaries + concepts pipeline audit report with root causes and fixes * chore: summaries-concepts-fix spec + updated inbox * fix: summaries + concepts pipeline — 3 root causes from audit (#123) FR-2: Add valid concept list to extraction systemPrompt (processor.go). LLM now knows which concepts to use instead of inventing random ones. Fixed example: user-preference → workflow. FR-1: Add userPrompt fallback in ProcessSummary (processor.go). When both lastAssistantMsg and observations are empty, use the session's initial user prompt as summary input. FR-3: Migration 055 — keyword-based concept backfill for 1047 existing observations. Assigns architecture, security, debugging, api, database, etc. based on title/narrative keyword matching. No LLM needed. Audit: .agent/reports/summaries-concepts-audit-2026-03-28.md Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: mark summaries-concepts tasks complete * chore: mark dashboard-quality-v3 tasks complete (PR #121) * chore: update marketplace for v2.1.9 * chore: save session state (v2.1.9, 11 PRs, session compacted) * docs: investigate report — 13 findings (4 P1, 7 P2, 2 P3) across 12 areas * docs: summaries pipeline investigation — root cause is trigger architecture, not code * chore: server-summarizer-and-fixes spec + tasks * fix: summaries + concepts pipeline — 3 root causes from audit (#123) (#124) FR-1: Server-side periodic summarizer (maintenance Task 19) Scans sessions with prompts > 0 and no summary, older than 30min. Builds content from observations, calls LLM, stores in session_summaries. Cap: 3 per cycle. No client hook dependency. FR-2: Pre-edit guardrails — remove guidance from warnings Only bugfix + concept-based (anti-pattern, gotcha, security) are warnings. Global behavioral rules no longer show as "WARNINGS" before every file edit. FR-3: Remove client-side summarizer from session-start.js Replaced by server-side Task 19. Client workaround had bugs (sess.summary field doesn't exist, would re-summarize repeatedly). FR-4: Circuit breaker recovery logging Logs "entering half-open state" and "recovered — LLM calls re-enabled" for diagnosing LLM availability from server logs. Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: mark server-summarizer tasks complete (PR #124) * chore: update marketplace for v2.2.0 * chore: mark graph-ux-polish tasks complete (PR #119) * chore: mark stale tasks complete (dashboard-bugfixes-v2 PR#114, user-commands PR#115) * chore: transfer investigate P1/P2 findings to inbox as actionable tasks * chore: audit-bugfixes spec + tasks (P1+P2 from investigate) * fix: audit bugfixes — P1+P2 findings from investigate report (#125) T001: Summary dedup verified — NOT EXISTS check already correct T002: OpenClaw before_tool_call — added BeforeToolCallResult to HookResult type T003: Store content validation — error message clarified T004: Summary userPrompt threshold lowered (50→10 chars) T005: Migration 064 — backfill 5 missing concepts (why-it-exists, what-changed, anti-pattern, gotcha, trade-off) with keyword matching T006: go build + tsc --noEmit verified clean Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: mark audit-bugfixes complete, update inbox (v2.2.1) * chore: update marketplace for v2.2.1 * docs: 3 ADRs from Cipher competitive analysis + investigation report ADR-003: Reasoning Traces (System 2 Memory) — store HOW agent reasons, not just WHAT it decided ADR-004: Dedicated Embedding Resilience — separate CB, health check, 4 states, auto-recovery ADR-005: LLM-Driven Memory Extraction — extract_and_operate for autonomous observation creation Investigation: 10 findings across 10 areas comparing Cipher vs Engram architecture * chore: reasoning-traces spec (System 2 Memory from ADR-003) * chore: reasoning-traces full SpecKit pipeline (clarify+plan+tasks+analyze) * feat: reasoning traces (System 2 Memory) — Phases 1-3 (#126) Phase 1: Data Model - Migration 065: reasoning_traces table (steps JSONB, quality_score, task_context) - GORM model ReasoningTrace with BeforeCreate hook - ReasoningTraceStore with Create/GetBySession/SearchByProject Phase 2: Extraction - reasoning_detector.go: DetectReasoning() — 3+ pattern matches in 200+ char text - Extraction + quality evaluation LLM prompts - Async extraction in ProcessObservation (non-blocking goroutine) - Quality threshold ≥ 0.5 to store Phase 3: MCP Integration - recall(action="reasoning") — searches traces by project, formats with step types - "reasoning" added to recall tool action enum - Wired into worker service (processor + MCP server) ADR-003 implemented. Inspired by Cipher's System 2 dual memory architecture. Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: mark reasoning-traces tasks complete (PR #126, v2.3.0) * chore: embedding-resilience spec pipeline (ADR-004) * chore: close 2 remaining P2 inbox items (metric documented, visual API-verified) * chore: update marketplace for v2.3.0 * feat: dedicated embedding resilience layer (ADR-004) (#127) - ResilientEmbedder wraps EmbeddingModel with 4-state circuit breaker: HEALTHY → DEGRADED (1+ failures) → DISABLED (5+ failures) → RECOVERING - Health check goroutine probes every 30s when not HEALTHY - Automatic recovery: probe succeeds → RECOVERING → next real request → HEALTHY - Independent from LLM circuit breaker (embedding failures ≠ LLM failures) - Thread-safe via sync/atomic - Wired into worker service (init + reinit + shutdown) - selfcheck handler reports embedding status with failure counts Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: mark embedding-resilience tasks complete (PR #127) * chore: extract-and-operate spec pipeline (ADR-005) * chore: update marketplace for v2.3.1 * feat: store(action="extract") — LLM-driven memory extraction (ADR-005) (#128) - New action on store tool: accepts raw content, uses LLM to extract observations - Extraction prompt generates structured observations (type, title, narrative, concepts) - Privacy: content redacted via RedactSecrets before LLM call - Validation: min 50 chars, truncate at 32k, type validation with fallback - Returns summary: {extracted, stored, duplicates, titles} - Added "extract" to store tool action enum Co-authored-by: Kirill Turanskiy <thebtf@users.noreply.github.com> * chore: mark extract-and-operate tasks complete (PR #128, v2.4.0) * chore: save session state (v2.4.0, 16 PRs, all complete) * chore: update marketplace for v2.4.0 * docs: complete documentation rewrite for v2.4.0 - README: 48 legacy tools → 7 consolidated primary tools, marketing intro, architecture diagram with dashboard/LLM/embedding, What's New, Use Cases, Upgrading, Troubleshooting, MCP Tools reference with all actions - CHANGELOG: 17 new entries (v2.0.7 through v2.4.0) with comparison links - README.ru.md: full Russian translation synced to v2.4.0 - README.zh.md: full Chinese translation synced to v2.4.0 * chore: remove agent working state and test artifacts from tracking - .agent/ was in .gitignore but files were committed before the rule - .playwright-mcp/ screenshots are test artifacts, not source - TECHNICAL_DEBT.md is agent-local state - Updated .gitignore to cover all three * perf: narrow PostToolUse hook matcher from * to Write|Edit|Bash|Agent|mcp__aimux Eliminates ~50+ unnecessary node process spawns per research session. Previously matcher * fired post-tool-use.js on every tool call including Read, Grep, Glob, ToolSearch — which then hit skipTools early exit and returned empty. Now Claude Code filters at matcher level, avoiding process spawn entirely for read-only tools. Removed skipTools map from post-tool-use.js (redundant with matcher). Bumped plugin version 2.0.7 → 2.0.8. * feat: strengthen MCP server instructions to assert memory exclusivity - "Your ONLY Persistent Memory" — exclusivity claim over competing tools - AFTER workflow mandatory: store decisions/discoveries after every task - "Steps 4-6 are NOT optional" — directive-level store instruction - "What to Store" section with concrete examples - Workflow patterns now end with store, not just recall - Counters Nia context manager competing for agent attention * chore: update marketplace for v2.4.1 * chore: update marketplace for v2.4.1 * perf: stop re-injecting behavioral rules on every user prompt Behavioral rules (user-preference concept + always-inject) are already injected once by session-start.js via /api/context/inject. Re-injecting them on every UserPromptSubmit via /api/context/search wasted ~4K tokens per prompt (~17KB duplicated behavioral rules block). Changes: - Removed behavioral rules assembly from user-prompt.js - Removed footer reminder (redundant with MCP server instructions) - Only technical observations injected in <relevant-memory> - Bumped plugin 2.4.1 → 2.4.2 * feat: minimum viable learning loop — close feedback loop + stop scope leak Phase 1 (narrow scope): - Remove includeGlobal=true from 3 vector search call sites in context handlers (search, file-context, inject). Observations from other projects no longer pollute context injection. - Add project filter to GetAlwaysInjectObservations — only returns observations from current project or global scope (was: all projects). - Client-side min similarity filter (>0.10) in user-prompt.js — observations with 0.00 relevance no longer injected. Phase 2a (close the loop): - Add Bayesian effectiveness multiplier to ApplyCompositeScoring. Formula: (successes + 1) / (injections + 2) with neutra…

thebtf added 3 commits March 28, 2026 03:04

coderabbitai Bot added the ai-review:active label Mar 28, 2026

thebtf added 2 commits March 28, 2026 03:22

coderabbitai Bot requested changes Mar 28, 2026

View reviewed changes

thebtf changed the title ~~feat: plugin tool consolidation — Phase 1+2 (FR-1, FR-2)~~ feat: plugin tool consolidation — Phases 1-3, 5 (FR-1 through FR-6 partial) Mar 28, 2026

thebtf changed the title ~~feat: plugin tool consolidation — Phases 1-3, 5 (FR-1 through FR-6 partial)~~ feat: plugin tool consolidation — all phases (FR-1 through FR-6) Mar 28, 2026

fix: address CodeRabbit review findings in spec/plan docs

d7a9077

- Fix edge case: memory_forget has only permanent param, not suppress - Fix edge case: before_tool_call not after_tool_call - Fix plan: version tracking says 2.0.x not 2.1.0

coderabbitai Bot requested changes Mar 28, 2026

View reviewed changes

fix: address CodeRabbit review — suppress cache + ID validation

a65df54

- Suppress action now checks RowsAffected (not found = failed) - Cache invalidation extended to suppress action (was archive-only) - Unified ID validation in memory_forget: validate before branching

coderabbitai Bot requested changes Mar 28, 2026

View reviewed changes

Comment thread plugin/openclaw-engram/src/hooks/before-tool-call.ts Outdated

thebtf merged commit 8c73123 into main Mar 28, 2026
1 of 2 checks passed

This was referenced Apr 9, 2026

feat: gstack insights integration — source-aware decay, agent tracking, hit rate analytics #132

Merged

feat: agent issues — cross-project issue tracking for AI agents #133

Merged

coderabbitai Bot mentioned this pull request Apr 12, 2026

chore: v3.7.1 release changelog #136

Merged

2 tasks

thebtf deleted the feat/plugin-tool-consolidation branch May 7, 2026 06:34

		- Test: timeout scenario → no error propagated

		PR: `feat: openclaw lifecycle hooks (outcome, utility, file context)`

		const identity = resolveIdentity(ctx.agentId ?? '', ctx.workspaceDir);
		const project = config.project ?? identity.projectId;

Conversation

thebtf commented Mar 28, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Phase 1: Server — FR-1 (Remove 7 Redundant Tools)

Phase 2: OpenClaw — FR-2 (Bug Fixes)

Phase 3: OpenClaw — FR-3 (Expand Tools: 8 → 17)

Phase 5: CC Plugin — FR-5+FR-6

Deferred: Phase 4 (OpenClaw lifecycle hooks — FR-4)

Test plan

Summary by CodeRabbit

Примечания к выпуску

Uh oh!

gemini-code-assist Bot commented Mar 28, 2026

Uh oh!

coderabbitai Bot commented Mar 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review failed

Reviews paused

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Поэма

❌ Failed checks (1 warning)

Uh oh!

thebtf commented Mar 28, 2026

Uh oh!

thebtf commented Mar 28, 2026

Uh oh!

coderabbitai Bot commented Mar 28, 2026

Uh oh!

thebtf commented Mar 28, 2026

Uh oh!

gemini-code-assist Bot commented Mar 28, 2026

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Mar 28, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai Bot Mar 28, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai Bot Mar 28, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot Mar 28, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Mar 28, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Mar 28, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Mar 28, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Mar 28, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

thebtf commented Mar 28, 2026

Uh oh!

coderabbitai Bot commented Mar 28, 2026

thebtf commented Mar 28, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Mar 28, 2026 •

edited

Loading