feat(core): enhanced anchored iterative context compression with self-verification by rmedranollamas · Pull Request #15710 · google-gemini/gemini-cli

rmedranollamas · 2025-12-30T12:15:18Z

Summary

This PR implements an enhanced context compression mechanism inspired by Factory.ai's research on evaluating context compression for AI agents. It shifts from simple summarization to a structured, anchored, and self-verifying distillation process.

Details

Structured XML Snapshot: Updated the <state_snapshot> schema in packages/core/src/core/prompts.ts to include:
- <artifact_trail>: Tracks the evolution of critical files and symbols.
- <active_constraints>: Explicitly preserves user-defined or discovered constraints.
- <task_state>: Refined from 'plan' to emphasize continuity and the immediate next focus.
Anchored Iterative Logic: ChatCompressionService now detects existing snapshots in history and instructs the model to integrate and update them rather than starting from scratch, preventing 'information drift'.
Self-Verification Turn ('Probe'): Implemented a two-turn process. After the initial summary, a second turn asks the model to critique its own summary against the original history and provide a refined version if any technical details or constraints were omitted.
Improved Test Coverage: Updated existing tests and added new cases to verify the two-turn flow and anchored integration.

Related Issues

Closes: #16500

How to Validate

Run the core unit tests:

npx vitest packages/core/src/services/chatCompressionService.test.ts

Pre-Merge Checklist

Updated relevant documentation and README (if needed)
Added/updated tests (if needed)
Noted breaking changes (if any)
Validated on required platforms/methods:
- Linux
  - npm run

gemini-code-assist · 2025-12-30T12:15:40Z

Summary of Changes

Hello @rmedranollamas, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly upgrades the core context compression capabilities for AI agents, introducing a more robust and intelligent system for managing conversational state. It integrates a self-correction mechanism and anchored updates to existing snapshots, ensuring critical information is retained and refined over time. Additionally, it provides new configuration options for automatic management of MCP servers and experimental model routing, enhancing the system's operational resilience and flexibility.

Highlights

Enhanced Context Compression: Implemented a new context compression mechanism for AI agents, moving beyond simple summarization to a structured, anchored, and self-verifying distillation process, inspired by Factory.ai's research.
Structured XML Snapshot Schema Update: The <state_snapshot> schema in packages/core/src/core/prompts.ts has been updated to include new elements like <artifact_trail> (for tracking file/symbol evolution), <active_constraints> (for preserving explicit rules), and a refined <task_state>.
Anchored Iterative Logic: The ChatCompressionService now intelligently detects existing state snapshots in the conversation history and instructs the model to integrate and update them, preventing 'information drift' rather than generating new summaries from scratch.
Self-Verification Turn ('Probe'): A two-turn compression process has been introduced where, after an initial summary, the model is prompted to critique its own output against the original history and provide a refined version if any critical details were omitted.
MCP Server Auto-Restart Configuration: New configuration options (mcp.autoRestart.enabled, mcp.autoRestart.healthCheckIntervalMs, mcp.autoRestart.unhealthyTimeoutMs) have been added to manage the automatic restarting of MCP servers based on health checks.
Experimental Model Router Settings: New experimental settings (experimental.modelRouter.enabled, experimental.modelRouter.simpleTaskModel, experimental.modelRouter.complexTaskModel) are introduced to enable and configure model routing based on task complexity.

🧠 New Feature in Public Preview: You can now enable Memory to help Gemini Code Assist learn from your team's feedback. This makes future code reviews more consistent and personalized to your project's style. Click here to enable Memory in your admin console.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request introduces significant enhancements to context compression, including a structured XML snapshot, anchored iteration, and self-verification. It also adds auto-restarting for MCP servers and an experimental model router. My review focuses on potential race conditions and resource management. I've identified a critical issue in the MCP client manager related to unsafe iteration and a high-severity issue in the chat compression service concerning uncancellable background tasks. Addressing these will improve the robustness and efficiency of the new features.

mrcabbage972 · 2026-01-09T22:28:46Z

Very interesting PR! Can you please open a corresponding issue for tracking?

Also, it seems that the PR combines a few different issues: memory compression, mcp auto-restart, model router configuration. It would be easier to review if they are split to individual PR's.

rmedranollamas · 2026-01-13T10:10:14Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces an enhanced context compression mechanism with a two-turn self-verification process and anchored iterative logic, which is a significant improvement. However, a high-severity prompt injection vulnerability has been identified in the new chat compression logic, where direct use of chat history in prompts could allow an attacker to manipulate the agent's memory and behavior. Additionally, there are two high-severity issues: one related to handling potentially empty summaries that could corrupt chat history, and another concerning the lack of cancellation support for the compression process, potentially leading to resource leaks and race conditions. Addressing these points will make the new feature more robust, reliable, and secure.

rmedranollamas · 2026-01-13T10:44:14Z

filled #16500 for this work, and cleaned the PR up. rebased and passed tests.

bdmorgan · 2026-01-15T03:33:40Z

Hi @rmedranollamas, thank you so much for your contribution to Gemini CLI! We really appreciate the time and effort you've put into this.

We're making some updates to our contribution process to improve how we track and review changes. Please take a moment to review our recent discussion post: Improving Our Contribution Process & Introducing New Guidelines.

Key Update: Starting January 26, 2026, the Gemini CLI project will require all pull requests to be associated with an existing issue. Any pull requests not linked to an issue by that date will be automatically closed.

Thank you for your understanding and for being a part of our community!

rmedranollamas · 2026-01-16T08:23:21Z

attached to #16500

NTaylorMullen · 2026-01-19T02:14:05Z

Looks like you may have some test failures here too

rmedranollamas · 2026-01-19T10:52:25Z

/gemini review

- Add security warnings against prompt injection in compression prompt. - Clarify artifact_trail purpose in compression prompt. - Fix unsafe iteration in McpClientManager.restart. - Ensure all chatCompressionService tests use correct tokenLimit mocks.

rmedranollamas · 2026-01-19T10:56:08Z

I've addressed the review comments:

Security: Added explicit warnings against prompt injection in the compression prompt.
Robustness: Added checks for empty/whitespace summaries to prevent context corruption.
Cancellation: Ensured abortSignal is correctly propagated through all stages of compression.
MCP Client Manager: Fixed an unsafe iteration in restart() by using Array.from(this.clients.keys()).
Clarity: Updated the artifact_trail description in the prompt to clarify it tracks evolution and reasoning, as suggested by @NTaylorMullen.
Tests: Fixed some test cases that were incorrectly assuming token limits.

All tests passed locally.

gemini-code-assist

Code Review

This pull request introduces a sophisticated enhancement to the chat history compression mechanism, incorporating an anchored, iterative approach with a self-verification step. The changes are well-implemented, with a more structured XML snapshot format and corresponding updates to the compression service and tests. The two-turn verification process is a clever way to improve summary quality. I've identified one high-severity issue in how the final summary is determined, which could lead to discarding a valid summary if the verification step returns only whitespace. Overall, this is a great improvement.

rmedranollamas · 2026-01-19T11:15:45Z

Fixed the subtle bug in finalSummary logic where a whitespace-only verification response would cause compression to fail instead of falling back to the initial valid summary. Added a regression test for this case.

- Add security warnings against prompt injection in compression prompt. - Clarify artifact_trail purpose in compression prompt. - Fix unsafe iteration in McpClientManager.restart. - Ensure all chatCompressionService tests use correct tokenLimit mocks.

rmedranollamas · 2026-01-20T07:59:48Z

Fixed integration test failures by:

Adding telemetry logging for the EMPTY_SUMMARY failure case.
Updating integration test mocks to provide responses for both the initial summarization and the new verification turn.

… with self-verification turn based on Factory.ai research

- Add security warnings against prompt injection in compression prompt. - Clarify artifact_trail purpose in compression prompt. - Fix unsafe iteration in McpClientManager.restart. - Ensure all chatCompressionService tests use correct tokenLimit mocks.

…ication is empty - Address gemini-code-assist review comment regarding subtle fallback bug. - Add regression test for empty verification response.

rmedranollamas · 2026-01-20T08:15:49Z

Rebased onto main and verified. Ready for another look! @NTaylorMullen

…-verification (google-gemini#15710)

…ve context compression with self-verification (google-gemini#15710)

…-verification (google-gemini#15710)

gemini-code-assist Bot reviewed Dec 30, 2025

View reviewed changes

Comment thread packages/core/src/tools/mcp-client-manager.ts Outdated

Comment thread packages/core/src/services/chatCompressionService.ts Outdated

rmedranollamas force-pushed the feature/factory-compression branch from b14ada6 to 5cf14a8 Compare December 30, 2025 13:37

rmedranollamas marked this pull request as ready for review December 30, 2025 13:46

rmedranollamas requested a review from a team as a code owner December 30, 2025 13:46

gemini-cli Bot added the status/need-issue Pull requests that need to have an associated issue. label Jan 7, 2026

mrcabbage972 added the area/agent Issues related to Core Agent, Tools, Memory, Sub-Agents, Hooks, Agent Quality label Jan 9, 2026

rmedranollamas force-pushed the feature/factory-compression branch from 611b187 to b3f1342 Compare January 13, 2026 10:05

gemini-code-assist Bot reviewed Jan 13, 2026

View reviewed changes

Comment thread packages/core/src/services/chatCompressionService.ts Outdated

NTaylorMullen approved these changes Jan 19, 2026

View reviewed changes

Comment thread packages/core/src/core/prompts.ts

rmedranollamas force-pushed the feature/factory-compression branch from ffbc0ef to 380afed Compare January 19, 2026 10:50

gemini-code-assist Bot reviewed Jan 19, 2026

View reviewed changes

Comment thread packages/core/src/services/chatCompressionService.ts

NTaylorMullen enabled auto-merge January 19, 2026 20:39

NTaylorMullen approved these changes Jan 19, 2026

View reviewed changes

gemini-cli Bot removed the status/need-issue Pull requests that need to have an associated issue. label Jan 19, 2026

NTaylorMullen force-pushed the feature/factory-compression branch from 50fbdce to f3b47eb Compare January 19, 2026 22:10

auto-merge was automatically disabled January 20, 2026 07:59
Head branch was pushed to by a user without write access

feat(core): implement enhanced anchored iterative context compression…

c16851b

… with self-verification turn based on Factory.ai research

rmedranollamas added 5 commits January 20, 2026 09:12

Cleanup: remove abortSignal passing to keep core client unchanged

b573ebb

feat(core): handle empty summaries in chat compression

39f4063

fix(core): ensure finalSummary falls back to initial summary if verif…

6de64aa

…ication is empty - Address gemini-code-assist review comment regarding subtle fallback bug. - Add regression test for empty verification response.

fix(core): resolve integration test failures on Windows

a5ce377

rmedranollamas force-pushed the feature/factory-compression branch from db5d831 to a5ce377 Compare January 20, 2026 08:15

NTaylorMullen approved these changes Jan 20, 2026

View reviewed changes

NTaylorMullen enabled auto-merge January 20, 2026 08:35

NTaylorMullen added this pull request to the merge queue Jan 20, 2026

Merged via the queue into google-gemini:main with commit 1182168 Jan 20, 2026
25 checks passed

rmedranollamas deleted the feature/factory-compression branch January 20, 2026 08:55

Thomas-Shephard pushed a commit to Thomas-Shephard/gemini-cli that referenced this pull request Jan 21, 2026

feat(core): enhanced anchored iterative context compression with self…

e02abeb

…-verification (google-gemini#15710)

thacio added a commit to thacio/auditaria that referenced this pull request Jan 24, 2026

Merge(Auto) Commit '1182168bd': feat(core): enhanced anchored iterati…

4957519

…ve context compression with self-verification (google-gemini#15710)

kuishou68 pushed a commit to iOfficeAI/aioncli that referenced this pull request Feb 27, 2026

feat(core): enhanced anchored iterative context compression with self…

d91c9e6

…-verification (google-gemini#15710)

kimjune01 mentioned this pull request Mar 21, 2026

feat: Union-find context compaction as alternative to flat summarization #22877

Open

kimjune01 mentioned this pull request Apr 17, 2026

feat(core): union-find context compaction for AgentHistoryProvider #24736

Open

18 tasks

Conversation

rmedranollamas commented Dec 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Details

Related Issues

How to Validate

Pre-Merge Checklist

Uh oh!

gemini-code-assist Bot commented Dec 30, 2025

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

mrcabbage972 commented Jan 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rmedranollamas commented Jan 13, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

rmedranollamas commented Jan 13, 2026

Uh oh!

bdmorgan commented Jan 15, 2026

Uh oh!

rmedranollamas commented Jan 16, 2026

Uh oh!

Uh oh!

NTaylorMullen commented Jan 19, 2026

Uh oh!

rmedranollamas commented Jan 19, 2026

Uh oh!

rmedranollamas commented Jan 19, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

rmedranollamas commented Jan 19, 2026

Uh oh!

rmedranollamas commented Jan 20, 2026

Uh oh!

rmedranollamas commented Jan 20, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

rmedranollamas commented Dec 30, 2025 •

edited

Loading

mrcabbage972 commented Jan 9, 2026 •

edited

Loading