Skip to content

feat(reflection): add cross-model review between claude-opus and gpt-codex#119

Merged
dzianisv merged 1 commit intomainfrom
feat/cross-model-review
Feb 16, 2026
Merged

feat(reflection): add cross-model review between claude-opus and gpt-codex#119
dzianisv merged 1 commit intomainfrom
feat/cross-model-review

Conversation

@dzianisv
Copy link
Owner

Summary

  • When reflection self-assessment returns complete, the plugin now invokes the opposite frontier model (claude-opus-4.6 ↔ gpt-5.2-codex) in an ephemeral session to independently review the changes. The cross-review result is saved in the .reflection/<session>_<timestamp>.json artifact.
  • Adds a detailed ASCII state graph documenting the full reflection flow to both docs/reflection.md and README.md.
  • Adds missing INTERNAL_SESSION_MARKERS entries (SELF-ASSESS REFLECTION-3, REVIEW REFLECTION-3 COMPLETION) to tts.ts, telegram.ts, and telegram.test-helpers.ts so those plugins correctly skip all reflection-internal sessions.

Changes

File What
reflection-3.ts parseModelSpec(), getCrossReviewModelSpec(), runCrossModelReview(), wired into completion path, updated isJudgeSession()
reflection-3.test-helpers.ts Exported test-safe copies of parseModelSpec and getCrossReviewModelSpec
tts.ts / telegram.ts / telegram.test-helpers.ts Added 2 new markers to INTERNAL_SESSION_MARKERS
test/reflection-3.unit.test.ts Unit tests for cross-model routing logic
test/telegram-session-filter.test.ts Tests for the new session markers
docs/reflection.md / README.md ASCII state graph diagram

Testing

  • npm test — 326 passed, 5 skipped, 8 suites green
  • npm run eval:judge — 23/23 passed
  • npm run eval:stuck — 12/13 passed (1 pre-existing borderline case)
  • npm run eval:compression — 12/12 passed

…codex

When self-assessment completes with 'complete' verdict, the plugin now
invokes the opposite frontier model (claude-opus-4.6 ↔ gpt-5.2-codex)
in an ephemeral session to independently review the changes. The cross-
review result is saved alongside the reflection artifact.

Also adds:
- Detailed ASCII state graph to docs/reflection.md and README.md
- Missing INTERNAL_SESSION_MARKERS for SELF-ASSESS and REVIEW prefixes
  in tts.ts, telegram.ts, and telegram.test-helpers.ts
- Unit tests for parseModelSpec and getCrossReviewModelSpec
- Telegram filter tests for the new session markers
@dzianisv dzianisv merged commit fcb404f into main Feb 16, 2026
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant

Comments