
[NoQA] Replace err-on-side-of-flagging with self-critique gate in code reviewer#85068

Closed
kacper-mikolajczak wants to merge 2 commits into Expensify:main from kacper-mikolajczak:suppress-self-contradicting-reviews

Conversation

@kacper-mikolajczak
Contributor

Explanation of Change

The AI code reviewer currently has an EXCEPTION clause telling it to "err on the side of including" uncertain violations. This causes roughly 17-23% of review comments to contain visible self-contradiction: the model flags a violation and then talks itself out of it in the same comment ("actually this is correct", "upon re-reading, this is fine").

A previous attempt (PR #83184) added a "reality check" instruction telling the model to re-read and confirm violations before posting. It was reported to have the opposite effect, making the model deliberate even more visibly in its output.

This PR replaces the EXCEPTION clause with a "self-critique gate" instruction that tells the model to:

  • Silently verify each violation before including it
  • Omit any violation where there is doubt (rather than include it with caveats)
  • Never include hedging language ("actually", "upon re-reading", "wait") in violation bodies
  • Frame uncertain cases as "omit" rather than "check if correct", avoiding the deliberation trigger

The key behavioral shift: "omit if not confident" instead of "check if correct."
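As a sketch, the replacement instruction could read along these lines (hypothetical wording for illustration, not the exact prompt text in this PR):

```text
Self-critique gate: before including a violation, silently verify it
against the code. If you are not fully confident the violation is real,
omit it entirely; do not include it with caveats. Never include
deliberation or hedging language ("actually", "upon re-reading",
"wait") in a violation body.
```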

Companion PRs: Auth and Web-Expensify (same change adapted for inline comment posting).

Fixed Issues

$ https://github.com/Expensify/Expensify/issues/605351
PROPOSAL:

Tests

  1. Trigger the AI reviewer on a PR with code that is borderline (could be flagged but is actually correct)
  2. Verify the reviewer does not post self-contradicting comments
  3. Verify that genuine violations are still flagged with clear, definitive language
  4. Verify no hedging phrases ("actually", "upon re-reading", "wait", "however") appear in posted comments
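Step 4 can be approximated with a simple phrase scan over posted comment bodies (a minimal illustration only; the phrase list and helper name are assumptions, not the reviewer's actual verification tooling):

```python
# Minimal sketch of checking posted review comments for hedging phrases
# that indicate visible self-critique. HEDGING_PATTERNS and find_hedging
# are hypothetical names for illustration.
import re

HEDGING_PATTERNS = [
    r"\bactually\b",
    r"\bupon re-reading\b",
    r"\bwait\b",
    r"\bhowever\b",
]

def find_hedging(comment: str) -> list[str]:
    """Return the hedging patterns matched in a review comment body."""
    lowered = comment.lower()
    return [p for p in HEDGING_PATTERNS if re.search(p, lowered)]

comments = [
    "This violates the style guide: callbacks must be named for what they do.",
    "This looks like a violation... actually, upon re-reading, it is fine.",
]
# Only the second, self-contradicting comment should be flagged.
flagged = [c for c in comments if find_hedging(c)]
```

A check like this can be run against the comments the reviewer posts during the borderline-PR test to confirm the gate suppressed the deliberation language.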

Offline tests

N/A - AI reviewer agent prompt, no offline behavior.

QA Steps

// TODO: These must be filled out, or the issue title must include "[No QA]."

N/A - AI reviewer agent configuration change. No user-facing UI changes.

  • Verify that no errors appear in the JS console

PR Author Checklist

  • I linked the correct issue in the ### Fixed Issues section above
  • I wrote clear testing steps that cover the changes made in this PR
    • I added steps for local testing in the Tests section
    • I added steps for the expected offline behavior in the Offline steps section
    • I added steps for Staging and/or Production testing in the QA steps section
    • I added steps to cover failure scenarios (i.e. verify an input displays the correct error message if the entered data is not correct)
    • I turned off my network connection and tested it while offline to ensure it matches the expected behavior (i.e. verify the default avatar icon is displayed if app is offline)
    • I tested this PR with a High Traffic account against the staging or production API to ensure there are no regressions (e.g. long loading states that impact usability).
  • I included screenshots or videos for tests on all platforms
  • I ran the tests on all platforms & verified they passed on:
    • Android: Native
    • Android: mWeb Chrome
    • iOS: Native
    • iOS: mWeb Safari
    • MacOS: Chrome / Safari
  • I verified there are no console errors (if there's a console error not related to the PR, report it or open an issue for it to be fixed)
  • I followed proper code patterns (see Reviewing the code)
    • I verified that any callback methods that were added or modified are named for what the method does and never what callback they handle (i.e. toggleReport and not onIconClick)
    • I verified that comments were added to code that is not self explanatory
    • I verified that any new or modified comments were clear, correct English, and explained "why" the code was doing something instead of only explaining "what" the code was doing.
    • I verified any copy / text shown in the product is localized by adding it to src/languages/* files and using the translation method
      • If any non-english text was added/modified, I used JaimeGPT to get English > Spanish translation. I then posted it in #expensify-open-source and it was approved by an internal Expensify engineer. Link to Slack message:
    • I verified all numbers, amounts, dates and phone numbers shown in the product are using the localization methods
    • I verified any copy / text that was added to the app is grammatically correct in English. It adheres to proper capitalization guidelines (note: only the first word of header/labels should be capitalized), and is either coming verbatim from figma or has been approved by marketing (in order to get marketing approval, ask the Bug Zero team member to add the Waiting for copy label to the issue)
    • I verified proper file naming conventions were followed for any new files or renamed files. All non-platform specific files are named after what they export and are not named "index.js". All platform-specific files are named for the platform the code supports as outlined in the README.
    • I verified the JSDocs style guidelines (in STYLE.md) were followed
  • If a new code pattern is added I verified it was agreed to be used by multiple Expensify engineers
  • I followed the guidelines as stated in the Review Guidelines
  • I tested other components that can be impacted by my changes (i.e. if the PR modifies a shared library or component like Avatar, I verified the components using Avatar are working as expected)
  • I verified all code is DRY (the PR doesn't include any logic written more than once, with the exception of tests)
  • I verified any variables that can be defined as constants (ie. in CONST.ts or at the top of the file that uses the constant) are defined as such
  • I verified that if a function's arguments changed that all usages have also been updated correctly
  • If any new file was added I verified that:
• The file has a description of what it does and/or why it is needed at the top of the file if the code is not self explanatory
  • If a new CSS style is added I verified that:
    • A similar style doesn't already exist
    • The style can't be created with an existing StyleUtils function (i.e. StyleUtils.getBackgroundAndBorderStyle(theme.componentBG))
  • If new assets were added or existing ones were modified, I verified that:
    • The assets are optimized and compressed (for SVG files, run npm run compress-svg)
    • The assets load correctly across all supported platforms.
  • If the PR modifies code that runs when editing or sending messages, I tested and verified there is no unexpected behavior for all supported markdown - URLs, single line code, code blocks, quotes, headings, bold, strikethrough, and italic.
  • If the PR modifies a generic component, I tested and verified that those changes do not break usages of that component in the rest of the App (i.e. if a shared library or component like Avatar is modified, I verified that Avatar is working as expected in all cases)
  • If the PR modifies a component related to any of the existing Storybook stories, I tested and verified all stories for that component are still working as expected.
  • If the PR modifies a component or page that can be accessed by a direct deeplink, I verified that the code functions as expected when the deeplink is used - from a logged in and logged out account.
  • If the PR modifies the UI (e.g. new buttons, new UI components, changing the padding/spacing/sizing, moving components, etc) or modifies the form input styles:
    • I verified that all the inputs inside a form are aligned with each other.
    • I added Design label and/or tagged @Expensify/design so the design team can review the changes.
  • If a new page is added, I verified it's using the ScrollView component to make it scrollable when more elements are added to the page.
  • I added unit tests for any new feature or bug fix in this PR to help automatically prevent regressions in this user flow.
  • If the main branch was merged into this PR after a review, I tested again and verified the outcome was still expected according to the Test steps.

Screenshots/Videos

Android: Native
Android: mWeb Chrome
iOS: Native
iOS: mWeb Safari
MacOS: Chrome / Safari

The EXCEPTION clause telling the model to include uncertain violations
was contributing to self-contradicting review comments (17-23% of output).
Replace it with a self-critique gate that instructs the model to silently
omit any violation it is not confident about, and to never include
deliberation or hedging language in comment bodies.
@kacper-mikolajczak
Contributor Author

Closing - this approach re-invents prior attempts and doesn't target the root cause. Will investigate a different approach.

