[NoQA] Replace err-on-side-of-flagging with self-critique gate in code reviewer#85068
Closed
kacper-mikolajczak wants to merge 2 commits intoExpensify:mainfrom
Closed
Conversation
The EXCEPTION clause telling the model to include uncertain violations was contributing to self-contradicting review comments (17-23% of output). Replace it with a self-critique gate that instructs the model to silently omit any violation it is not confident about, and to never include deliberation or hedging language in comment bodies.
Contributor
Author
|
Closing - this approach re-invents prior attempts and doesn't target the root cause. Will investigate a different approach. |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Explanation of Change
The AI code reviewer currently has an EXCEPTION clause telling it to "err on the side of including" uncertain violations. This causes ~17-23% of review comments to contain visible self-contradiction - the model flags a violation then talks itself out of it in the same comment ("actually this is correct", "upon re-reading, this is fine").
A previous attempt (PR #83184) added a "reality check" instruction that told the model to re-read and confirm violations before posting. This was reported to have the opposite effect - making the model deliberate more visibly in its output.
This PR replaces the EXCEPTION clause with a "self-critique gate" instruction that:
The key behavioral shift: "omit if not confident" instead of "check if correct."
Companion PRs: Auth and Web-Expensify (same change adapted for inline comment posting).
Fixed Issues
$ https://github.com/Expensify/Expensify/issues/605351
PROPOSAL:
Tests
Offline tests
N/A - AI reviewer agent prompt, no offline behavior.
QA Steps
// TODO: These must be filled out, or the issue title must include "[No QA]."
N/A - AI reviewer agent configuration change. No user-facing UI changes.
PR Author Checklist
### Fixed Issuessection aboveTestssectionOffline stepssectionQA stepssectiontoggleReportand notonIconClick)src/languages/*files and using the translation methodSTYLE.md) were followedAvatar, I verified the components usingAvatarare working as expected)StyleUtils.getBackgroundAndBorderStyle(theme.componentBG))npm run compress-svg)Avataris modified, I verified thatAvataris working as expected in all cases)Designlabel and/or tagged@Expensify/designso the design team can review the changes.ScrollViewcomponent to make it scrollable when more elements are added to the page.mainbranch was merged into this PR after a review, I tested again and verified the outcome was still expected according to theTeststeps.Screenshots/Videos
Android: Native
Android: mWeb Chrome
iOS: Native
iOS: mWeb Safari
MacOS: Chrome / Safari