-
Notifications
You must be signed in to change notification settings - Fork 156
Nothing to see here #1189
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. Weβll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Nothing to see here #1189
Changes from all commits
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -333,7 +333,7 @@ def test_multi_node_eval_conc_uses_only_conc_values_at_or_above_min_conc(self): | |
| result = mark_eval_entries(matrix_values) | ||
|
|
||
| assert result[0]["run-eval"] is True | ||
| assert result[0]["eval-conc"] == 32 | ||
| assert result[0]["eval-conc"] == 16 | ||
|
Contributor
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. π‘ Nit: After lowering Extended reasoning...What the bug isThe sibling test return sorted(c for c in conc_values if c >= MIN_EVAL_CONC)Before this PR, Why it no longer exercises the threshold pathAfter this PR lowers
Counterfactual proofIf you completely removed the Impact
How to fixLower one entry's Severity rationaleNit. The test passes, no production behavior is wrong, and the multi-node sibling test still covers the threshold guard. The only regression is in test-coverage documentation/guarantee for the single-node path. |
||
| assert result[1]["run-eval"] is False | ||
|
|
||
| def test_marks_highest_and_median_conc(self): | ||
|
|
||
Uh oh!
There was an error while loading. Please reload this page.