FEAT: Threshold and ratio configuration and testing file for optimal threshold and ratio configuration by rafainn · Pull Request #21 · Roblox/Sentinel

rafainn · 2025-08-18T13:29:08Z

Built upon pull request #7

This pull request introduces significant improvements to the Sentinel library, focusing on aggregation flexibility, explainability, and performance optimizations. The README is updated to document new aggregation strategies and explainability features, and the codebase now exposes multiple aggregation functions for scoring, adds per-text explanations, and improves model caching and negative sample ratio handling.

Aggregation and Explainability Enhancements:

Added multiple aggregation strategies (skewness, top_k_mean, percentile_score, softmax_weighted_mean, max_score) for combining observation scores, with documentation and usage examples in README.md. [1] [2] [3] [4]
Introduced per-text explainability in results, including top-K positive/negative similarities, contrastive components, and neighbor snippets, as shown in the updated RareClassAffinityResult dataclass and README usage examples. [1] [2]
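For readers unfamiliar with these strategies, here is a minimal pure-Python sketch of what four of them could look like. It is not the code in this PR: the function bodies, default parameters, and the choice to return 0.0 for empty input are assumptions made for illustration, and skewness is left out because its exact definition is not given here.

```python
import math


def top_k_mean(scores, k=3):
    """Mean of the k largest scores; 0.0 for empty input (assumed convention)."""
    scores = list(scores)
    if not scores:
        return 0.0
    top = sorted(scores, reverse=True)[:max(1, k)]
    return sum(top) / len(top)


def percentile_score(scores, q=90.0):
    """q-th percentile with linear interpolation between neighbours."""
    ordered = sorted(scores)
    if not ordered:
        return 0.0
    pos = (len(ordered) - 1) * q / 100.0
    lo, hi = math.floor(pos), math.ceil(pos)
    frac = pos - lo
    return ordered[lo] * (1 - frac) + ordered[hi] * frac


def softmax_weighted_mean(scores, temperature=1.0):
    """Mean weighted by softmax(score / temperature).

    A temperature <= 0 is treated as the hard-max limit (an assumption).
    """
    scores = list(scores)
    if not scores:
        return 0.0
    peak = max(scores)
    if temperature <= 0:
        return peak
    # Subtract the peak before exponentiating for numerical stability.
    weights = [math.exp((s - peak) / temperature) for s in scores]
    total = sum(weights)
    return sum(w * s for w, s in zip(weights, scores)) / total


def max_score(scores):
    """Largest single score; 0.0 for empty input (assumed convention)."""
    scores = list(scores)
    return max(scores) if scores else 0.0
```

Lower temperatures push softmax_weighted_mean toward max_score, while higher temperatures push it toward the plain mean, which is why the two are often compared when tuning.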

Performance and Robustness Improvements:

Implemented global caching for SentenceTransformer models in src/sentinel/embeddings/sbert.py to avoid redundant loading, with cache management utilities. Global caching appears to have reduced the load time for ~300 conversations from 12.3s to 3.5s.

This includes a Cache_Model input to calculate_rare_class_affinity in src/sentinel/sentinel_local_index.py, so caching can be enabled or disabled easily depending on space constraints and model requirements.

Improved handling of negative-to-positive ratio when loading indices, including error handling and preserving original ratios when needed. [1] [2] [3]
Created a detailed testing file, test_thresholds_and_ratios in examples/Example_Threshold_Script.py, that measures how these changes affect performance and how different ratios and temperatures affect detection. The results suggest that temperatures of 0.00 and 0.01 with ratios of 2-4:1 give the best accuracy with minimal false positives.
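The global model cache mentioned above can be pictured as a module-level dictionary keyed by model name. The sketch below is an assumption about the general shape, not the code in sbert.py: get_model, clear_model_cache, cached_model_names, and the loader argument are hypothetical names, and a stand-in loader is used so the example runs without downloading a model. In real use the loader would typically be the SentenceTransformer constructor.

```python
from typing import Any, Callable, Dict, List

# Module-level cache shared by every caller in the process (hypothetical name).
_MODEL_CACHE: Dict[str, Any] = {}


def get_model(name: str, loader: Callable[[str], Any],
              use_cache: bool = False) -> Any:
    """Return a model, loading it at most once per name when caching is on.

    Caching is off by default here, matching the later commit that made
    caching opt-in to preserve the old behaviour.
    """
    if not use_cache:
        return loader(name)
    if name not in _MODEL_CACHE:
        _MODEL_CACHE[name] = loader(name)
    return _MODEL_CACHE[name]


def clear_model_cache() -> None:
    """Drop every cached model, e.g. when memory is tight."""
    _MODEL_CACHE.clear()


def cached_model_names() -> List[str]:
    """Names of the models currently held in the cache."""
    return sorted(_MODEL_CACHE)


# Stand-in loader that records how often it is called.
load_calls: List[str] = []


def fake_loader(name: str) -> object:
    load_calls.append(name)
    return object()


first = get_model("example-model", fake_loader, use_cache=True)
second = get_model("example-model", fake_loader, use_cache=True)
# The second call is served from the cache, so the loader ran only once.
```

The trade-off is memory: cached models stay resident until clear_model_cache is called, which is why an explicit on/off switch is useful.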

API and Documentation Updates:

Updated __init__.py to expose new aggregation functions in the public API.
Enhanced documentation and comments for scoring functions and result types, clarifying their purpose and usage. [1] [2]

These changes collectively make Sentinel more configurable, interpretable, and efficient for diverse deployment scenarios.
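To make the threshold analysis concrete, the following sketch shows one way a threshold sweep over scored, labeled examples could be written. It is not taken from Example_Threshold_Script.py: the function names are hypothetical and the scores and labels are toy values invented purely to exercise the code.

```python
def confusion_at(scores, labels, threshold):
    """Count (tp, fp, tn, fn) when flagging every score >= threshold."""
    tp = fp = tn = fn = 0
    for score, is_positive in zip(scores, labels):
        flagged = score >= threshold
        if flagged and is_positive:
            tp += 1
        elif flagged:
            fp += 1
        elif is_positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, tn, fn


def sweep_thresholds(scores, labels, thresholds):
    """Evaluate each candidate threshold and collect summary metrics."""
    rows = []
    for threshold in thresholds:
        tp, fp, tn, fn = confusion_at(scores, labels, threshold)
        total = tp + fp + tn + fn
        rows.append({
            "threshold": threshold,
            "accuracy": (tp + tn) / total if total else 0.0,
            "false_positives": fp,
            "false_negatives": fn,
        })
    return rows


def best_threshold(rows):
    """Prefer the highest accuracy, breaking ties by fewer false positives."""
    return max(rows, key=lambda r: (r["accuracy"], -r["false_positives"]))


# Toy data for illustration only; not results from this PR.
toy_scores = [0.05, 0.10, 0.30, 0.60, 0.80]
toy_labels = [False, False, False, True, True]
rows = sweep_thresholds(toy_scores, toy_labels, [0.0, 0.2, 0.5, 0.9])
best = best_threshold(rows)
```

Repeating the same sweep for indices loaded with different negative-to-positive ratios would give a grid of results like the one the testing script is described as producing.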

rafainn · 2025-08-18T13:31:41Z

Note: All 20 tests passed, with two warnings about the pytest configuration, shown below. I am unsure whether this is due to an outdated version of the pytest library or whether these config keys have been deprecated.
".venv\Lib\site-packages\_pytest\config\__init__.py:1441
D:\PROGRAMMING HHD\Sentinel\Sentinel\.venv\Lib\site-packages\_pytest\config\__init__.py:1441: PytestConfigWarning: Unknown config option: showlocals

  self._warn_or_fail_if_strict(f"Unknown config option: {key}\n")

.venv\Lib\site-packages\_pytest\config\__init__.py:1441
D:\PROGRAMMING HHD\Sentinel\Sentinel\.venv\Lib\site-packages\_pytest\config\__init__.py:1441: PytestConfigWarning: Unknown config option: verbose

  self._warn_or_fail_if_strict(f"Unknown config option: {key}\n")

-- Docs: https://docs.pytest.org/en/stable/how-to/capture-warnings.html"

… load time of model exponentially after the first caching. TESTS: Updated embedding tests to include caching and its management functions FEAT: Updated the example script for testing purposes to include caching mechanics

- Apply PEP 8 formatting to Example_Threshold_Script.py
- Update embeddings.safetensors
- Update sentinel_against_hate.ipynb
- Fixed line length violations (max 79 characters)
- Corrected indentation and spacing
- Enhanced readability while maintaining functionality

rafainn · 2026-02-19T19:43:20Z

@vcai4071 all requested changes have been made

vcai4071

LGTM, thanks for the great additions!

leoRblx · 2026-05-06T20:16:46Z

@rafainn Can you take a look at the failing test? I will merge once the tests are passing.
Thx

rafainn · 2026-05-07T22:21:29Z

@leoRblx Would you be able to run the tests again? This should fix the build issue it was displaying earlier; however, I am unsure whether there will be any further conflicts.

ch1kim0n1 and others added 4 commits August 15, 2025 12:44

improved systems

c3dc281

FIX: Refactor negative_to_positive_ratio parameter to be optional in SentinelLocalIndex. FEAT: created a testing tool for best threshold and ratio analysis

4ba5790

DOCS: Add section on testing optimal thresholds and data ratios in README

74397ba

FEAT: Added a review mode for detailed information regarding each case with optional flags. DOCS: Updated relevant documentation with these fixes

87ad66a

rafainn added 14 commits August 18, 2025 20:14

FEAT: Enhance threshold testing with performance metrics and results-only mode

df18d4f

FEAT: Set default negative_to_positive_ratio to 5.0 and add error handling and fallback for ratio adjustments

edbdc4d

FIX: Update user profile creation to exclude sexual content examples

f3c08a8

Feat: Adjusts message metrics dynamically

FIX: Comment out sexual content examples in user profile creation to prevent inclusion

41f32e8

DOCS: Fixed formatting for better PEP8 format

f17a8ca

FEAT: Add caching option to SentinelLocalIndex for improved load performance

7ee49f8

REFACTOR: Refactor user profile creation to use external test data for speech examples

b638046

TEST: Update mean_of_positives tests to return 0.0 for negative and empty score arrays, fixes edge case, NaN returns.

e3277f1

CHORE: Remove redundant Cache_Model variable in `calculate_rare_class_affinity`, update example file to use path/to/index rather than local path

1b6a2ee

TEST: Add edge case tests for aggregation functions and contrastive_components in score_formulae and SentinelLocalIndex

d63b8ce

Chore: removed redundant imports

ff7061c

chore: Changed caching to be false by default, to preserve old functionality, removed redundant exports, added no-cache flag to the testing script

b551f1c

vcai4071 reviewed Feb 19, 2026

Three comment threads on src/sentinel/sentinel_local_index.py (outdated)

rafainn added 2 commits February 19, 2026 19:27

All requested changes made

0b5a487

fix capitalisation issue

32d644a

rafainn requested a review from vcai4071 February 19, 2026 19:45

vcai4071 approved these changes Feb 19, 2026

This should fix the build issues for CI tests, as pandas ^1.0.0 didn't have support for PEP 517 builds, hence swapped to ^2.0.0, which is compatible. May require further testing; however, it didn't impact functionality of the code

e8535cd

rafainn wants to merge 21 commits into Roblox:main from rafainn:Threshold-and-ratio-configuration

4 participants
