brier-score

Here are 3 public repositories matching this topic...

Decision-safe evaluation + Streamlit dashboard for AI vs Human vs Post-Edited AI text detection. Generates a reliability report card (Accuracy, Macro F1, ECE, Brier), calibration plots, confidence histograms, and a coverage-vs-performance abstention curve. Recommends an operating threshold for human-review routing.

  • Updated Feb 14, 2026
  • Python

Production-style A/B testing with binomial GLMs (logit/probit): covariate adjustment, marginal ATE/risks, cluster-robust SEs, and Brier-score calibration.

  • Updated Feb 2, 2026
  • Python
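
Both descriptions above report the Brier score, and the first also reports ECE (expected calibration error). As a rough illustration of what those two metrics measure, here is a minimal pure-Python sketch for the binary case. It is not taken from either repository: the function names `brier_score` and `expected_calibration_error`, the equal-width binning, and the default of 10 bins are all assumptions made for this example, and a multi-class detector like the one in the first description would more likely use a top-label or per-class variant.

```python
def brier_score(probs, labels):
    """Mean squared difference between predicted probability and 0/1 outcome.

    Hypothetical helper for illustration; lower is better, 0.0 is perfect.
    """
    if not probs or len(probs) != len(labels):
        raise ValueError("probs and labels must be non-empty and equal length")
    return sum((p - y) ** 2 for p, y in zip(probs, labels)) / len(probs)


def expected_calibration_error(probs, labels, n_bins=10):
    """Binary ECE with equal-width bins over the predicted probability.

    Assumption: each bin compares the mean predicted probability with the
    observed positive rate, weighted by the share of samples in that bin.
    """
    if not probs or len(probs) != len(labels):
        raise ValueError("probs and labels must be non-empty and equal length")

    bins = [[] for _ in range(n_bins)]
    for p, y in zip(probs, labels):
        # Clamp so that p == 1.0 falls into the last bin.
        idx = min(int(p * n_bins), n_bins - 1)
        bins[idx].append((p, y))

    total = len(probs)
    ece = 0.0
    for members in bins:
        if not members:
            continue  # empty bins contribute nothing
        mean_prob = sum(p for p, _ in members) / len(members)
        positive_rate = sum(y for _, y in members) / len(members)
        ece += (len(members) / total) * abs(mean_prob - positive_rate)
    return ece


if __name__ == "__main__":
    # Toy predictions, invented purely for this example.
    predicted = [0.9, 0.2, 0.7]
    observed = [1, 0, 1]
    print("Brier:", brier_score(predicted, observed))
    print("ECE:  ", expected_calibration_error(predicted, observed))
```

The Brier score penalizes every prediction by its squared error, so it mixes calibration and sharpness, while ECE only looks at the gap between stated confidence and observed frequency within each bin. That is one plausible reason a report card would show both.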
