Skip to content

Add dashboard test type filters and charts#22

Merged
AustinKelsay merged 1 commit intomainfrom
staging
Mar 30, 2026
Merged

Add dashboard test type filters and charts#22
AustinKelsay merged 1 commit intomainfrom
staging

Conversation

@AustinKelsay
Copy link
Copy Markdown
Owner

@AustinKelsay AustinKelsay commented Mar 30, 2026

Summary by CodeRabbit

Release Notes

  • New Features

    • Added "Test Type" filter to the leaderboard for granular data filtering.
    • Added "By Test Type" breakdown tabs to performance charts for enhanced visualization.
  • Improvements

    • Updated tooltip descriptions to include test type information in chart documentation.
    • Improved filter layout to accommodate the new filtering option.

@vercel
Copy link
Copy Markdown

vercel Bot commented Mar 30, 2026

The latest updates on your projects. Learn more about Vercel for GitHub.

Project Deployment Actions Updated (UTC)
plebdev-bench-dashboard Ready Ready Preview, Comment Mar 30, 2026 0:48am

Request Review

@AustinKelsay AustinKelsay merged commit 9994878 into main Mar 30, 2026
2 of 3 checks passed
@coderabbitai
Copy link
Copy Markdown

coderabbitai Bot commented Mar 30, 2026

Caution

Review failed

The pull request is closed.

ℹ️ Recent review info
⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 77bb3728-6018-4423-922b-6ef6ee49bfc8

📥 Commits

Reviewing files that changed from the base of the PR and between 0807904 and b3e1910.

📒 Files selected for processing (9)
  • apps/dashboard/src/components/charts/composite-score-chart.tsx
  • apps/dashboard/src/components/charts/pass-rate-chart.tsx
  • apps/dashboard/src/components/leaderboard/leaderboard-filters.ts
  • apps/dashboard/src/components/leaderboard/leaderboard-page.tsx
  • apps/dashboard/src/components/run-detail/dimension-detail-dialog.tsx
  • apps/dashboard/src/lib/aggregations-core.ts
  • apps/dashboard/src/lib/tooltip-content.ts
  • apps/dashboard/src/lib/utils.ts
  • test/dashboard-test-type.test.ts

📝 Walkthrough

Walkthrough

This PR introduces a new "test type" grouping dimension across the dashboard. It adds an aggregation utility, updates multiple chart and filter components to support test type grouping, implements formatting for category labels, and includes comprehensive test coverage for the new functionality.

Changes

Cohort / File(s) Summary
Core Aggregation
apps/dashboard/src/lib/aggregations-core.ts
Added new groupByTestType() utility function that groups MatrixItemResult items by their category field, with fallback to "uncategorized".
Chart Components
apps/dashboard/src/components/charts/composite-score-chart.tsx, apps/dashboard/src/components/charts/pass-rate-chart.tsx
Added test type tabs and grouping to both composite and pass-rate charts; extended onDimensionClick callback to include "testType" dimension; updated component signatures to handle test type grouping and formatting.
Dimension Dialog
apps/dashboard/src/components/run-detail/dimension-detail-dialog.tsx
Extended DimensionType to include testType; added filtering and formatting for test type dimension; added test type sub-dimension breakdown with optional name formatting callback.
Leaderboard Filtering
apps/dashboard/src/components/leaderboard/leaderboard-filters.ts, apps/dashboard/src/components/leaderboard/leaderboard-page.tsx
Extended FilterState interface and initialization to include testType field; implemented test type filter logic deriving type from item.category; added test type filter dropdown to leaderboard UI with adjusted grid layout.
Utilities & Formatting
apps/dashboard/src/lib/utils.ts
Added new formatTestCategoryLabel() function that converts category slugs (e.g., "computer-use""Computer Use") to user-readable labels.
Tooltip Updates
apps/dashboard/src/lib/tooltip-content.ts
Updated composite chart tooltip text to mention "test types" alongside existing dimensions.
Test Coverage
test/dashboard-test-type.test.ts
Added comprehensive test suite with fixture builders validating test type grouping computations and leaderboard filtering behavior.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Possibly related PRs

Poem

🐰 A new dimension hops into view,
Test types grouped where none once grew,
Categories formatted, filters refined,
Dashboard dimensions perfectly aligned! ✨

✨ Finishing Touches
📝 Generate docstrings
  • Create stacked PR
  • Commit on current branch
🧪 Generate unit tests (beta)
  • Create PR with unit tests
  • Commit unit tests in branch staging

Comment @coderabbitai help to get the list of available commands and usage tips.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant