Skip to content

feat: Add SCTBenchScenario#12

Open
MiguelAFH wants to merge 1 commit intomainfrom
sct-bench
Open

feat: Add SCTBenchScenario#12
MiguelAFH wants to merge 1 commit intomainfrom
sct-bench

Conversation

@MiguelAFH
Copy link
Copy Markdown
Collaborator

Added support for SCT-Bench. As the original benchmark, we added support for:

  • reason (false by default): If true, asks the model to give a brief explanation to their response. Otherwise, not explanation is asked.
  • few_shot (false by default): If true, provides the model with the predefined few-shot examples from the original SCT-Bench. Otherwise, the task is zero-shot.

NOTE: Currently, SCT-Bench has open-sourced 174/750 questions. This PR uses the public examples as the test split.

@MiguelAFH MiguelAFH requested a review from blidiselalin April 30, 2026 20:56
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant