Skip to content

Build zero-shot evaluation pipeline and run baseline execution #43

@eemilkos

Description

@eemilkos

This parent issue covers the implementation work needed to run baseline zero-shot evaluations on selected LLMs.

Sub-issues under this parent should define the shared API interface, response logging, scoring, and execution of the zero-shot runs.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    Status

    Master Backlog

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions