Create "docker model bench MODEL" #480

Closed

ericcurtin/model-runner

Assignees

Labels

good first issue

opened

It should be able to output the Tokens per Second of any give model with 1, 2, 4 and 8 concurrent requests. Provide a hyperfine-like experience.

Metadata

Assignees

Copilot

Labels

good first issue

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests