CLI benchmark suite for LLM providers and OpenAI-compatible gateways. Measure TTFT, latency, p95, throughput, warmup, and history.
Updated Mar 19, 2026 · Python
The only voice agent context manager with a TTFT feedback loop
LLM inference benchmarking toolkit. Measure TTFT, inter-token latency, throughput, and P50–P99 across concurrency levels.
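For readers new to the metrics named above, here is a minimal sketch of how they are commonly derived from per-token arrival timestamps. It is not taken from any of the listed projects: the function names `stream_metrics` and `percentile`, the dictionary keys, and the nearest-rank percentile method are all assumptions chosen for illustration.

```python
import math
from typing import Dict, List


def percentile(values: List[float], q: float) -> float:
    """Nearest-rank percentile (q in (0, 100]) of a non-empty list."""
    ordered = sorted(values)
    rank = max(1, math.ceil(q / 100 * len(ordered)))
    return ordered[rank - 1]


def stream_metrics(request_start: float, token_times: List[float]) -> Dict[str, float]:
    """Summarize one streamed response from token arrival timestamps (seconds).

    Hypothetical helper: request_start is when the request was sent,
    token_times is when each streamed token arrived, in order.
    """
    if not token_times:
        raise ValueError("need at least one token timestamp")
    ttft = token_times[0] - request_start       # time to first token
    total = token_times[-1] - request_start     # end-to-end latency
    # Inter-token latency: gaps between consecutive token arrivals.
    gaps = [b - a for a, b in zip(token_times, token_times[1:])]
    return {
        "ttft_s": ttft,
        "latency_s": total,
        "mean_itl_s": sum(gaps) / len(gaps) if gaps else 0.0,
        "tokens_per_s": len(token_times) / total if total > 0 else 0.0,
    }
```

In a benchmark run, one would typically collect `ttft_s` from many requests at each concurrency level and then report, for example, `percentile(ttfts, 50)` and `percentile(ttfts, 99)`.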