### 🔖 Enhancement description

Details to include:

- ✅ Tokens used for each question (and average per model)
- ✅ Separate model input and output prices
- ✅ Duration for each question (and average per model)
- ✅ Cost for each question (and average per model)
- ✅ TPS (tokens per second) of each model, listed like the model's price
- ✅ Total cost (sum of all)
- ✅ Total tool calls made for each question (and average per model)
- A structure that stores old benchmarks too, not just the latest

### 🎤 Pitch

A more insightful benchmark

### 👀 Have you spent some time to check if this issue has been raised before?

- [x] I checked and didn't find similar issue

### 🏢 Have you read the Code of Conduct?

- [x] I have read the [Code of Conduct](https://github.com/appwrite/.github/blob/main/CODE_OF_CONDUCT.md)
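
As a rough illustration of the details listed in the enhancement description, here is a minimal sketch of how the results could be stored. It is not taken from the existing codebase: every class and field name (`QuestionResult`, `ModelRun`, `BenchmarkRun`, `history`) is hypothetical, prices are assumed to be in USD per 1M tokens, and TPS is assumed to mean output tokens divided by total duration.

```python
from dataclasses import dataclass, field
from statistics import mean


@dataclass
class QuestionResult:
    """Metrics recorded for one benchmark question (hypothetical shape)."""
    input_tokens: int
    output_tokens: int
    duration_s: float
    tool_calls: int


@dataclass
class ModelRun:
    """One model's results in one benchmark run, with separate input/output prices."""
    model: str
    input_price_per_mtok: float   # assumption: USD per 1M input tokens
    output_price_per_mtok: float  # assumption: USD per 1M output tokens
    results: list[QuestionResult] = field(default_factory=list)

    def cost(self, r: QuestionResult) -> float:
        """Cost of a single question, pricing input and output tokens separately."""
        return (r.input_tokens * self.input_price_per_mtok
                + r.output_tokens * self.output_price_per_mtok) / 1_000_000

    def summary(self) -> dict[str, float]:
        """Per-model averages and totals; assumes at least one result is present."""
        costs = [self.cost(r) for r in self.results]
        total_output = sum(r.output_tokens for r in self.results)
        total_duration = sum(r.duration_s for r in self.results)
        return {
            "avg_tokens": mean(r.input_tokens + r.output_tokens for r in self.results),
            "avg_duration_s": mean(r.duration_s for r in self.results),
            "avg_cost": mean(costs),
            "total_cost": sum(costs),
            "avg_tool_calls": mean(r.tool_calls for r in self.results),
            # assumption: TPS = output tokens per second of total duration
            "tps": total_output / total_duration,
        }


@dataclass
class BenchmarkRun:
    """A full benchmark run; old runs are kept rather than overwritten."""
    started_at: str  # ISO 8601 timestamp
    models: list[ModelRun]


# Appending each run to a list keeps old benchmarks, not just the latest one.
history: list[BenchmarkRun] = []

example = ModelRun(
    model="example-model",  # placeholder name, not a real model
    input_price_per_mtok=1.0,
    output_price_per_mtok=2.0,
    results=[
        QuestionResult(input_tokens=1000, output_tokens=500, duration_s=2.0, tool_calls=3),
        QuestionResult(input_tokens=3000, output_tokens=1500, duration_s=6.0, tool_calls=1),
    ],
)
history.append(BenchmarkRun(started_at="2025-01-01T00:00:00Z", models=[example]))
stats = example.summary()
```

Keeping raw per-question numbers and deriving the averages on demand means new aggregate columns can be added later without re-running old benchmarks.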