Is your feature request related to a problem?
No response
Describe the solution you'd like
The Berkeley Function Calling Leaderboard (BFCL) eval focuses on tool-calling scenarios, including ones that require multiple tool calls, and measures a model's ability to select and invoke tools correctly.
It would be a good addition: https://github.com/ShishirPatil/gorilla
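For illustration, here is a minimal sketch of the kind of check a multi-tool-calling eval might perform. It is not BFCL's actual scoring code: the function names, the `name`/`arguments` call format, and the exact-match, order-insensitive rule are all assumptions made for this example.

```python
from typing import Any

Call = dict[str, Any]  # hypothetical call format: {"name": str, "arguments": dict}


def call_matches(predicted: Call, expected: Call) -> bool:
    """Return True when the function name and the arguments match exactly."""
    return (
        predicted.get("name") == expected.get("name")
        and predicted.get("arguments") == expected.get("arguments")
    )


def calls_match(predicted: list[Call], expected: list[Call]) -> bool:
    """Order-insensitive comparison of two lists of tool calls.

    Every expected call must be matched by a distinct predicted call,
    and no extra predicted calls are allowed.
    """
    if len(predicted) != len(expected):
        return False
    remaining = list(predicted)
    for want in expected:
        for i, got in enumerate(remaining):
            if call_matches(got, want):
                del remaining[i]  # each predicted call can be used only once
                break
        else:
            return False  # no predicted call matched this expected call
    return True


expected_calls = [
    {"name": "get_weather", "arguments": {"city": "Berkeley"}},
    {"name": "get_time", "arguments": {"tz": "UTC"}},
]
predicted_calls = list(reversed(expected_calls))
print(calls_match(predicted_calls, expected_calls))
```

A real integration would presumably rely on the checkers shipped in the gorilla repository rather than an exact-match rule like this one.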
What are you requesting?
New benchmark/evaluation
Describe alternatives you've considered
No response
Use case
Testing models on the BFCL live and non-live datasets.
https://gorilla.cs.berkeley.edu/leaderboard.html
Additional context
No response