Is your feature request related to a problem? Please describe.
llamafile seems to have quite of a speedup in terms of execution with CPU
Describe the solution you'd like
a llamafile backend
Describe alternatives you've considered
N/A
Additional context