I have a lot of prompts to feed to a model. How can I speed up inference?