Prerequisites
Please answer the following questions for yourself before submitting an issue.
Expected Behavior
Prompt eval time is around twice as long as eval time (12 tokens/sec vs 22 tokens/sec). Is there a way to make them both the same speed?
Current Behavior
Prompt eval time takes twice as long as eval time.
Prerequisites
Please answer the following questions for yourself before submitting an issue.
Expected Behavior
Prompt eval time is around twice as long as eval time (12 tokens/sec vs 22 tokens/sec). Is there a way to make them both the same speed?
Current Behavior
Prompt eval time takes twice as long as eval time.