Feature request
Temperature fallback, which retries generation at a higher temperature when certain criteria are not met, was added to Whisper as part of long-form generation. We should be able to apply the same fallback criteria to short-form audio. See the discussion here.
Motivation
The upstream OpenAI implementation applies fallback to all audio. In fact, it does not distinguish between "short" and "long" audio: everything is essentially treated as long audio, and if there is only one segment to transcribe, only that segment is processed.
See https://github.com/openai/whisper/blob/main/whisper/transcribe.py#L178
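For context, here is a rough sketch of what such a fallback loop looks like, independent of audio length. It is not the upstream code or the Transformers API: `DecodeResult`, `decode_with_fallback`, and the `decode` callable are hypothetical names, and the default temperatures and thresholds are my recollection of the upstream defaults, so treat them as assumptions.

```python
import zlib
from dataclasses import dataclass
from typing import Callable, Sequence, Tuple


@dataclass
class DecodeResult:
    # Hypothetical container; not a type from openai/whisper or transformers.
    text: str
    avg_logprob: float
    temperature: float


def compression_ratio(text: str) -> float:
    """Ratio of raw to zlib-compressed size; highly repetitive text scores high."""
    raw = text.encode("utf-8")
    return len(raw) / len(zlib.compress(raw))


def decode_with_fallback(
    decode: Callable[[float], Tuple[str, float]],
    temperatures: Sequence[float] = (0.0, 0.2, 0.4, 0.6, 0.8, 1.0),  # assumed defaults
    compression_ratio_threshold: float = 2.4,  # assumed default
    logprob_threshold: float = -1.0,  # assumed default
) -> DecodeResult:
    """Try each temperature in order and stop at the first acceptable result.

    `decode` is a stand-in for one decoding pass over a single segment: it takes
    a temperature and returns (text, average token log-probability).
    """
    if not temperatures:
        raise ValueError("at least one temperature is required")

    result = None
    for temperature in temperatures:
        text, avg_logprob = decode(temperature)
        result = DecodeResult(text, avg_logprob, temperature)

        too_repetitive = compression_ratio(text) > compression_ratio_threshold
        too_uncertain = avg_logprob < logprob_threshold
        if not (too_repetitive or too_uncertain):
            break  # good enough, no need to fall back further

    # If every temperature failed the checks, the last attempt is returned.
    return result
```

Nothing in this loop depends on how many segments the audio has, which is the point of the request: a short clip is just the one-segment case. If I remember correctly, upstream also skips the fallback when the no-speech probability is high; that check is left out here for brevity.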
Your contribution
I probably cannot address this myself.