Support generating with fallback for short form audio in Whisper

### Feature request

Generating with temperature fallback based on certain criteria was added to Whisper as part of the long-form generation. We should be able to apply the same fallback criteria to short-form audio. See the discussion [here](https://github.com/huggingface/transformers/pull/27658#issuecomment-1976370427).

### Motivation

The upstream OpenAI implementation does fallback for all audio. In fact there is no distinguishing between "short" and "long" audio, everything is essentially treated as "long audio", and if there's only one segment to transcribe, that's all.

See https://github.com/openai/whisper/blob/main/whisper/transcribe.py#L178

### Your contribution

I probably cannot address this myself.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support generating with fallback for short form audio in Whisper #29508

Feature request

Motivation

Your contribution

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Support generating with fallback for short form audio in Whisper #29508

Description

Feature request

Motivation

Your contribution

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions