
Export of Openai Whisper with batched prompts#19854

Merged
YUNQIUGUO merged 9 commits into microsoft:main from shubhambhokare1:sbhokare/batched-whisper
Apr 3, 2024

Conversation

Contributor

@shubhambhokare1 shubhambhokare1 commented Mar 11, 2024

Adds an example to demonstrate the export of the OpenAI Whisper implementation with batch_size > 1 and the addition of prompts for each audio snippet.

Also handles the scenario where prompts are not the same size. For example, if the prompt ids are `[p1_id_1, p1_id_2]` and `[p2_id_1]`, the final decoder_input_ids will look as follows after padding:
`[prev_token, p1_id_1, p1_id_2, start_token, lang_token, transcribe_token]`
`[prev_token, p2_id_1, PAD_TOKEN, start_token, lang_token, transcribe_token]`
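The padding scheme described above can be sketched in a few lines. This is an illustrative sketch, not the actual PR code: the token id constants below are taken from the multilingual Whisper vocabulary but should be treated as assumptions, and `build_decoder_input_ids` is a hypothetical helper name.

```python
# Illustrative sketch of batching unequal-length prompts into decoder_input_ids.
# Token ids below follow the multilingual Whisper vocabulary (assumption):
PREV_TOKEN = 50361        # <|startofprev|>
START_TOKEN = 50258       # <|startoftranscript|>
LANG_TOKEN = 50259        # e.g. <|en|>
TRANSCRIBE_TOKEN = 50359  # <|transcribe|>
PAD_TOKEN = 50257         # <|endoftext|>, reused to right-pad shorter prompts

def build_decoder_input_ids(batch_prompt_ids):
    """Pad each prompt to the longest one in the batch, then prepend the
    prev token and append the forced decoder tokens, so every row has the
    same length."""
    max_len = max(len(p) for p in batch_prompt_ids)
    batch = []
    for prompt in batch_prompt_ids:
        padded = list(prompt) + [PAD_TOKEN] * (max_len - len(prompt))
        batch.append([PREV_TOKEN] + padded
                     + [START_TOKEN, LANG_TOKEN, TRANSCRIBE_TOKEN])
    return batch

# Mirroring the example above, with placeholder prompt ids 11, 22, 33:
rows = build_decoder_input_ids([[11, 22], [33]])
# rows[0] -> [PREV_TOKEN, 11, 22, START_TOKEN, LANG_TOKEN, TRANSCRIBE_TOKEN]
# rows[1] -> [PREV_TOKEN, 33, PAD_TOKEN, START_TOKEN, LANG_TOKEN, TRANSCRIBE_TOKEN]
```

Right-padding the prompt segment (rather than the whole sequence) keeps the forced tokens `start_token, lang_token, transcribe_token` aligned at the same positions across the batch.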

Co-authored-by: kunal-vaishnavi <115581922+kunal-vaishnavi@users.noreply.github.com>
@YUNQIUGUO YUNQIUGUO merged commit be831e1 into microsoft:main Apr 3, 2024
YUNQIUGUO pushed a commit that referenced this pull request Apr 3, 2024
TedThemistokleous pushed a commit to TedThemistokleous/onnxruntime that referenced this pull request May 7, 2024

4 participants