Whisper fix audio out of range#31770
Conversation
|
Gentle ping @sanchit-gandhi @kamilakesbi |
|
cc @ylacombe |
|
Hey @maxkvbn, thanks for opening this PR and for #31683! Do you think you could send the audio that causes your code to fail? The link to download it is no longer working. Or better yet, an end-to-end snippet to reproduce it. If that's okay with you, I'll review your PR once I can reproduce the error! |
|
Hey @maxkvbn, gentle up! |
|
Hey @ylacombe I've reuploaded the audio in question: https://fastupload.io/b9ff61358ef5c4b1 |
|
Thanks for the audio @maxkvbn, I was able to reproduce the issue! I believe the issue happens when using |
|
Hey @maxkvbn, after some deeper investigation, I'm fairly certain the bugs come from the generation itself, and not the tokenization! I still have to figure out how to actually fix it, but it comes from a recent change in |
What does this PR do?
Fixes #31683
Before submitting
Pull Request section?
to it if that's the case.
Whisper - list index out of range with word level timestamps #31683
documentation guidelines, and
here are tips on formatting docstrings.
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.