Fix loss computation in TFWav2Vec2ForCTC by Sreyan88 · Pull Request #18014 · huggingface/transformers

Sreyan88 · 2022-07-04T13:43:15Z

What does this PR do?

The TFWav2Vec2ForCTC implementation computed the CTC loss incorrectly. The root cause was that the CTC target labels never reached the loss calculation: the labels argument was None. Adding @unpack_inputs unpacks the inputs properly, so the loss is now computed from the labels.
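
To illustrate the failure mode described above, here is a minimal pure-Python sketch. It is not the transformers implementation of @unpack_inputs: the decorator `unpack_inputs_sketch`, the class `ToyCTCModel`, and its return strings are all hypothetical names made up for this example. It only assumes that inputs may arrive packed in a dict as the first argument, and shows how labels stay hidden unless the dict is unpacked into keyword arguments.

```python
import functools
import inspect


def unpack_inputs_sketch(func):
    """Hypothetical stand-in for the idea behind @unpack_inputs (not the library code)."""
    sig = inspect.signature(func)
    param_names = [name for name in sig.parameters if name != "self"]
    first = param_names[0]

    @functools.wraps(func)
    def wrapper(self, *args, **kwargs):
        # Map positional and keyword arguments onto parameter names.
        arguments = dict(sig.bind_partial(self, *args, **kwargs).arguments)
        arguments.pop("self")
        main_input = arguments.get(first)
        if isinstance(main_input, dict):
            # Everything was packed into the first argument: spread it out.
            packed = dict(main_input)
            arguments[first] = packed.pop(first, None)
            for key, value in packed.items():
                if key not in param_names:
                    raise TypeError(f"unexpected input {key!r}")
                arguments[key] = value
        return func(self, **arguments)

    return wrapper


class ToyCTCModel:
    """Toy model: only reports whether labels reached the loss branch."""

    def call_without_unpacking(self, input_values, labels=None):
        return "loss computed" if labels is not None else "loss skipped"

    @unpack_inputs_sketch
    def call(self, input_values, labels=None):
        return "loss computed" if labels is not None else "loss skipped"


batch = {"input_values": [0.1, 0.2], "labels": [3, 7]}
model = ToyCTCModel()
print(model.call_without_unpacking(batch))  # labels stay inside the dict, so labels is None
print(model.call(batch))                    # labels are unpacked and reach the loss branch
```

In the undecorated method the whole dict is bound to `input_values`, so `labels` keeps its default of None and the loss branch is never taken, which mirrors the symptom reported in this PR.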

Additionally, the loss needed to be reshaped for backpropagation.

Fixes #18009

HuggingFaceDocBuilderDev · 2022-07-04T13:53:29Z

The documentation is not available anymore as the PR was closed or merged.

Rocketknight1 · 2022-07-04T16:11:19Z

This looks good to me! We just made the same change to several other losses (reshaping the output from a scalar to a tensor with shape (1,)).

My only concern is that I have no idea how the lack of @unpack_inputs was missed by tests, but I'm happy to merge this for now and think about how to expand test coverage afterwards!

Rocketknight1 · 2022-07-04T16:12:25Z

@Sreyan88 I'm happy with this and I think it's ready to merge now - if you want to make any other changes, now's the time. If not, ping me and I'll merge it!

Sreyan88 · 2022-07-04T16:34:30Z

> @Sreyan88 I'm happy with this and I think it's ready to merge now - if you want to make any other changes, now's the time. If not, ping me and I'll merge it!

@Rocketknight1 I'm happy, you can merge!

Co-authored-by: Sreyan-G@NVIDIA <sreyang@nvidia.com>

fixed calculation of ctc loss in TFWav2Vec2ForCTC

539ff7e

Rocketknight1 self-requested a review July 4, 2022 16:06

Rocketknight1 self-assigned this Jul 4, 2022

Rocketknight1 approved these changes Jul 4, 2022

Rocketknight1 merged commit e3139ad into huggingface:main Jul 4, 2022

viclzhu pushed a commit to viclzhu/transformers that referenced this pull request Jul 18, 2022

fixed calculation of ctc loss in TFWav2Vec2ForCTC (huggingface#18014)

02215e7

Co-authored-by: Sreyan-G@NVIDIA <sreyang@nvidia.com>

amyeroberts mentioned this pull request Feb 9, 2023

Replace input_values_processing with unpack_inputs #21502

Merged

5 tasks

Rocketknight1 merged 1 commit into huggingface:main from Sreyan88:fix-ctc-wav2vec2-tf

4 participants
