
Fix incorrect comments about atten mask for pytorch backend#18728

Merged
sgugger merged 3 commits into huggingface:main from lygztq:fix-comment
Sep 23, 2022

Conversation

@lygztq (Contributor) commented Aug 23, 2022

What does this PR do?

I found that these comments do not match the actual behavior: the comments say we use -10000.0 to mask out elements before the softmax, but the value actually used is torch.finfo(dtype).min.
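The difference can be illustrated with a minimal sketch of the additive-mask pattern the comments describe (the helper name `extended_attention_mask` is hypothetical; this is not the actual transformers code):

```python
import torch

def extended_attention_mask(attention_mask: torch.Tensor, dtype: torch.dtype) -> torch.Tensor:
    # attention_mask holds 1.0 for positions to attend to, 0.0 for masked ones.
    mask = attention_mask.to(dtype=dtype)  # fp16/bf16 compatibility
    # Masked positions become the most negative representable value for this
    # dtype (not a hard-coded -10000.0), so they vanish after the softmax.
    return (1.0 - mask) * torch.finfo(dtype).min

mask = torch.tensor([[1.0, 1.0, 0.0]])      # third position is masked
scores = torch.tensor([[2.0, 1.0, 5.0]])    # raw attention scores
probs = torch.softmax(scores + extended_attention_mask(mask, torch.float32), dim=-1)
# The masked (third) position receives ~0 probability.
```

Adding the mask to the scores rather than indexing keeps the operation shape-preserving and fully vectorized.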

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@HuggingFaceDocBuilderDev commented Aug 23, 2022

The documentation is not available anymore as the PR was closed or merged.

@ydshieh (Collaborator) commented Aug 23, 2022

@lygztq Great catch. This inconsistency was introduced in (my) PR #17306, where the docstrings were not updated - my bad.

@ydshieh (Collaborator) commented Aug 23, 2022

I think it's clearer to change "... the smallest value ..." to "... the dtype's smallest value ...". WDYT?

@lygztq (Contributor, Author) commented Aug 23, 2022

I think it's more clear to change from ... the smallest value ... to ... the dtype's smallest value .... WDYT?

I agree, this is better.
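For context, a quick standalone check (not from the PR) of what "the dtype's smallest value" resolves to for the dtypes involved:

```python
import torch

# The masking value now depends on the dtype rather than being a fixed -10000.0:
print(torch.finfo(torch.float32).min)  # most negative float32, about -3.4e38
print(torch.finfo(torch.float16).min)  # -65504.0, the most negative float16
```

This is why the docstring wording matters: in float16, -10000.0 is representable but far from the minimum, while torch.finfo(dtype).min always picks the most negative value the dtype can hold.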

@huggingface huggingface deleted a comment from github-actions Bot Sep 22, 2022
  # effectively the same as removing these entirely.
  attention_mask = attention_mask.to(dtype=self.dtype)  # fp16 compatibility
- attention_mask = (1.0 - attention_mask) * -10000.0
+ attention_mask = (1.0 - attention_mask) * torch.finfo(self.dtype).min
@ydshieh (Collaborator) commented on the diff, Sep 23, 2022

CodeGen was added after #17306. So update it here along with the comment.

@ydshieh (Collaborator) left a comment

Hi @lygztq, sorry for being so late on this PR. I updated the PR with the main branch and added the same fix for the CodeGen model. Once the tests pass, we are ready to merge. Thank you for the fix 🤗 !

@ydshieh ydshieh requested a review from sgugger September 23, 2022 17:42
@sgugger (Collaborator) left a comment

Thanks a lot for fixing!

@sgugger sgugger merged commit ece7624 into huggingface:main Sep 23, 2022

4 participants