
Fixed resize_token_embedding issue #21053 (#21065)

Merged
sgugger merged 1 commit into huggingface:main from susnato:tfgpt2_resize_token_embedding on Jan 16, 2023

Conversation

@susnato (Contributor) commented Jan 9, 2023

What does this PR do?

There was a bug on line 449 of huggingface/transformers/blob/main/src/transformers/models/gpt2/modeling_tf_gpt2.py: the code checked input_ids against self.vocab_size, but resize_token_embeddings updates self.config.vocab_size rather than self.vocab_size, so the stale attribute caused the error described in the issue. To fix this, I replaced self.vocab_size with self.config.vocab_size, which resolved the problem.
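
For context, a paraphrased sketch of the check involved (assert_ids_within_vocab is a hypothetical helper; the real guard is inlined in TFGPT2MainLayer.call, and the error message is abbreviated):

```python
import tensorflow as tf

def assert_ids_within_vocab(input_ids: tf.Tensor, vocab_size: int) -> None:
    # Hypothetical helper mirroring the guard in TFGPT2MainLayer.call.
    tf.debugging.assert_less(
        input_ids,
        tf.cast(vocab_size, dtype=input_ids.dtype),
        message="input_ids must be smaller than the embedding layer's input dimension",
    )

ids = tf.constant([[0, 5, 41999]], dtype=tf.int32)

# Buggy call site passed the attribute snapshotted at __init__ time:
#     assert_ids_within_vocab(input_ids, self.vocab_size)
# Fixed call site passes the live value that resize_token_embeddings updates:
#     assert_ids_within_vocab(input_ids, self.config.vocab_size)
assert_ids_within_vocab(ids, 50257)  # 50257 is GPT-2's default vocab size
```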

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed.
@sgugger

@HuggingFaceDocBuilderDev commented Jan 9, 2023

The documentation is not available anymore as the PR was closed or merged.

@sgugger (Collaborator) commented Jan 9, 2023

cc @gante and @Rocketknight1

@gante (Contributor) left a comment

This is absolutely correct. self.vocab_size can easily get stale when the vocabulary gets updated, and the check should be done against the config.

(there are other models with this issue, where the fix needs to be slightly different, so I'll have a look very soon)
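
To make the failure mode concrete, a minimal, self-contained illustration (ToyConfig and ToyLayer are hypothetical stand-ins, not transformers classes):

```python
import tensorflow as tf

class ToyConfig:
    def __init__(self, vocab_size: int):
        self.vocab_size = vocab_size

class ToyLayer:
    """Stand-in for a TF main layer that snapshots vocab_size at __init__ time."""
    def __init__(self, config: ToyConfig):
        self.config = config
        self.vocab_size = config.vocab_size  # stale copy after any resize

    def check_buggy(self, input_ids: tf.Tensor) -> None:
        tf.debugging.assert_less(input_ids, tf.cast(self.vocab_size, input_ids.dtype))

    def check_fixed(self, input_ids: tf.Tensor) -> None:
        tf.debugging.assert_less(input_ids, tf.cast(self.config.vocab_size, input_ids.dtype))

layer = ToyLayer(ToyConfig(vocab_size=10))
layer.config.vocab_size = 12   # roughly what resize_token_embeddings does to the config
new_token = tf.constant([11], dtype=tf.int32)

layer.check_fixed(new_token)   # passes: 11 < 12
try:
    layer.check_buggy(new_token)  # fails: 11 >= the stale bound of 10
except tf.errors.InvalidArgumentError as e:
    print("stale check rejected a valid token id:", e.message)
```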

@gante (Contributor) commented Jan 16, 2023

@sgugger feel free to merge if you approve. As I wrote above, other models have a similar problem, which requires a more elaborate fix.

@gante (Contributor) commented Jan 16, 2023

@susnato can you remove the Fixes https://github.com/huggingface/transformers/issues/21053 at the top? That way, the issue stays open and I likely won't forget to fix the other models :)

@susnato (Contributor, Author) commented Jan 16, 2023

> @susnato can you remove the Fixes https://github.com/huggingface/transformers/issues/21053 at the top? That way, the issue stays open and I likely won't forget to fix the other models :)

Hi @gante, I removed the line... is it ok now?

@sgugger (Collaborator) left a comment

Thanks for the fix!

sgugger merged commit a5327c6 into huggingface:main on Jan 16, 2023
@susnato (Contributor, Author) commented Jan 16, 2023

> This is absolutely correct. self.vocab_size can easily get stale when the vocabulary gets updated, and the check should be done against the config.
>
> (there are other models with this issue, where the fix needs to be slightly different, so I'll have a look very soon)

Hi @gante, if you want, I would be happy to look into this and fix it if I can.

@gante (Contributor) commented Jan 17, 2023

@susnato sounds good!

My plan is to remove all references to self.vocab_size, deleting the attribute whenever it is set from the config at __init__ time (if needed, storing the config in self.config instead, since it holds the mutable vocabulary size).

If you search for "tf.cast(self.vocab_size", you will find all the occurrences that will likely need to be touched.
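
Roughly, the refactor on an affected layer could look like this (SomeTFLayer is an illustrative name, not an actual transformers class):

```python
import tensorflow as tf

# Before: the layer copies vocab_size out of the config at __init__ time,
# so the copy goes stale when resize_token_embeddings updates the config.
class SomeTFLayer(tf.keras.layers.Layer):
    def __init__(self, config, **kwargs):
        super().__init__(**kwargs)
        self.vocab_size = config.vocab_size

    def call(self, input_ids):
        tf.debugging.assert_less(input_ids, tf.cast(self.vocab_size, input_ids.dtype))
        return input_ids

# After: the layer keeps the config itself and reads vocab_size lazily,
# so it always sees the current (possibly resized) vocabulary size.
class SomeTFLayerFixed(tf.keras.layers.Layer):
    def __init__(self, config, **kwargs):
        super().__init__(**kwargs)
        self.config = config

    def call(self, input_ids):
        tf.debugging.assert_less(input_ids, tf.cast(self.config.vocab_size, input_ids.dtype))
        return input_ids
```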

susnato deleted the tfgpt2_resize_token_embedding branch on January 17, 2023 at 15:19
@susnato (Contributor, Author) commented Jan 17, 2023

> @susnato sounds good!
>
> My plan is to remove all references to self.vocab_size, deleting the attribute whenever it is set from the config at __init__ time (if needed, storing the config in self.config instead, since it holds the mutable vocabulary size).
>
> If you search for "tf.cast(self.vocab_size", you will find all the occurrences that will likely need to be touched.

Hi @gante, I am going to check all models in src/transformers/models/modeling_tf_<model>.py and remove references to self.vocab_size. I also found some references to self.vocab_size in some of the <model>MLMHead classes; I need to change those too, right?

@gante (Contributor) commented Jan 18, 2023

@susnato yes. If we look at the corresponding PT implementation, e.g. for Albert, the layer classes store self.config = config for future use, as opposed to individual attributes of the config. Making the switch here protects us from errors like the one that originated this PR :)
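
For reference, a heavily simplified sketch of the PT-side pattern described above (AlbertStyleHead is an illustrative name, not a verbatim copy of modeling_albert.py):

```python
import torch
from torch import nn

class AlbertStyleHead(nn.Module):
    """Simplified sketch: keep the whole config rather than copies of its fields,
    so code that later reads self.config.vocab_size (e.g. for shape checks)
    always sees the current value."""
    def __init__(self, config):
        super().__init__()
        self.config = config
        self.decoder = nn.Linear(config.hidden_size, config.vocab_size)

    def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
        return self.decoder(hidden_states)
```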

susnato changed the title from "Fixed issue #21053" to "Fixed resize_token_embedding issue #21053" on Jan 27, 2024