deleted references of self.vocab_size and self.type_vocab_size for multiple models [TF implementation] #21164

Merged
gante merged 28 commits into huggingface:main from susnato:resize_token_embeddings_models on Jan 20, 2023
Conversation

susnato (Contributor) commented Jan 18, 2023

What does this PR do?

This PR removes references to self.vocab_size and self.type_vocab_size for these models [TensorFlow implementation]: bert, albert, lxmert, electra, tapas, convbert, layoutlm, gpt2, camembert, clip, ctrl, deberta, deberta_v2, distilbert, esm, funnel, gptj, groupvit, longformer, mobilebert, mpnet, openai, rembert, roberta, roberta_prelayernorm, roformer, xlm_roberta, xlnet.

Who can review?

@gante

susnato (Contributor, Author) commented Jan 18, 2023

Hi @gante, as you said here, I made the changes for src/transformers/models/albert/modeling_tf_albert.py; could you please check them? If they look OK, I will push the changes for the other models.

The failed ci/circleci: check_repository_consistency test is due to the fact that I have only changed the embeddings for the albert model, and they differ from TFBertEmbeddings; the test will pass once I change those too.
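For illustration, the change follows the pattern below (a minimal sketch with hypothetical EmbeddingsBefore/EmbeddingsAfter names, not the exact diff from this PR; the real layers define more attributes and build more weights):

import tensorflow as tf

# Before: config values are copied into instance attributes at init time,
# so a later change to config.vocab_size is never seen by the layer.
class EmbeddingsBefore(tf.keras.layers.Layer):
    def __init__(self, config, **kwargs):
        super().__init__(**kwargs)
        self.vocab_size = config.vocab_size          # snapshot; goes stale
        self.embedding_size = config.embedding_size

    def build(self, input_shape):
        self.weight = self.add_weight(
            name="word_embeddings",
            shape=[self.vocab_size, self.embedding_size],
        )
        super().build(input_shape)

# After: keep a reference to the (mutable) config and read attributes on demand.
class EmbeddingsAfter(tf.keras.layers.Layer):
    def __init__(self, config, **kwargs):
        super().__init__(**kwargs)
        self.config = config                         # shared reference; stays current

    def build(self, input_shape):
        self.weight = self.add_weight(
            name="word_embeddings",
            shape=[self.config.vocab_size, self.config.embedding_size],
        )
        super().build(input_shape)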

HuggingFaceDocBuilderDev commented Jan 18, 2023

The documentation is not available anymore as the PR was closed or merged.

gante (Contributor) left a comment

@susnato LGTM, this is precisely what I wanted to do!

gante (Contributor) left a comment

If you have the patience to do it, removing the other attributes that are set from the config (like self.type_vocab_size) would also be great :) That way there will be greater similarity between PT and TF 🧡

A Collaborator left a comment

Yes, that would be great!

susnato (Contributor, Author) left a comment

@gante
OK, I will remove self.type_vocab_size. By the way, there seem to be three more such lines:

self.embedding_size = config.embedding_size
self.max_position_embeddings = config.max_position_embeddings
self.initializer_range = config.initializer_range

I don't think we need to change them for this purpose, right?

gante (Contributor) left a comment

It's not needed, but it would be a nice refactor :) Don't worry about it if you are not interested in making the change; we can pick it up afterwards. The vocab_size is the most important one for now, as it is causing bugs.

gante (Contributor) commented Jan 18, 2023

@susnato before moving forward, let's make sure our transformers master agrees with the set of changes we are about to make :)


@sgugger this PR contains a change I want to apply across TF models, and @susnato is kindly doing the bulk of the work. In essence, some config attributes are not constant throughout the model's life, like config.vocab_size, and our TF implementation stores them as immutable class attributes (e.g. self.vocab_size = config.vocab_size). PT doesn't have this problem, since it simply stores self.config = config in the layers, which benefits from the updates the mutable config dictionary may receive elsewhere.

The proposed change is to make the TF implementation closer to the PT implementation and store self.config in the layers, as opposed to individual configuration parameters. This also solves the bug that triggered this discussion, where the vocabulary size was not being correctly updated, causing exceptions.
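To make the failure mode concrete, here is a minimal, self-contained sketch (hypothetical MyConfig, StaleLayer, and LiveLayer names; this is not the transformers code):

class MyConfig:
    def __init__(self, vocab_size=30000):
        self.vocab_size = vocab_size

class StaleLayer:
    def __init__(self, config):
        self.vocab_size = config.vocab_size   # copied once; never sees later updates

class LiveLayer:
    def __init__(self, config):
        self.config = config                  # shared reference; always current

config = MyConfig()
stale, live = StaleLayer(config), LiveLayer(config)

config.vocab_size = 30522  # e.g. what a token-embedding resize does to the config
print(stale.vocab_size)          # 30000: stale value, can cause out-of-range lookups
print(live.config.vocab_size)    # 30522: reflects the update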

Let us know if you are okay with us making this change over most model architectures 🚀

sgugger (Collaborator) left a comment

I have always wondered why TensorFlow models did not store the config. Just pinging @Rocketknight1 to also have a look in case there is some arcane TF behavior behind this that would break something, but otherwise I'm all for this change!

A Collaborator left a comment

Yes, that would be great!

susnato force-pushed the resize_token_embeddings_models branch from 3772de9 to 0f75e19 on January 20, 2023 at 05:32
susnato changed the title from "deleted references of self.vocab_size only for albert" to "deleted references of self.vocab_size for bert, albert, lxmert, electra, tapas, convbert, layoutlm" on Jan 20, 2023
susnato changed the title to "deleted references of self.vocab_size for bert, albert, lxmert, electra, tapas, convbert, layoutlm, gpt2" on Jan 20, 2023
susnato changed the title to "deleted references of self.vocab_size for bert, albert, lxmert, electra, tapas, convbert, layoutlm, gpt2 [TF]" on Jan 20, 2023
susnato changed the title to "deleted references of self.vocab_size for bert, albert, lxmert, electra, tapas, convbert, layoutlm, gpt2, camembert, clip, ctrl, deberta, deberta_v2, distilbert, esm, funnel, gptj, groupvit [TF]" on Jan 20, 2023
gante (Contributor) commented Jan 20, 2023

@susnato you might need to run make fixup locally to automatically format the code and make our CI happy

susnato changed the title to "deleted references of self.vocab_size for bert, albert, lxmert, electra, tapas, convbert, layoutlm, gpt2, camembert, clip, ctrl, deberta, deberta_v2, distilbert, esm, funnel, gptj, groupvit, longformer, mobilebert, mpnet, openai, rembert, roberta, roberta_prelayernorm, roformer, xlm_roberta, xlnet [TF]" on Jan 20, 2023
susnato (Contributor, Author) commented Jan 20, 2023

Hi @gante, I added all the models I found that have self.vocab_size, removed the references to self.vocab_size and self.type_vocab_size, and all the tests pass! Could you please check it?

gante (Contributor) left a comment

LGTM 👍

Can we edit the PR title to a shorter one before merging? 😅

susnato changed the title to "deleted references of self.vocab_size and self.type_vocab_size for multiple models [TF implementation]" on Jan 20, 2023
susnato (Contributor, Author) commented Jan 20, 2023

> LGTM 👍
> Can we edit the PR title to a shorter one before merging? 😅

@gante Done!

gante (Contributor) commented Jan 20, 2023

Awesome!

Thank you for all the work you've put into fixing this, @susnato 🤗

@gante gante merged commit 202d686 into huggingface:main Jan 20, 2023