[resize_embedding] Introduce pad_to_multiple_of and guidance #25088
Merged
ArthurZucker merged 16 commits into huggingface:main from pad-tok-negativ on Aug 17, 2023
Conversation
The documentation is not available anymore as the PR was closed or merged.
Force-pushed from 15ff141 to 3cb0d07.
Changed title from "[Tokenization] Introduce support for negative index padding when people forget to have a padding token" to "[resize_embedding] Introduce pad_to_multiple_of and guidance".
ArthurZucker (Collaborator, Author) commented on Aug 1, 2023:

All breaking changes happen in the internal, non-exposed parts of the functions.
sgugger (Collaborator) approved these changes on Aug 17, 2023:

Thanks for cleaning this up!
Millu added a commit to invoke-ai/InvokeAI that referenced this pull request on Nov 10, 2023:
## What type of PR is this? (check all applicable)

- [ ] Refactor
- [ ] Feature
- [x] Bug Fix
- [x] Optimization
- [ ] Documentation Update
- [ ] Community Node Submission

## Have you discussed this change with the InvokeAI team?

- [x] Yes, with @blessedcoolant
- [ ] No, because:

## Have you updated all relevant documentation?

- [ ] Yes
- [ ] No

## Description

This PR updates Transformers to the most recent version and sets the value of `pad_to_multiple_of` for `text_encoder.resize_token_embeddings`, which was introduced with huggingface/transformers#25088 in Transformers 4.32.0.

According to the [Nvidia documentation](https://docs.nvidia.com/deeplearning/performance/dl-performance-matrix-multiplication/index.html#requirements-tc), "Performance is better when equivalent matrix dimensions M, N, and K are aligned to multiples of 8 bytes (or 64 bytes on A100) for FP16."

This fixes the following warning, which was popping up before every invocation starting with Transformers 4.32.0:

> You are resizing the embedding layer without providing a pad_to_multiple_of parameter. This means that the new embedding dimension will be None. This might induce some performance reduction as Tensor Cores will not be available. For more details about this, or help on choosing the correct value for resizing, refer to this guide: https://docs.nvidia.com/deeplearning/performance/dl-performance-matrix-multiplication/index.html#requirements-tc

This is my first "real" fix PR, so I hope this is fine. Please inform me if there is anything wrong with it. I am glad to help. Have a nice day and thank you!

## Related Tickets & Documents

- Related Issue: huggingface/transformers#26303
- Related Discord discussion: https://discord.com/channels/1020123559063990373/1154152783579197571
- Closes #

## QA Instructions, Screenshots, Recordings

## Added/updated tests?

- [ ] Yes
- [ ] No : _please replace this line with details on why tests have not been included_

## [optional] Are there any post deployment tasks we need to perform?
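A minimal sketch of the change described above, assuming a CLIP text encoder and tokenizer of the kind used in Stable-Diffusion-style pipelines; the checkpoint name and the added token are illustrative, not taken from the InvokeAI code:

```python
# Sketch: resize the text encoder's embeddings after adding tokens,
# passing pad_to_multiple_of so FP16 matmuls can use Tensor Cores.
from transformers import CLIPTextModel, CLIPTokenizer

tokenizer = CLIPTokenizer.from_pretrained("openai/clip-vit-base-patch32")
text_encoder = CLIPTextModel.from_pretrained("openai/clip-vit-base-patch32")

# e.g. a textual-inversion concept token (hypothetical)
tokenizer.add_tokens(["<my-concept>"])

# Align the new vocabulary size to a multiple of 8 (Nvidia's FP16
# recommendation above); omitting pad_to_multiple_of triggers the
# warning quoted in the description under Transformers >= 4.32.0.
text_encoder.resize_token_embeddings(len(tokenizer), pad_to_multiple_of=8)
```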
What does this PR do?
Fixes #22312.
After internal discussions, it appears that adding the possibility to pad with -1 to tokenizers is not really feasible (nor is it desirable). However, what we can do is, by default, resize the embedding layer to the nearest size that is optimal for the dtype of the model, following this.

Motivations:

- `_get_resized_embeddings` is not exposed, and thus making this automatic can be a big silent win.

Cons:

- `config.optimise_resize` might be needed?
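A minimal sketch of the API this PR introduces, as described above; the checkpoint is chosen only for illustration, and the `pad_to_multiple_of` value follows the Nvidia guidance quoted earlier:

```python
# Sketch: resizing token embeddings with the new pad_to_multiple_of argument.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

# GPT-2 ships without a pad token; adding one grows the vocab 50257 -> 50258.
tokenizer.add_special_tokens({"pad_token": "[PAD]"})

# Pad the embedding matrix up to the next multiple of 64: 50258 -> 50304.
# Calling this without pad_to_multiple_of emits the guidance warning instead.
model.resize_token_embeddings(len(tokenizer), pad_to_multiple_of=64)
print(model.get_input_embeddings().weight.shape[0])  # 50304
```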