
Fix Failed tests with mobile bert resize tokens embedding#33950

Merged
gante merged 13 commits into huggingface:main from abuelnasr0:test_fail_resize_tokens
Oct 9, 2024

Conversation

abuelnasr0 (Contributor) commented Oct 4, 2024

Fixes the failures introduced by #33325

The tests failed for mobilebert because of a missing transpose of the old_lm_head. This PR fixes that. I have run the two failing tests locally.
It's weird that all tests passed before merging. EDIT: I see now, some tests were skipped.
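For illustration, the transpose issue can be sketched roughly like this (a hypothetical numpy sketch, not the actual modeling_utils code; resize_lm_head and the [hidden, vocab] layout are assumptions for the example):

```python
import numpy as np

# Hypothetical sketch: some lm_head decoders store their weight transposed,
# as [hidden, vocab] instead of [vocab, hidden]. The resize must normalize
# the layout before copying the old rows, then transpose back at the end.
def resize_lm_head(old_weight, new_vocab, transposed=False):
    w = old_weight.T if transposed else old_weight   # now [vocab, hidden]
    old_vocab, hidden = w.shape
    new_w = np.zeros((new_vocab, hidden), dtype=w.dtype)
    n = min(old_vocab, new_vocab)
    new_w[:n] = w[:n]                                # copy the old embeddings over
    return new_w.T if transposed else new_w

old = np.random.randn(8, 4)                  # transposed layout: [hidden=8, vocab=4]
resized = resize_lm_head(old, new_vocab=6, transposed=True)
```

Skipping the first transpose would make the row copy run along the wrong axis, which is the kind of shape mismatch the failing mobilebert tests caught.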

I have also changed the logic for when the covariance matrix is not positive definite: in that case, the new embeddings are simply initialized with the mean.
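A rough sketch of that fallback (hypothetical numpy code; init_new_embeddings and the eps value are illustrative assumptions, not the transformers implementation):

```python
import numpy as np

# Fit a multivariate normal on the old embeddings; if the (shrunk) covariance
# is not positive definite, fall back to initializing every new row with the
# mean of the old embeddings.
def init_new_embeddings(old_emb, n_new, eps=1e-9):
    mean = old_emb.mean(axis=0)
    cov = eps * np.cov(old_emb.T)        # shrinking reduces the chance of outlier samples
    try:
        np.linalg.cholesky(cov)          # cheap positive-definiteness check
        return np.random.multivariate_normal(mean, cov, size=n_new)
    except np.linalg.LinAlgError:
        return np.tile(mean, (n_new, 1)) # fallback: just the mean

old = np.ones((5, 3))                    # degenerate embeddings: zero covariance
new_rows = init_new_embeddings(old, n_new=2)
```

With degenerate old embeddings the Cholesky check fails and each new row is the mean vector, avoiding the crash the original code hit.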

cc @ArthurZucker

@abuelnasr0 abuelnasr0 changed the title Fix Failed tests with mobile bert Fix Failed tests with mobile bert resize tokens embedding Oct 4, 2024
ArthurZucker (Collaborator) left a comment

Sounds good to me. Why were the tests skipped?

ArthurZucker (Collaborator) commented:

Ah, could you update tests/models/recurrent_gemma/test_modeling_recurrent_gemma.py::RecurrentGemmaModelTest::test_resize_tokens_embeddings as well?

ArthurZucker (Collaborator) commented:

Ah, and tests/models/git/test_modeling_git.py::GitModelTest::test_resize_tokens_embeddings fails as well, with AssertionError: Padding_idx must be within num_embeddings.
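For reference, that assertion reflects the invariant that an embedding layer's padding index must address a valid row. A plain-Python sketch of the check (an assumed reading of the error message, not the actual torch source):

```python
# If padding_idx falls outside [-num_embeddings, num_embeddings), the layer
# cannot zero out a valid row, hence the AssertionError in the Git test.
def check_padding_idx(num_embeddings, padding_idx):
    if padding_idx is not None:
        assert -num_embeddings <= padding_idx < num_embeddings, \
            "Padding_idx must be within num_embeddings"

check_padding_idx(num_embeddings=99, padding_idx=98)  # valid: 98 < 99
```

After resizing the test model down to a tiny vocab, the config's pad_token_id also has to shrink below the new vocab_size, or this check trips.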

HuggingFaceDocBuilderDev commented:

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

abuelnasr0 (Contributor, Author) commented:

@ArthurZucker I have addressed the mobilebert test, the GitModel test, and the recurrent_gemma test.

The recurrent_gemma test failed because an outlier was sampled, so I multiplied the covariance by 1e-9 instead of 1e-5.
The Git model tests failed because the config was overwritten after the first resizing test and then used to initialize the model again, so I created a new copy for the new model initialization.

abuelnasr0 (Contributor, Author) commented Oct 4, 2024

I feel bad about those test failures, actually; I wanted to deliver good code, but the tests didn't help me. 😅

The outlier of recurrent_gemma got sampled only after merging the code haha.

And I am not sure whether the other tests were actually skipped before merging; it's weird.
Do you know why the test failures for mobilebert and GitModelTest appeared only after merging?

ArthurZucker (Collaborator) commented:

Yep! They are not part of the important models, and the test fetcher seems to be misbehaving. It should have found the whole dependency chain!

No worries, we are the ones who set you up for failure in that case!

cc @ydshieh if you can have a look at the reasons why this was not fetched when you have time!

ArthurZucker (Collaborator) left a comment

thanks 🤗

Comment thread src/transformers/modeling_utils.py
# Copy again. config changed with embedding resizing (`vocab_size` changed)
config = copy.deepcopy(original_config)
config.vocab_size = 4
config.pad_token_id = 3
ArthurZucker (Collaborator):

Why was this failing?

abuelnasr0 (Contributor, Author):

The pad_token_id in the config is 98 for the GitModel. This raises an error in the embedding layer because it is not smaller than the vocab_size.
The error surfaced in GitModel after fixing the earlier error caused by overwriting the configuration.
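The configuration fix can be illustrated with a minimal stand-in (Config here is a hypothetical stand-in for the real GitConfig, not the transformers class):

```python
import copy

# Reuse a fresh deep copy of the original config for each model init, so the
# first resize test's mutation of vocab_size does not leak into later inits.
class Config:
    def __init__(self, vocab_size=99, pad_token_id=98):
        self.vocab_size = vocab_size
        self.pad_token_id = pad_token_id

original_config = Config()
config = copy.deepcopy(original_config)
config.vocab_size = 4
config.pad_token_id = 3          # keep the padding index below the new vocab_size
```

A shallow copy or direct reuse would have let the vocab_size = 4 mutation carry over to the next model initialization, which is exactly what the Git test was tripping on.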

ydshieh (Collaborator) commented Oct 7, 2024

> Yep! They are not part of the important model, the test fetcher seems to badly behave! It should have found out the whole dependencies!
>
> No worries, we are the ones who set you up for failure in that case!
>
> cc @ydshieh if you can have a look at the reasons why this was not fetched when you have time!

Could one of you provide a link of the previous (PR) job run page where you believe there is something being missed by the test fetcher?

ArthurZucker (Collaborator) commented:

In the meantime, @abuelnasr0, could you commit with the message test_all 🤗

ArthurZucker (Collaborator) commented:

This way, all models should be run!

ArthurZucker (Collaborator) commented:

[image: CI screenshot] A few failing tests still!

ArthurZucker (Collaborator) commented:

Use RUN_SLOW=1 pytest -n 4 tests/models/*/test_modeling_* -k test_resize_tokens_embeddings to filter!

gante (Contributor) commented Oct 9, 2024

I hope you don't mind @abuelnasr0 -- this PR is blocking other PRs, so I'm taking care of the rest of the fixes 🤗

gante merged commit cdee528 into huggingface:main on Oct 9, 2024
gante (Contributor) commented Oct 9, 2024

(@ArthurZucker -- all tests in pytest tests/models/ -k test_resize_tokens_embeddings were green)

abuelnasr0 (Contributor, Author) commented:

@gante No problem. That is completely fine.
I am sorry for blocking other PRs! Thanks for the fixes!

ArthurZucker (Collaborator) commented:

@gante thanks for fixing it!

BernardZach pushed a commit to BernardZach/transformers that referenced this pull request Dec 5, 2024
…e#33950)

* Fix Failed tests with mobile bert

* Cast to the correct dtype

* Code fixup

* Fix padding_idx larger than embedding_size

* Reduce covariance more. use 1e-7 instead of 1e-5

* Comment fix

* Reduce covariance more. use 1e-9 instead of 1e-7

* Copy new config

* all but MRA fixed

* fix mra

* very flaky

* skip instead

* make fixup

---------

Co-authored-by: Joao Gante <joao@huggingface.co>