[OWL-VIT] Added sdpa attention #28818 (Closed)

nileshkokane01 wants to merge 10 commits into huggingface:main from nileshkokane01:sdpa_for_OWL_ViT

Conversation

@nileshkokane01 (Contributor) commented Feb 1, 2024

What does this PR do?

This PR adds SDPA (scaled dot-product attention) support for OWL-ViT.

Fixes #28103
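For context, once this lands, users should be able to opt in through the standard attn_implementation flag at load time. A hedged sketch; the checkpoint name is just an example:

```python
from transformers import OwlViTModel

# attn_implementation="sdpa" routes attention through
# torch.nn.functional.scaled_dot_product_attention.
model = OwlViTModel.from_pretrained(
    "google/owlvit-base-patch32",
    attn_implementation="sdpa",
)
```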

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

@NielsRogge @younesbelkada
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@nileshkokane01 (Contributor, Author) commented:

I have added an initial draft for SDPA attention, but I suspect more changes are needed, since OWL-ViT is a bit different from Llama or Mistral.

Could you point me to a similar model that is close to OWL-ViT, or let me know the additional changes required in the file? Also, causal_attention_mask is not handled correctly, and I have no clue how to handle it (a rough sketch of the overall structure follows). A corresponding test case is also necessary.
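For orientation, a minimal sketch of the shape such an SDPA attention class usually takes in transformers. This is not the code from this PR; the class name, the eager fallback, and the attribute names (q_proj, num_heads, head_dim, dropout) are assumptions modeled on the existing OwlViTAttention:

```python
import torch.nn.functional as F


class OwlViTSdpaAttention(OwlViTAttention):
    """SDPA variant that reuses the eager module's projection layers."""

    def forward(self, hidden_states, attention_mask=None, causal_attention_mask=None, output_attentions=False):
        if output_attentions:
            # SDPA cannot return attention weights, so fall back to the eager path.
            return super().forward(
                hidden_states,
                attention_mask=attention_mask,
                causal_attention_mask=causal_attention_mask,
                output_attentions=True,
            )

        bsz, tgt_len, embed_dim = hidden_states.size()

        # Keep the 4D layout (bsz, num_heads, seq_len, head_dim) that SDPA expects.
        query = self.q_proj(hidden_states).view(bsz, tgt_len, self.num_heads, self.head_dim).transpose(1, 2)
        key = self.k_proj(hidden_states).view(bsz, tgt_len, self.num_heads, self.head_dim).transpose(1, 2)
        value = self.v_proj(hidden_states).view(bsz, tgt_len, self.num_heads, self.head_dim).transpose(1, 2)

        # How to fold causal_attention_mask in is the open question discussed below.
        attn_output = F.scaled_dot_product_attention(
            query, key, value,
            attn_mask=attention_mask,
            dropout_p=self.dropout if self.training else 0.0,
        )
        attn_output = attn_output.transpose(1, 2).reshape(bsz, tgt_len, embed_dim)
        return self.out_proj(attn_output), None
```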

@ArthurZucker (Collaborator) commented:

fyi @younesbelkada

@younesbelkada (Contributor) left a review comment:

Good work, thanks! I left a few nits and a suggestion to fix the failing CI.
Could you follow the same logic as here: https://github.com/huggingface/transformers/blob/main/src/transformers/models/llama/modeling_llama.py#L1032-L1045 for preparing the attention mask for SDPA?
Also, similarly to Llama, could you add _supports_sdpa = True in OwlViTPreTrainedModel? That way the SDPA tests would be triggered (a sketch follows).
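For illustration, the flag change could look like the snippet below. Only the _supports_sdpa line is the suggested addition; the other attributes are assumptions copied from what the existing OwlViTPreTrainedModel typically declares:

```python
from transformers import PreTrainedModel
from transformers.models.owlvit.configuration_owlvit import OwlViTConfig


class OwlViTPreTrainedModel(PreTrainedModel):
    config_class = OwlViTConfig
    base_model_prefix = "owlvit"
    supports_gradient_checkpointing = True

    # Opting in makes the common test suite run the SDPA/eager equivalence checks.
    _supports_sdpa = True
```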

(Two review-comment threads on src/transformers/models/owlvit/modeling_owlvit.py, now marked outdated.)
@younesbelkada (Contributor) left a review comment:

Hi @nileshkokane01,
Thanks a lot for your hard work! It looks much cleaner! We're almost there; it seems some tests are failing:

FAILED tests/models/owlvit/test_modeling_owlvit.py::OwlViTTextModelTest::test_model_outputs_equivalence - TypeError: _prepare_4d_causal_attention_mask_for_sdpa() missing 1 required positional argument: 'past_key_values_length'

Are you able to reproduce these failures locally? I think the fix is to hardcode past_key_values_length to 0 in the call to that method, since OWL-ViT is not a generative text model and therefore does not use a caching mechanism (a sketch of the call follows).
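For illustration, a sketch of that call, assuming the helper from transformers.modeling_attn_mask_utils and locally available batch_size, seq_len, and inputs_embeds:

```python
from transformers.modeling_attn_mask_utils import _prepare_4d_causal_attention_mask_for_sdpa

# OWL-ViT never generates text, so there is no KV cache and the offset is always 0.
attention_mask = _prepare_4d_causal_attention_mask_for_sdpa(
    attention_mask,
    (batch_size, seq_len),  # input_shape
    inputs_embeds,
    past_key_values_length=0,
)
```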

@nileshkokane01 (Contributor, Author) commented:

@younesbelkada,
I'm trying to solve the errors. I'll let you know when it's all ready or if I need any assistance.

@nileshkokane01 (Contributor, Author) commented:

@younesbelkada,

I get the following error: the batch dimension gets dropped, so the shapes no longer match. Any clues?

[192, 16, 16] doesn't match the broadcast shape [48, 192, 16, 16]

Also, causal_attention_mask is not used at all in SDPA; I don't know how to handle it on the line below (one possible approach is sketched after the link).

https://github.com/nileshkokane01/transformers/blob/sdpa_for_OWL_ViT/src/transformers/models/owlvit/modeling_owlvit.py#L484
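One common way to handle this (an assumption, not something confirmed in this thread): both masks are additive float masks, 0.0 where attention is allowed and a large negative value where it is blocked, so they can simply be summed into the single attn_mask argument that SDPA accepts (F is torch.nn.functional, as in the earlier sketch):

```python
# Merge the padding mask and the causal mask; both are additive float masks.
if attention_mask is not None and causal_attention_mask is not None:
    attn_mask = attention_mask + causal_attention_mask
elif causal_attention_mask is not None:
    attn_mask = causal_attention_mask
else:
    attn_mask = attention_mask  # may be None, which SDPA accepts

attn_output = F.scaled_dot_product_attention(query, key, value, attn_mask=attn_mask)
```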

@nileshkokane01 (Contributor, Author) commented:

@younesbelkada,

I sort of tried to fix the batch-size dimensionality mismatch but couldn't figure it out. Any clue?

RuntimeError: output with shape [192, 16, 16] doesn't match the broadcast shape [48, 192, 16, 16]

With this, 11 tests seem to fail. (A guess at the cause is sketched below.)
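A guess at the cause, for illustration only: the shapes in the traceback can be reproduced in isolation with dummy tensors, which shows why the in-place add fails:

```python
import torch

# (bsz * num_heads, seq, seq): the flattened 3D layout used by the eager attention path.
attn_weights = torch.zeros(192, 16, 16)
# The 4D mask the code tries to add in place.
mask = torch.zeros(48, 192, 16, 16)

# An in-place op cannot grow its output tensor, so broadcasting 3D += 4D raises:
# RuntimeError: output with shape [192, 16, 16] doesn't match the broadcast shape [48, 192, 16, 16]
attn_weights += mask
```

If that reading is right, keeping query/key/value (and hence the attention weights) 4D in the SDPA branch, as in the earlier sketch, would avoid the flattened layout entirely.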

@younesbelkada (Contributor) commented:

Hi @nileshkokane01,
Thanks for getting back! For that I need to dive deep into your branch and try to fix things; I will do that in the next few days 🙏

amyeroberts changed the title from "Added sdpa attention" to "[OWL-VIT] Added sdpa attention" on Feb 19, 2024.
huggingface deleted a comment from the github-actions bot on Mar 15, 2024.
github-actions (Bot) commented Apr 9, 2024

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.



Development

Successfully merging this pull request may close these issues.

OWL-VIT Vision Foundation Model deployment in the edge cases - Need SDPA support for OWL-ViT Model optimization for Edge Deployment
