[Attn] More old mask APIs #43924

Draft
vasqu wants to merge 5 commits into huggingface:main from vasqu:fix-masks-p2
@vasqu
Contributor

vasqu commented Feb 11, 2026

WIP

@vasqu
Contributor Author

vasqu commented Feb 11, 2026

run-slow: big_bird,blip_2,bridgetower,clap,flava,ibert,instructblip,instructblipvideo,tapas,vilt

@github-actions
Contributor

This comment contains run-slow, running the specified jobs:

models: ["models/big_bird", "models/blip_2", "models/bridgetower", "models/clap", "models/flava", "models/ibert", "models/instructblip", "models/instructblipvideo", "models/tapas", "models/vilt"]
quantizations: []

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@github-actions
Contributor

CI Results

Workflow Run ⚙️

Commit Info

Context  Commit    Description
RUN      07369579  merge commit
PR       d295b5c6  branch commit
main     ae05b2ae  base commit

Model CI Report

31 new failed tests from this PR 😭

  • blip_2:
    tests/models/blip_2/test_modeling_blip_2.py::Blip2ForConditionalGenerationDecoderOnlyTest::test_eager_matches_sdpa_inference_00_fp16_pad_left_sdpa_kernels
    tests/models/blip_2/test_modeling_blip_2.py::Blip2ForConditionalGenerationDecoderOnlyTest::test_eager_matches_sdpa_inference_01_fp16_pad_left
    tests/models/blip_2/test_modeling_blip_2.py::Blip2ForConditionalGenerationDecoderOnlyTest::test_eager_matches_sdpa_inference_02_fp16_pad_left_no_attn_mask_sdpa_kernels
    tests/models/blip_2/test_modeling_blip_2.py::Blip2ForConditionalGenerationDecoderOnlyTest::test_eager_matches_sdpa_inference_03_fp16_pad_left_no_attn_mask
    tests/models/blip_2/test_modeling_blip_2.py::Blip2ForConditionalGenerationDecoderOnlyTest::test_eager_matches_sdpa_inference_04_fp16_pad_right_sdpa_kernels
    tests/models/blip_2/test_modeling_blip_2.py::Blip2ForConditionalGenerationDecoderOnlyTest::test_eager_matches_sdpa_inference_05_fp16_pad_right
    tests/models/blip_2/test_modeling_blip_2.py::Blip2ForConditionalGenerationDecoderOnlyTest::test_eager_matches_sdpa_inference_06_fp16_pad_right_no_attn_mask_sdpa_kernels
    tests/models/blip_2/test_modeling_blip_2.py::Blip2ForConditionalGenerationDecoderOnlyTest::test_eager_matches_sdpa_inference_07_fp16_pad_right_no_attn_mask
    tests/models/blip_2/test_modeling_blip_2.py::Blip2VisionModelWithProjectionTest::test_eager_matches_sdpa_inference_00_fp16_pad_left_sdpa_kernels
    tests/models/blip_2/test_modeling_blip_2.py::Blip2VisionModelWithProjectionTest::test_eager_matches_sdpa_inference_01_fp16_pad_left
    tests/models/blip_2/test_modeling_blip_2.py::Blip2VisionModelWithProjectionTest::test_eager_matches_sdpa_inference_02_fp16_pad_left_no_attn_mask_sdpa_kernels
    tests/models/blip_2/test_modeling_blip_2.py::Blip2VisionModelWithProjectionTest::test_eager_matches_sdpa_inference_03_fp16_pad_left_no_attn_mask
    tests/models/blip_2/test_modeling_blip_2.py::Blip2VisionModelWithProjectionTest::test_eager_matches_sdpa_inference_04_fp16_pad_right_sdpa_kernels
    tests/models/blip_2/test_modeling_blip_2.py::Blip2VisionModelWithProjectionTest::test_eager_matches_sdpa_inference_05_fp16_pad_right
    tests/models/blip_2/test_modeling_blip_2.py::Blip2VisionModelWithProjectionTest::test_eager_matches_sdpa_inference_06_fp16_pad_right_no_attn_mask_sdpa_kernels
    tests/models/blip_2/test_modeling_blip_2.py::Blip2VisionModelWithProjectionTest::test_eager_matches_sdpa_inference_07_fp16_pad_right_no_attn_mask
    tests/models/blip_2/test_modeling_blip_2.py::Blip2TextRetrievalModelTest::test_eager_matches_sdpa_inference_00_fp16_pad_left_sdpa_kernels
    tests/models/blip_2/test_modeling_blip_2.py::Blip2TextRetrievalModelTest::test_eager_matches_sdpa_inference_01_fp16_pad_left
    tests/models/blip_2/test_modeling_blip_2.py::Blip2TextRetrievalModelTest::test_eager_matches_sdpa_inference_02_fp16_pad_left_no_attn_mask_sdpa_kernels
    tests/models/blip_2/test_modeling_blip_2.py::Blip2TextRetrievalModelTest::test_eager_matches_sdpa_inference_03_fp16_pad_left_no_attn_mask
    tests/models/blip_2/test_modeling_blip_2.py::Blip2TextRetrievalModelTest::test_eager_matches_sdpa_inference_04_fp16_pad_right_sdpa_kernels
    tests/models/blip_2/test_modeling_blip_2.py::Blip2TextRetrievalModelTest::test_eager_matches_sdpa_inference_05_fp16_pad_right
    tests/models/blip_2/test_modeling_blip_2.py::Blip2TextRetrievalModelTest::test_eager_matches_sdpa_inference_06_fp16_pad_right_no_attn_mask_sdpa_kernels
    tests/models/blip_2/test_modeling_blip_2.py::Blip2TextRetrievalModelTest::test_eager_matches_sdpa_inference_07_fp16_pad_right_no_attn_mask
    tests/models/blip_2/test_modeling_blip_2.py::Blip2ModelIntegrationTest::test_inference_interpolate_pos_encoding
    tests/models/blip_2/test_modeling_blip_2.py::Blip2ModelIntegrationTest::test_inference_itm_fp16
    tests/models/blip_2/test_modeling_blip_2.py::Blip2ModelIntegrationTest::test_inference_opt
    tests/models/blip_2/test_modeling_blip_2.py::Blip2ModelIntegrationTest::test_inference_opt_batched_beam_search
    tests/models/blip_2/test_modeling_blip_2.py::Blip2ModelIntegrationTest::test_inference_vision_with_projection_fp16

  • instructblip:
    tests/models/instructblip/test_modeling_instructblip.py::InstructBlipForConditionalGenerationDecoderOnlyTest::test_torch_export

  • instructblipvideo:
    tests/models/instructblipvideo/test_modeling_instructblipvideo.py::InstructBlipVideoForConditionalGenerationDecoderOnlyTest::test_torch_export

@github-actions
Contributor

[For maintainers] Suggested jobs to run (before merge)

run-slow: align, altclip, big_bird, blip, blip_2, bridgetower, bros, canine, chinese_clip, clap, convbert, flava, ibert, idefics, imagegpt, instructblip

@vasqu
Contributor Author

vasqu commented Feb 11, 2026

run-slow: align,altclip,big_bird,blip,blip_2,bridgetower,bros,canine,chinese_clip,clap,convbert,flava,ibert,idefics,imagegpt,instructblip,instructblipvideo,layoutlmv3,lightglue,lilt,longformer,longt5,luke,megatron_bert,mpnet,mra,nystromformer,perceiver,pix2struct,pop2piano,rembert,roformer,splinter,squeezebert,superglue,switch_transformers,tapas,tvp,udop,umt5,vilt,visual_bert

@github-actions
Contributor

This comment contains run-slow, running the specified jobs:

models: ["models/align", "models/altclip", "models/big_bird", "models/blip", "models/blip_2", "models/bridgetower", "models/bros", "models/canine", "models/chinese_clip", "models/clap", "models/convbert", "models/flava", "models/ibert", "models/idefics", "models/imagegpt", "models/instructblip", "models/instructblipvideo", "models/layoutlmv3", "models/lightglue", "models/lilt", "models/longformer", "models/longt5", "models/luke", "models/megatron_bert", "models/mpnet", "models/mra", "models/nystromformer", "models/perceiver", "models/pix2struct", "models/pop2piano", "models/rembert", "models/roformer", "models/splinter", "models/squeezebert", "models/superglue", "models/switch_transformers", "models/tapas", "models/tvp", "models/udop", "models/umt5", "models/vilt", "models/visual_bert"]
quantizations: []

@github-actions
Contributor

CI Results

Workflow Run ⚙️

Commit Info

Context  Commit    Description
RUN      03bccdd1  merge commit
PR       f79c3a01  branch commit
main     ae05b2ae  base commit

Model CI Report

39 new failed tests from this PR 😭

  • blip_2:
    tests/models/blip_2/test_modeling_blip_2.py::Blip2ForConditionalGenerationDecoderOnlyTest::test_eager_matches_sdpa_inference_00_fp16_pad_left_sdpa_kernels
    tests/models/blip_2/test_modeling_blip_2.py::Blip2ForConditionalGenerationDecoderOnlyTest::test_eager_matches_sdpa_inference_01_fp16_pad_left
    tests/models/blip_2/test_modeling_blip_2.py::Blip2ForConditionalGenerationDecoderOnlyTest::test_eager_matches_sdpa_inference_02_fp16_pad_left_no_attn_mask_sdpa_kernels
    tests/models/blip_2/test_modeling_blip_2.py::Blip2ForConditionalGenerationDecoderOnlyTest::test_eager_matches_sdpa_inference_03_fp16_pad_left_no_attn_mask
    tests/models/blip_2/test_modeling_blip_2.py::Blip2ForConditionalGenerationDecoderOnlyTest::test_eager_matches_sdpa_inference_04_fp16_pad_right_sdpa_kernels
    tests/models/blip_2/test_modeling_blip_2.py::Blip2ForConditionalGenerationDecoderOnlyTest::test_eager_matches_sdpa_inference_05_fp16_pad_right
    tests/models/blip_2/test_modeling_blip_2.py::Blip2ForConditionalGenerationDecoderOnlyTest::test_eager_matches_sdpa_inference_06_fp16_pad_right_no_attn_mask_sdpa_kernels
    tests/models/blip_2/test_modeling_blip_2.py::Blip2ForConditionalGenerationDecoderOnlyTest::test_eager_matches_sdpa_inference_07_fp16_pad_right_no_attn_mask
    tests/models/blip_2/test_modeling_blip_2.py::Blip2VisionModelWithProjectionTest::test_eager_matches_sdpa_inference_00_fp16_pad_left_sdpa_kernels
    tests/models/blip_2/test_modeling_blip_2.py::Blip2VisionModelWithProjectionTest::test_eager_matches_sdpa_inference_01_fp16_pad_left
    tests/models/blip_2/test_modeling_blip_2.py::Blip2VisionModelWithProjectionTest::test_eager_matches_sdpa_inference_02_fp16_pad_left_no_attn_mask_sdpa_kernels
    tests/models/blip_2/test_modeling_blip_2.py::Blip2VisionModelWithProjectionTest::test_eager_matches_sdpa_inference_03_fp16_pad_left_no_attn_mask
    tests/models/blip_2/test_modeling_blip_2.py::Blip2VisionModelWithProjectionTest::test_eager_matches_sdpa_inference_04_fp16_pad_right_sdpa_kernels
    tests/models/blip_2/test_modeling_blip_2.py::Blip2VisionModelWithProjectionTest::test_eager_matches_sdpa_inference_05_fp16_pad_right
    tests/models/blip_2/test_modeling_blip_2.py::Blip2VisionModelWithProjectionTest::test_eager_matches_sdpa_inference_06_fp16_pad_right_no_attn_mask_sdpa_kernels
    tests/models/blip_2/test_modeling_blip_2.py::Blip2VisionModelWithProjectionTest::test_eager_matches_sdpa_inference_07_fp16_pad_right_no_attn_mask
    tests/models/blip_2/test_modeling_blip_2.py::Blip2TextRetrievalModelTest::test_eager_matches_sdpa_inference_00_fp16_pad_left_sdpa_kernels
    tests/models/blip_2/test_modeling_blip_2.py::Blip2TextRetrievalModelTest::test_eager_matches_sdpa_inference_01_fp16_pad_left
    tests/models/blip_2/test_modeling_blip_2.py::Blip2TextRetrievalModelTest::test_eager_matches_sdpa_inference_02_fp16_pad_left_no_attn_mask_sdpa_kernels
    tests/models/blip_2/test_modeling_blip_2.py::Blip2TextRetrievalModelTest::test_eager_matches_sdpa_inference_03_fp16_pad_left_no_attn_mask
    tests/models/blip_2/test_modeling_blip_2.py::Blip2TextRetrievalModelTest::test_eager_matches_sdpa_inference_04_fp16_pad_right_sdpa_kernels
    tests/models/blip_2/test_modeling_blip_2.py::Blip2TextRetrievalModelTest::test_eager_matches_sdpa_inference_05_fp16_pad_right
    tests/models/blip_2/test_modeling_blip_2.py::Blip2TextRetrievalModelTest::test_eager_matches_sdpa_inference_06_fp16_pad_right_no_attn_mask_sdpa_kernels
    tests/models/blip_2/test_modeling_blip_2.py::Blip2TextRetrievalModelTest::test_eager_matches_sdpa_inference_07_fp16_pad_right_no_attn_mask
    tests/models/blip_2/test_modeling_blip_2.py::Blip2ModelIntegrationTest::test_inference_interpolate_pos_encoding
    tests/models/blip_2/test_modeling_blip_2.py::Blip2ModelIntegrationTest::test_inference_itm_fp16
    tests/models/blip_2/test_modeling_blip_2.py::Blip2ModelIntegrationTest::test_inference_opt
    tests/models/blip_2/test_modeling_blip_2.py::Blip2ModelIntegrationTest::test_inference_opt_batched_beam_search
    tests/models/blip_2/test_modeling_blip_2.py::Blip2ModelIntegrationTest::test_inference_vision_with_projection_fp16

  • idefics:
    tests/models/idefics/test_modeling_idefics.py::IdeficsModelIntegrationTest::test_inference_natural_language_visual_reasoning

  • lightglue:
    tests/models/lightglue/test_modeling_lightglue.py::LightGlueModelTest::test_eager_matches_sdpa_inference_08_fp32_pad_left_sdpa_kernels
    tests/models/lightglue/test_modeling_lightglue.py::LightGlueModelTest::test_eager_matches_sdpa_inference_09_fp32_pad_left
    tests/models/lightglue/test_modeling_lightglue.py::LightGlueModelTest::test_eager_matches_sdpa_inference_10_fp32_pad_left_no_attn_mask_sdpa_kernels
    tests/models/lightglue/test_modeling_lightglue.py::LightGlueModelTest::test_eager_matches_sdpa_inference_11_fp32_pad_left_no_attn_mask
    tests/models/lightglue/test_modeling_lightglue.py::LightGlueModelTest::test_eager_matches_sdpa_inference_12_fp32_pad_right_sdpa_kernels
    tests/models/lightglue/test_modeling_lightglue.py::LightGlueModelTest::test_eager_matches_sdpa_inference_13_fp32_pad_right
    tests/models/lightglue/test_modeling_lightglue.py::LightGlueModelTest::test_eager_matches_sdpa_inference_14_fp32_pad_right_no_attn_mask_sdpa_kernels
    tests/models/lightglue/test_modeling_lightglue.py::LightGlueModelTest::test_eager_matches_sdpa_inference_15_fp32_pad_right_no_attn_mask
    tests/models/lightglue/test_modeling_lightglue.py::LightGlueModelTest::test_eager_matches_sdpa_inference_24_fp32_pad_left_output_attentions

2 participants