Generalize gemma vision mask to videos by zucchini-nlp · Pull Request #45185 · huggingface/transformers

zucchini-nlp · 2026-04-02T11:35:53Z

What does this PR do?

If we have videos, the token type ids will be 2 but the current fn checks only image token types. This PR generalizes it rely only on vision_group_ids instead of token types

zucchini-nlp · 2026-04-02T11:36:17Z

run-slow: gemma3, paligemma, git

github-actions · 2026-04-02T11:37:35Z

Workflow Run ⚙️

This comment contains run-slow, running the specified jobs:

models: ["models/gemma3", "models/git", "models/paligemma"]
quantizations: []

HuggingFaceDocBuilderDev · 2026-04-02T11:44:59Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

ArthurZucker · 2026-04-02T11:56:38Z

    This function adds the correct offsets to the `q_idx` and `kv_idx` as the torch API can only accept lengths,
    not start and end indices.
+    Args:
+        vision_group_ids (`torch.Tensor`):


Suggested change

vision_group_ids (`torch.Tensor`):

group_ids (`torch.Tensor`):

as it includes text group as well

github-actions · 2026-04-02T12:21:21Z

CI Results

Workflow Run ⚙️

Commit Info

Context	Commit	Description
RUN	8fcab2dd	workflow commit (merge commit)
PR	c05182ba	branch commit (from PR)
main	abc417a4	base commit (on `main`)

✅ No failing test specific to this PR 🎉 👏 !

github-actions · 2026-04-02T12:35:58Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: gemma3, git, paligemma

* more general inner mask * arthur's comment - rename everywhere

more general inner mask

c05182b

ArthurZucker approved these changes Apr 2, 2026

View reviewed changes

arthur's comment - rename everywhere

8d7904d

zucchini-nlp enabled auto-merge April 2, 2026 12:35

zucchini-nlp added this pull request to the merge queue Apr 2, 2026

Merged via the queue into huggingface:main with commit ade7a05 Apr 2, 2026
22 checks passed

zucchini-nlp deleted the gemma-masks branch April 2, 2026 13:15

marvinzh pushed a commit to marvinzh/transformers that referenced this pull request Apr 3, 2026

Generalize gemma vision mask to videos (huggingface#45185)

6cb79d5

* more general inner mask * arthur's comment - rename everywhere

SangbumChoi pushed a commit to SangbumChoi/transformers that referenced this pull request Apr 4, 2026

Generalize gemma vision mask to videos (huggingface#45185)

cc60315

* more general inner mask * arthur's comment - rename everywhere

sirzechs66 pushed a commit to sirzechs66/transformers that referenced this pull request Apr 18, 2026

Generalize gemma vision mask to videos (huggingface#45185)

ca23581

* more general inner mask * arthur's comment - rename everywhere

evalstate mentioned this pull request Apr 28, 2026

Cumulative defect fixes from recent Transformers PRs evalstate/transformers#41

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Generalize gemma vision mask to videos#45185

Generalize gemma vision mask to videos#45185
zucchini-nlp merged 2 commits intohuggingface:mainfrom
zucchini-nlp:gemma-masks

zucchini-nlp commented Apr 2, 2026

Uh oh!

zucchini-nlp commented Apr 2, 2026

Uh oh!

github-actions Bot commented Apr 2, 2026

Uh oh!

HuggingFaceDocBuilderDev commented Apr 2, 2026

Uh oh!

ArthurZucker Apr 2, 2026

Uh oh!

ArthurZucker Apr 2, 2026

Uh oh!

github-actions Bot commented Apr 2, 2026

Uh oh!

github-actions Bot commented Apr 2, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	vision_group_ids (`torch.Tensor`):
	group_ids (`torch.Tensor`):

Conversation

zucchini-nlp commented Apr 2, 2026

What does this PR do?

Uh oh!

zucchini-nlp commented Apr 2, 2026

Uh oh!

github-actions Bot commented Apr 2, 2026

Uh oh!

HuggingFaceDocBuilderDev commented Apr 2, 2026

Uh oh!

ArthurZucker Apr 2, 2026

Choose a reason for hiding this comment

Uh oh!

ArthurZucker Apr 2, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot commented Apr 2, 2026

CI Results

Commit Info

Uh oh!

github-actions Bot commented Apr 2, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants