fix(gemma3, gemma4): default token_type_ids to zeros for text-only training#45222

Closed
jashshah999 wants to merge 1 commit into huggingface:main from jashshah999:fix/gemma-text-only-training

Conversation

@jashshah999
Contributor

Summary

When using Gemma 3 or Gemma 4 for text-only supervised fine-tuning (no images), the forward pass raises a ValueError because token_type_ids / mm_token_type_ids is not provided. This happens because AutoTokenizer does not produce these fields -- only the multimodal Processor does.

The fix defaults to all-zeros when token_type_ids / mm_token_type_ids is None during training, instead of raising. When all zeros, is_vision is entirely False, so the bidirectional vision mask branch is skipped and a standard causal mask is produced -- which is exactly correct for text-only input.
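The defaulting logic described above can be sketched as follows. This is a minimal illustration, not the actual modeling code; `resolve_token_type_ids` is a hypothetical helper name, and the real fix lives inside the Gemma forward/mask-building code:

```python
import torch

def resolve_token_type_ids(input_ids, token_type_ids=None):
    """Hypothetical helper sketching the fix: when the tokenizer did not
    produce token_type_ids (text-only input), default to all zeros
    instead of raising ValueError."""
    if token_type_ids is None:
        token_type_ids = torch.zeros_like(input_ids)
    # Vision tokens are marked with a nonzero type id; an all-zeros
    # tensor means is_vision is entirely False, so the bidirectional
    # vision-mask branch is skipped and a plain causal mask is used.
    is_vision = token_type_ids == 1
    return token_type_ids, is_vision

# Text-only batch from AutoTokenizer: no token_type_ids field.
input_ids = torch.tensor([[5, 7, 9]])
token_type_ids, is_vision = resolve_token_type_ids(input_ids)
print(is_vision.any().item())  # False for text-only input
```

Because `is_vision` is all `False`, the mask construction falls through to the standard causal path, which is the correct behavior for text-only supervised fine-tuning.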

Changes

  • modeling_gemma4.py / modular_gemma4.py: default mm_token_type_ids to torch.zeros(...) instead of raising ValueError
  • modeling_gemma3.py / modular_gemma3.py: same fix for token_type_ids (same root cause)

Fixes #45200

@github-actions
Contributor

github-actions Bot commented Apr 3, 2026

[For maintainers] Suggested jobs to run (before merge)

run-slow: gemma3, gemma4

Member

@zucchini-nlp zucchini-nlp left a comment


@mingxiang1006

This fix is helpful; it would be good to merge it into the main branch.

@zucchini-nlp
Member

I think the user who opened this PR is a bot, so I will merge a fix myself today. Thanks for the ping @mingxiang1006

@jashshah999
Contributor Author

Hey, I'm not a bot! I can still do it -- should I?

@zucchini-nlp
Member

Hey @jashshah999

I didn't hear back on the previous comment, so I assumed it was a pure code-agent PR 😅. The Gemma4 model is pretty important to keep working correctly, so I went ahead and pushed a fix in a separate PR to unblock things faster (#45454).

Thanks for putting in the effort though, and I hope this doesn't discourage you from contributing in the future!



Development

Successfully merging this pull request may close these issues.

[Gemma 4] mm_token_type_ids required for text-only fine-tuning - should default to zeros

3 participants