[chat template] return assistant mask in processors by zucchini-nlp · Pull Request #38545 · huggingface/transformers

zucchini-nlp · 2025-06-03T07:18:30Z

What does this PR do?

Fixes #38521. I checked with fast tokenizers' implementation of word_to_char and saw no difference in the time taken, so I think this can be the permanent solution

Otherwise we can add in BatchFeature support for EncodingFast features, though I don't think anyone needs them and I have never seen user requesting it

HuggingFaceDocBuilderDev · 2025-06-03T07:32:58Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

zucchini-nlp · 2025-06-12T10:18:31Z

Ready for review!

zucchini-nlp · 2025-06-12T10:20:29Z

+        is_tokenizers_fast = hasattr(self, "tokenizer") and self.tokenizer.__class__.__name__.endswith("Fast")
+


I see that tokenizer's never checks this, probably because all new LLMs support fast tokenizers. Though users can force set use_fast=False for some reasons and the error message in that case is not informative

Should I add the check on tokenizer's apply_chat_template as well, WDYT?

Rocketknight1

Sorry for taking so long to get to this! The logic makes sense, but the use of bisect_left confused me for a bit. After staring at it for a while, though, I think it's valid.

github-actions · 2025-07-18T12:11:41Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: csm, shieldgemma2

* messed up the git history, squash commits * raise error if slow and refine tests * index was off by one * fix the test

messed up the git history, squash commits

61112a1

zucchini-nlp force-pushed the chat-template-assistant-mask branch from 7d3e2a9 to 61112a1 Compare June 3, 2025 07:19

zucchini-nlp added 2 commits June 12, 2025 11:08

raise error if slow and refine tests

5ed3442

index was off by one

f5623a7

zucchini-nlp changed the title ~~[WIP chat template] return assistant mask in processors~~ [chat template] return assistant mask in processors Jun 12, 2025

fix the test

025afdb

zucchini-nlp requested a review from Rocketknight1 June 12, 2025 10:18

zucchini-nlp commented Jun 12, 2025

View reviewed changes

zucchini-nlp added 2 commits June 17, 2025 13:47

Merge branch 'main' into chat-template-assistant-mask

451450d

Merge branch 'main' into chat-template-assistant-mask

6dd6265

Rocketknight1 approved these changes Jul 15, 2025

View reviewed changes

merge main

6f3b708

zucchini-nlp enabled auto-merge (squash) July 18, 2025 08:27

zucchini-nlp disabled auto-merge July 18, 2025 08:38

Merge branch 'main' into chat-template-assistant-mask

43bc9a7

zucchini-nlp enabled auto-merge (squash) July 18, 2025 10:10

Merge branch 'main' into chat-template-assistant-mask

3847fc2

zucchini-nlp merged commit bcc0091 into huggingface:main Jul 18, 2025
25 checks passed

umbilnm mentioned this pull request Mar 9, 2026

Fix assistant_masks for multimodal inputs in apply_chat_template #44543

Open

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[chat template] return assistant mask in processors#38545

[chat template] return assistant mask in processors#38545
zucchini-nlp merged 9 commits intohuggingface:mainfrom
zucchini-nlp:chat-template-assistant-mask

zucchini-nlp commented Jun 3, 2025 •

edited

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Jun 3, 2025

Uh oh!

zucchini-nlp commented Jun 12, 2025

Uh oh!

zucchini-nlp Jun 12, 2025

Uh oh!

Rocketknight1 left a comment

Uh oh!

github-actions Bot commented Jul 18, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		is_tokenizers_fast = hasattr(self, "tokenizer") and self.tokenizer.__class__.__name__.endswith("Fast")

Conversation

zucchini-nlp commented Jun 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Uh oh!

HuggingFaceDocBuilderDev commented Jun 3, 2025

Uh oh!

zucchini-nlp commented Jun 12, 2025

Uh oh!

zucchini-nlp Jun 12, 2025

Choose a reason for hiding this comment

Uh oh!

Rocketknight1 left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions Bot commented Jul 18, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

zucchini-nlp commented Jun 3, 2025 •

edited

Loading