
fix(serving): resolve rust tokenizer from ProcessorMixin in streaming generation#45368

Merged
zucchini-nlp merged 1 commit into huggingface:main from sharziki:fix/45362-processor-tokenizer-attribute
Apr 13, 2026

Conversation

@sharziki
Contributor

Summary

Fixes #45362: transformers chat crashes with AttributeError: 'Qwen3VLProcessor' object has no attribute '_tokenizer' when streaming responses from Qwen models.

Root cause: GenerateManager.generate_streaming() and CBGenerateManager.generate_streaming() access processor._tokenizer to get the Rust tokenizer backend. This works for PreTrainedTokenizerFast (which stores the Rust backend at ._tokenizer), but ProcessorMixin subclasses like Qwen3VLProcessor expose the fast tokenizer at the public .tokenizer attribute instead.

Fix: Use getattr(processor, "tokenizer", processor)._tokenizer to first resolve the fast tokenizer (which is processor.tokenizer for ProcessorMixin, or processor itself for PreTrainedTokenizerFast), then access ._tokenizer for the Rust backend.

Two locations updated:

  • GenerateManager.generate_streaming() (line 565)
  • CBGenerateManager.generate_streaming() (line 664)
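
The resolution logic above can be sketched with stand-in classes (hypothetical names; in transformers the real objects are ProcessorMixin subclasses such as Qwen3VLProcessor and PreTrainedTokenizerFast):

```python
class RustTokenizer:
    """Stand-in for the Rust (tokenizers-library) backend."""


class FastTokenizer:
    """Stand-in for PreTrainedTokenizerFast: stores the Rust backend
    at the private ._tokenizer attribute."""
    def __init__(self):
        self._tokenizer = RustTokenizer()


class Processor:
    """Stand-in for a ProcessorMixin subclass: exposes the fast
    tokenizer at the public .tokenizer attribute and has no
    ._tokenizer of its own."""
    def __init__(self):
        self.tokenizer = FastTokenizer()


def resolve_rust_tokenizer(processor_or_tokenizer):
    # For a processor, getattr() finds .tokenizer and returns the fast
    # tokenizer; for a fast tokenizer, the default kicks in and the
    # object itself is used. Either way, ._tokenizer is the Rust backend.
    return getattr(processor_or_tokenizer, "tokenizer", processor_or_tokenizer)._tokenizer


# Both call sites now work for both input types:
assert isinstance(resolve_rust_tokenizer(Processor()), RustTokenizer)
assert isinstance(resolve_rust_tokenizer(FastTokenizer()), RustTokenizer)
```

This is why the previous code crashed only for multimodal models: text-only models hand the streamer a PreTrainedTokenizerFast (which has ._tokenizer), while processor-wrapped models hand it a ProcessorMixin (which does not).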

Test plan

  • Verify that transformers chat Qwen/Qwen3.5-35B-A3B no longer crashes on the first prompt
  • Verify streaming works correctly with non-processor models (e.g. text-only models)
  • ruff check src/transformers/cli/serving/utils.py passes

🤖 Generated with Claude Code

… generation

ProcessorMixin subclasses (e.g. Qwen3VLProcessor) expose the fast tokenizer
at .tokenizer, not ._tokenizer. Use getattr() to handle both ProcessorMixin
and PreTrainedTokenizerFast when extracting the rust tokenizer backend for
DirectStreamer and CBStreamer.

Fixes huggingface#45362

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@Rocketknight1
Member

cc @zucchini-nlp for processors and @LysandreJik for transformers serve maybe?

Member

@zucchini-nlp zucchini-nlp left a comment


Yep, processor can't have a private _tokenizer attr

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@zucchini-nlp
Member

I'll merge since it seems quite straightforward, and users are reporting they cannot run Gemma4

@zucchini-nlp zucchini-nlp added this pull request to the merge queue Apr 13, 2026
Merged via the queue into huggingface:main with commit a1b89d7 Apr 13, 2026
18 checks passed
sirzechs66 pushed a commit to sirzechs66/transformers that referenced this pull request Apr 18, 2026


Development

Successfully merging this pull request may close these issues.

Qwen3.5-35B crashes with transformers chat

4 participants