[processors] Unbloating simple processors by zucchini-nlp · Pull Request #40377 · huggingface/transformers

zucchini-nlp · 2025-08-22T15:05:50Z

What does this PR do?

I think most processor don't have special functions except for passing each modality to its own preprocessor and combining outputs. This PR is an attempt to modularize processor's __call__ method, we define a default call and delete model-specific code in processor files if it is same as the default one.

Currently we have a few patterns in processors, so imo we can have model-specific methods to handle preparing inputs etc., and keep common code in the Mixin

Simply combine outputs from each attribute
Expand text sequences with special tokens and then combine output
Special handling for bboxes and other image-like inputs

I will split it into several PRs to make review process easier and faster. This PR will only start from easy processors

molbap · 2025-08-22T15:14:09Z

Very nice initiative, it's something that has been bothering me for a while. It could also be the occasion to allow users to have their custom processing piped in, same way we externalize attention classes. I know it's something that's requested sometimes by users especially concerned with processing in their training loop

HuggingFaceDocBuilderDev · 2025-08-22T15:15:13Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

zucchini-nlp · 2025-08-22T15:22:03Z

It could also be the occasion to allow users to have their custom processing piped in, same way we externalize attention classes.

Yeah, 100%, would love to sort out processors, Let's break BC taking advantage of v5! 😆

zucchini-nlp · 2025-08-25T10:34:25Z

Failing tests are unrelated and are failing on main as well

zucchini-nlp · 2025-08-29T11:25:38Z

@bot /style

molbap

👀 unbloat unbloat 👀

molbap · 2025-08-29T13:27:24Z

        processor = self.processor_class.from_pretrained(
            "deepseek-community/Janus-Pro-1B",
            extra_special_tokens=special_image_tokens,
+            **self.prepare_processor_dict(),


maybe declare it above to avoid the nesting

zucchini-nlp · 2025-09-03T16:20:59Z

@qubvel gentle ping ;)

qubvel

Thanks! Very nice unbloating 🔥 🔥 🔥

The only thing that caught my eye is a changed signature, it would be perfect to keep it

main

PR

zucchini-nlp · 2025-09-08T11:25:54Z

Hmm ig that was caused by a different PR and it is as the second option in main branch. But I get the idea that processor specific kwargs (if any) will not be in typing

qubvel · 2025-09-08T12:10:38Z

Hmm ig that was caused by a #40676 and it is as the second option in main branch. But I get the idea that processor specific kwargs (if any) will not be in typing

ahh, thanks for letting me know, it seems I didn't pull the latest changes 😄

github-actions · 2025-09-09T16:20:16Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: align, altclip, bridgetower, bros, chameleon, chinese_clip, clap, clip, clipseg, clvp, colpali, deepseek_vl, deepseek_vl_hybrid, donut, emu3, flava

zucchini-nlp · 2025-09-10T08:37:13Z

Since the typing hints are not changed in comparison to main, merging. I explored a way if we want to have different typing in kwargs for specific models, but seems that Generic doesn't work well with Unpack yet :(

I'll see if there are any better options

* modularize processor - step 1 * typos * why raise error, super call check it also * tiny update * fix copies * fix style and test * lost an import / fix copies * fix tests * oops deleted accidentally

zucchini-nlp added 4 commits August 22, 2025 15:54

modularize processor - step 1

3d429c9

typos

c4c47af

why raise error, super call check it also

1685395

tiny update

37244ce

zucchini-nlp added 2 commits August 22, 2025 17:40

fix copies

6a8d4d2

Merge branch 'main' into clean-processors

feba24c

zucchini-nlp requested a review from qubvel August 25, 2025 10:34

Merge branch 'main' into clean-processors

8076cc6

molbap reviewed Aug 29, 2025

View reviewed changes

zucchini-nlp added 4 commits September 1, 2025 13:32

Merge branch 'main' into clean-processors

6aff7bb

fix style and test

d31632e

Merge branch 'main' into clean-processors

c3fa29e

Merge branch 'main' into clean-processors

32e5b4e

zucchini-nlp added 3 commits September 5, 2025 13:08

Merge branch 'main' into clean-processors

affba48

Merge branch 'main' into clean-processors

fc8fab3

lost an import / fix copies

0e91055

qubvel reviewed Sep 8, 2025

View reviewed changes

Comment thread src/transformers/models/clap/processing_clap.py Outdated

qubvel approved these changes Sep 8, 2025

View reviewed changes

zucchini-nlp added 2 commits September 9, 2025 16:42

fix tests

6c076d5

oops deleted accidentally

fc05d4f

Merge branch 'main' into clean-processors

b9b8101

zucchini-nlp merged commit 08edec9 into huggingface:main Sep 10, 2025
23 checks passed

harshaljanjani mentioned this pull request Mar 14, 2026

fix(testing): Fix Kyutai Speech-To-Text and LongCatFlash test failures on main CI #44695

Merged

5 tasks

Conversation

zucchini-nlp commented Aug 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Uh oh!

molbap commented Aug 22, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Aug 22, 2025

Uh oh!

zucchini-nlp commented Aug 22, 2025

Uh oh!

zucchini-nlp commented Aug 25, 2025

Uh oh!

zucchini-nlp commented Aug 29, 2025

Uh oh!

molbap left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

molbap Aug 29, 2025

Choose a reason for hiding this comment

Uh oh!

zucchini-nlp commented Sep 3, 2025

Uh oh!

qubvel left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

zucchini-nlp commented Sep 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

qubvel commented Sep 8, 2025

Uh oh!

github-actions Bot commented Sep 9, 2025

Uh oh!

zucchini-nlp commented Sep 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

zucchini-nlp commented Aug 22, 2025 •

edited

Loading

zucchini-nlp commented Sep 8, 2025 •

edited

Loading

zucchini-nlp commented Sep 10, 2025 •

edited

Loading