Add Fast Image Processor for Flava #37135
Conversation
Hi 👋, thank you for opening this pull request! The pull request is converted to draft by default. The CI will be paused while the PR is in draft mode. When it is ready for review, please click the "Ready for review" button.
yonigozlan left a comment
Hi again @rootonchair, very nice work! Quite an exotic image processor, but very nicely handled. I left a few comments on some things that could be simplified, and on writing a torch FlavaMaskingGenerator. Other than that, LGTM!
```python
resample = kwargs.pop("codebook_resample")
kwargs["codebook_interpolation"] = (
    pil_torch_interpolation_mapping[resample] if isinstance(resample, (PILImageResampling, int)) else resample
)
```
It would be better to do this by overriding `self._further_process_kwargs`, to avoid overriding the whole `preprocess` function.
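As a self-contained sketch of that idea (the enum value and mapping below are toy stand-ins, not the real `PILImageResampling` enum or `pil_torch_interpolation_mapping` dict from transformers), the remapping can live in a single overridable hook instead of a full `preprocess` override:

```python
PIL_BICUBIC = 3  # stand-in for PILImageResampling.BICUBIC
pil_torch_interpolation_mapping = {PIL_BICUBIC: "InterpolationMode.BICUBIC"}  # stand-in mapping

def further_process_kwargs(**kwargs):
    # Remap the slow-processor kwarg "codebook_resample" to the fast-processor
    # kwarg "codebook_interpolation" in one place.
    resample = kwargs.pop("codebook_resample", None)
    if resample is not None:
        # Map PIL-style enum values to their torchvision equivalents;
        # pass anything else through unchanged.
        kwargs["codebook_interpolation"] = pil_torch_interpolation_mapping.get(resample, resample)
    return kwargs
```

In the real processor this logic would sit in an overridden `_further_process_kwargs` that also calls the parent implementation for all other kwargs.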
Sure. Changed accordingly
```python
    mask_group_min_aspect_ratio,
    mask_group_max_aspect_ratio,
) -> FlavaMaskingGenerator:
    return FlavaMaskingGenerator(
```
The FlavaMaskingGenerator in the slow processing file generates numpy array masks; it would be great to write a FlavaMaskingGenerator generating torch tensor masks in this file.
I have written another FlavaMaskingGenerator that operates on tensors and optimizes away redundant loops. However, I don't think we can optimize further to have it operate on batched input.
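For illustration, a minimal torch-native blockwise (BEiT-style) mask generator along these lines — a sketch, not the PR's exact FlavaMaskingGenerator, with made-up parameter names — draws rectangular patch groups directly on a torch bool tensor, so the only remaining loop is the rectangle-sampling loop itself:

```python
import math
import random

import torch

def block_mask(height=14, width=14, num_masking_patches=75,
               min_group_patches=16, min_aspect=0.3, max_attempts=100):
    """Mask rectangular patch groups until enough patches are masked."""
    mask = torch.zeros(height, width, dtype=torch.bool)
    log_aspect = (math.log(min_aspect), math.log(1.0 / min_aspect))
    attempts = 0
    while mask.sum().item() < num_masking_patches and attempts < max_attempts:
        attempts += 1
        remaining = num_masking_patches - mask.sum().item()
        # Sample a group area and aspect ratio, then derive its height/width.
        target_area = random.uniform(min_group_patches, max(min_group_patches, remaining))
        aspect = math.exp(random.uniform(*log_aspect))
        h = int(round(math.sqrt(target_area * aspect)))
        w = int(round(math.sqrt(target_area / aspect)))
        if 0 < h < height and 0 < w < width:
            top = random.randint(0, height - h)
            left = random.randint(0, width - w)
            # Vectorized slice assignment replaces the per-pixel inner loops.
            mask[top:top + h, left:left + w] = True
    return mask
```

Because each mask is sampled independently with data-dependent loop counts, batching would only stack `num_images` such masks rather than truly vectorize the sampling.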
yonigozlan left a comment
Thanks for iterating @rootonchair! Looks ready to go after adding a short comment about the Bicubic/Lanczos issue. Let's wait for @ArthurZucker's final approval, then LGTM.
```diff
 codebook_do_resize = True
 codebook_size = {"height": 112, "width": 112}
-codebook_resample = PILImageResampling.LANCZOS
+codebook_resample = PILImageResampling.BICUBIC
```
As I said in other PRs, ideally we would keep Lanczos here, and add a warning that fast image processors don't support Lanczos before forcing Bicubic in preprocessing. Seeing that this is only for codebook pixels, and that return_codebook_pixels is False by default, a short comment explaining why we have Bicubic here instead of Lanczos might be enough.
Changed accordingly!
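The warn-then-fall-back behavior suggested above could be sketched as follows (the function name and string values are hypothetical, standing in for the real resampling enums; the underlying constraint is that torchvision's tensor resize path does not implement a Lanczos kernel):

```python
import logging

logger = logging.getLogger(__name__)

def resolve_fast_interpolation(resample: str) -> str:
    # Fast (torchvision-tensor-based) image processors cannot resize with
    # LANCZOS, so the best they can do is warn and fall back to BICUBIC.
    if resample == "lanczos":
        logger.warning(
            "Fast image processors do not support LANCZOS resampling; "
            "falling back to BICUBIC."
        )
        return "bicubic"
    return resample
```

Since `return_codebook_pixels` is `False` by default, the fallback only affects users who explicitly request codebook pixels, which is why a short code comment was deemed sufficient here.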
```python
encoding_slow = image_processor_slow(
    dummy_image, return_tensors="pt", return_codebook_pixels=True, return_image_mask=True
)
encoding_fast = image_processor_fast(
    dummy_image, return_tensors="pt", return_codebook_pixels=True, return_image_mask=True
)
```
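The equivalence test then typically asserts elementwise closeness between the two encodings; a toy, self-contained sketch of that pattern (stand-in tensors rather than the real processor outputs, which would compare fields such as `pixel_values` and `codebook_pixel_values`):

```python
import torch

# Stand-ins for matching fields of the slow/fast encodings.
slow_out = torch.tensor([[0.1, 0.2], [0.3, 0.4]])
fast_out = slow_out + 1e-6  # small numerical drift, as between backends

# assert_close tolerates tiny float deviations between the two pipelines.
torch.testing.assert_close(fast_out, slow_out, atol=1e-4, rtol=1e-4)
```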
Thanks for iterating! Updating the branch and running the full CI. If everything passes I'll merge :)
* support flava fast image processor
* run style and quality
* update test
* update according to reviews
* make style
* update comment on BICUBIC
* make style

Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com>
What does this PR do?
Related #36978
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.