Add ImageProcessorFast to Efficientnet processor by Yann-CV · Pull Request #37094 · huggingface/transformers

Yann-CV · 2025-03-28T21:23:02Z

What does this PR do?

Following #36978:
This pull request introduces a new fast image processor for EfficientNet models and integrates it into the existing codebase. The changes include updates to documentation, initialization files, and test cases to support the new EfficientNetImageProcessorFast.

Integration of `EfficientNetImageProcessorFast`:

docs/source/en/model_doc/efficientnet.md: Added documentation for EfficientNetImageProcessorFast.
src/transformers/__init__.py: Included EfficientNetImageProcessorFast in the import structure and import statements.
src/transformers/models/auto/image_processing_auto.py: Updated the image processor mapping to include EfficientNetImageProcessorFast.
src/transformers/models/efficientnet/__init__.py: Added import for EfficientNetImageProcessorFast.

Implementation of `EfficientNetImageProcessorFast`:

src/transformers/models/efficientnet/image_processing_efficientnet_fast.py: Added the implementation of the EfficientNetImageProcessorFast class, including methods for preprocessing, rescaling, and normalizing images.
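
As a rough illustration of what "rescaling" and "normalizing" mean here, the sketch below applies both steps to a single RGB pixel in plain Python. It is not the code from `image_processing_efficientnet_fast.py`: the function names are hypothetical, and the mean/std values are the commonly used ImageNet statistics, assumed rather than taken from this PR.

```python
# Commonly used ImageNet statistics (an assumption, not taken from this PR).
ASSUMED_MEAN = (0.485, 0.456, 0.406)
ASSUMED_STD = (0.229, 0.224, 0.225)


def rescale_pixel(pixel, factor=1 / 255):
    """Map raw 0..255 channel values to roughly 0..1."""
    return tuple(channel * factor for channel in pixel)


def normalize_pixel(pixel, mean=ASSUMED_MEAN, std=ASSUMED_STD):
    """Subtract the per-channel mean, then divide by the per-channel std."""
    return tuple((c - m) / s for c, m, s in zip(pixel, mean, std))


raw = (255, 0, 128)
processed = normalize_pixel(rescale_pixel(raw))
print(processed)
```

A tensor-based processor would apply the same arithmetic to whole batches at once; the per-pixel form is only meant to make the two steps explicit.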

Testing and Dummy Objects:

src/transformers/utils/dummy_torchvision_objects.py: Added a dummy class for EfficientNetImageProcessorFast to handle cases where torchvision is not available.
tests/models/efficientnet/test_image_processing_efficientnet.py: Updated test cases to include EfficientNetImageProcessorFast and ensure it is tested alongside the standard EfficientNetImageProcessor.
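
The dummy-object idea mentioned above can be sketched in a few lines. This is a simplified, hypothetical version of the pattern (the helper `require_backends` and the class `PlaceholderImageProcessorFast` are made up for illustration and are not the contents of `dummy_torchvision_objects.py`): a placeholder keeps the name importable and fails with a clear message only when someone tries to use it.

```python
import importlib.util


def require_backends(obj_name, backends):
    """Raise a descriptive ImportError if any required package is missing."""
    missing = [b for b in backends if importlib.util.find_spec(b) is None]
    if missing:
        raise ImportError(
            f"{obj_name} requires the following missing packages: {', '.join(missing)}"
        )


class PlaceholderImageProcessorFast:
    """Hypothetical stand-in exported when the real class cannot be imported."""

    _backends = ["torchvision"]

    def __init__(self, *args, **kwargs):
        # Fails at instantiation time, not at import time.
        require_backends(type(self).__name__, self._backends)
```

With this shape, `from package import PlaceholderImageProcessorFast` still works on a machine without torchvision, and the error only appears when the class is instantiated.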

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline, Pull Request section?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag members/contributors who may be interested in your PR.

Yann-CV · 2025-03-29T08:02:14Z

I am seeing an inconsistency between the slow resize (implemented in EfficientNet) and the fast resize (inherited from the base class):

    def test_resize(self):
        # Resize the same random image with the slow and fast processors and compare the means.
        torch.manual_seed(0)
        data = torch.randint(0, 256, (1, 3, 480, 640), dtype=torch.uint8)
        for image_processor_class in self.image_processor_list:
            image_processor = image_processor_class(**self.image_processor_dict)
            if image_processor_class == EfficientNetImageProcessorFast:
                # Fast path: batched, channels-first torch tensor.
                image = data
                print(f"image: {image.shape} - {image.mean(dtype=torch.float32)}")
                resized_image = image_processor.resize(
                    image,
                    size=SizeDict(height=18, width=18),
                    interpolation=F.InterpolationMode.NEAREST,
                    antialias=False,
                )
                print(f"resized_image: {resized_image.shape} - {resized_image.mean(dtype=torch.float32)}")
            else:
                # Slow path: single channels-last NumPy array.
                image = data.squeeze().permute(1, 2, 0).numpy()
                print(f"image: {image.shape} - {image.mean()}")
                resized_image = image_processor.resize(
                    image,
                    size={"height": 18, "width": 18},
                    resample=PILImageResampling.NEAREST,
                )
                print(f"resized_image: {resized_image.shape} - {resized_image.mean()}")

This returns:

    image: (480, 640, 3) - 127.5092306857639
    resized_image: (18, 18, 3) - 126.91769547325103
    image: torch.Size([1, 3, 480, 640]) - 127.50922393798828
    resized_image: torch.Size([1, 3, 18, 18]) - 121.97119140625

There is quite a large gap in the mean value for the same interpolation mode. The `antialias` flag does not affect the result.
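
The thread does not pin down the cause of this gap. One plausible explanation, stated here as an assumption, is that the two backends use different nearest-neighbor index conventions: the PyTorch documentation for `interpolate` distinguishes `nearest` from `nearest-exact` and notes that the latter matches PIL. The sketch below uses two hypothetical helpers with exact integer arithmetic (real libraries use floating point) to show which source rows each convention would pick when going from 480 rows to 18.

```python
def floor_indices(in_size: int, out_size: int) -> list[int]:
    """Source index per output index using floor(i * in_size / out_size)."""
    return [(i * in_size) // out_size for i in range(out_size)]


def center_indices(in_size: int, out_size: int) -> list[int]:
    """Source index per output index using floor((i + 0.5) * in_size / out_size)."""
    return [((2 * i + 1) * in_size) // (2 * out_size) for i in range(out_size)]


rows_floor = floor_indices(480, 18)
rows_center = center_indices(480, 18)
print(rows_floor[:4])   # [0, 26, 53, 80]
print(rows_center[:4])  # [13, 40, 66, 93]
print(set(rows_floor) & set(rows_center))  # set(): no source row is shared
```

If this is indeed what happens, the two processors read entirely different pixels of the random input, so exact agreement cannot be expected with nearest interpolation.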

Yann-CV · 2025-03-29T10:14:15Z

All other methods (normalize and rescale) output the same result for the slow and fast processors.

Yann-CV · 2025-03-29T10:40:39Z

Using bilinear resizing makes the test pass (there is too much difference between PIL and torchvision with nearest).
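
For a sense of scale on the nearest-neighbor gap reported earlier, the back-of-the-envelope check below compares it with the sampling noise expected when averaging 18 × 18 × 3 values drawn uniformly from 0..255. It assumes the two paths pick independent, non-overlapping pixels, which the thread does not confirm; the two means are copied from the output above.

```python
import math

# Means reported in this thread for the resized images.
slow_mean = 126.91769547325103  # slow (PIL / NumPy) path
fast_mean = 121.97119140625     # fast (torchvision) path
n_samples = 18 * 18 * 3         # values in each resized image

# Standard deviation of a discrete uniform distribution on 0..255.
pixel_std = math.sqrt((256**2 - 1) / 12)

# Standard error of one sample mean, and of the difference of two
# independent sample means (assumption: the sample sets do not overlap).
sem = pixel_std / math.sqrt(n_samples)
sem_diff = sem * math.sqrt(2)

gap = slow_mean - fast_mean
print(f"gap={gap:.2f}  sem={sem:.2f}  gap/sem_diff={gap / sem_diff:.2f}")
```

Under these assumptions the gap is roughly 1.5 standard errors, which is in the range that pure sampling noise would produce when each path picks different pixels from a random image.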

Yann-CV · 2025-03-29T12:28:18Z

@ydshieh @yonigozlan the failing tests are unrelated to this pull request.
In addition to updating the resize interpolation in the tests, I took the liberty of fixing the NumPy seed in the tests.
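
A minimal sketch of what fixing the NumPy seed buys in a test, assuming the inputs are generated with NumPy's global random state. The helper `make_test_image` is hypothetical and is not the code added in this PR.

```python
import numpy as np


def make_test_image(height=480, width=640, seed=0):
    """Return a reproducible random uint8 image in (H, W, C) layout."""
    np.random.seed(seed)  # reset NumPy's global RNG, as a test setUp might
    return np.random.randint(0, 256, size=(height, width, 3), dtype=np.uint8)


first = make_test_image()
second = make_test_image()
print(np.array_equal(first, second))  # True: same seed, same pixels
```

With a fixed seed, a slow-versus-fast comparison sees the same input on every run, so a failure points at the processors rather than at an unlucky draw.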

Yann-CV added 6 commits March 28, 2025 17:14

[Fast Processor] EfficientNet

0c16e6c

improve to make it testable

3699153

apply all the make instructions

6da8714

Merge remote-tracking branch 'upstream/main' into fast_processor/effi…

9ff155c

…cientnet

fix test and add dtype to rescale

d672819

fix after make

08f690b

Yann-CV marked this pull request as ready for review March 28, 2025 22:39

Merge branch 'main' into fast_processor/efficientnet

f27cec3

github-actions Bot requested review from ydshieh and yonigozlan March 28, 2025 22:40

Yann-CV added 3 commits March 29, 2025 11:39

bilinear in test

42943ff

make it work for comparison

56d7237

add comment for bilinear

125b0d3

Yann-CV added 3 commits March 29, 2025 11:53

all tests pass

725fa6e

all make are ok

e827985

add numpy random seed in tests

9ac49d9

apply make again

26545b1

zshn25 mentioned this pull request Mar 30, 2025

Add EfficientNet Image PreProcessor #37055

Merged

5 tasks

yonigozlan mentioned this pull request Mar 31, 2025

[Contributions Welcome] Add Fast Image Processors #36978

Closed

81 tasks

Yann-CV wants to merge 14 commits into huggingface:main from Yann-CV:fast_processor/efficientnet
