Adding superglue fast image processing by AlphaOrOmega · Pull Request #41394 · huggingface/transformers

AlphaOrOmega · 2025-10-06T22:29:36Z

What does this PR do?

TLDR :

Implement fast processor for SuperGlue
About 3 times faster

This PR aims to translate the features of the class SuperGlueImageProcessor in the fast equivalent class SuperGlueImageProcessorFast.
The implementation heavily follows the standard implementation but reduces memory consumption and about 3 times the execution speed on my hardware.
The implementation mostly refactor the image formatting in the preprocessing step, notably by using torch tensors instead of PIL or Numpy.

Test Performed

RUN_SLOW=1 python -m pytest tests/models/superglue/test_image_processing_superglue.py

With an additional test based on the default processor tester (this test has not to be included in the repo) :

@require_vision
@require_torch
def test_fast_is_faster_than_slow(self):
    if not self.test_slow_image_processor or not self.test_fast_image_processor:
        self.skipTest(reason="Skipping speed test")

    if self.image_processing_class is None or self.fast_image_processing_class is None:
        self.skipTest(reason="Skipping speed test as one of the image processors is not defined")

    def measure_time(image_processor, image):
        # Warmup
        for _ in range(5):
            _ = image_processor(image, return_tensors="pt")
        all_times = []
        for _ in range(10):
            start = time.time()
            _ = image_processor(image, return_tensors="pt")
            all_times.append(time.time() - start)
        # Take the average of the fastest 3 runs
        avg_time = sum(sorted(all_times[:3])) / 3.0
        return avg_time

    dummy_images = self.image_processor_tester.prepare_image_inputs(equal_resolution=False, torchify=True)
    image_processor_slow = self.image_processing_class(**self.image_processor_dict)
    image_processor_fast = self.fast_image_processing_class(**self.image_processor_dict)

    fast_time = measure_time(image_processor_fast, dummy_images)
    slow_time = measure_time(image_processor_slow, dummy_images)

    self.assertLessEqual(fast_time, slow_time)

By reviewing the flame graph, I noticed the improvement in every __calls__ made to the fast version.

Callers of the old processor, and the full execution time of the method:

The equivalent but with the fast processor:

Some calls made during the test passes directly to the preprocess function, without passing by the __call__ one, I am including them as well:
Slow

Fast

Before submitting

[ ] This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
[ X ] Did you read the contributor guideline,
Pull Request section?
[ X ] Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
link : [Contributions Welcome] Add Fast Image Processors [Contributions Welcome] Add Fast Image Processors #36978 (comment)
[ X ] Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
[ ] Did you write any new necessary tests?

Who can review?

Thank you for reviewing my PR @yonigozlan (or anyone else :) )

…function refactor

… torch first method

Rocketknight1 · 2025-10-07T12:42:15Z

cc @yonigozlan

yonigozlan

Hi @AlphaOrOmega, thanks for working on this!
Please have a thorough look at the guide in this PR on how to implement a fast image processor correctly: #36978

You can also have a look at image_processing_efficientloftr_fast.py, as it should be almost identical to this image processor!

…implementation

…implementation' into fast_image_processing_superglue_implementation

AlphaOrOmega · 2025-10-10T22:36:31Z

Hi @yonigozlan,

Thank you for the feedback, the referenced processor was indeed very close to the one I implemented, so I re-used relevant code and ensured the logic was still here,

Could you please review the recent changes ?

Thank you

…ng_superglue_implementation

yonigozlan

Thanks a lot @AlphaOrOmega for working on this! Just added a commit to use modular for efficientloftr using this new image processor. Let's wait for the CI to pass then we'll merge!

…s://github.com/AlphaOrOmega/transformers into fast_image_processing_superglue_implementation

HuggingFaceDocBuilderDev · 2025-10-16T18:20:15Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

…s://github.com/AlphaOrOmega/transformers into fast_image_processing_superglue_implementation

github-actions · 2025-10-16T19:14:30Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: auto, efficientloftr, lightglue, superglue

* Default implementation - no time improvement * Improved implementation - apparently 2 times faster with only simple function refactor * elementary torch first approach, still need further implementation of torch first method * torch-first approach finished * refactor processor * refactor test * partial doc update * EfficientLoFTRImageProcessorFast based implementation * EfficientLoFTRImageProcessorFast based implementation * Logic checked - Test Passed - Validated execution speed * use modular for efficientloftr * fix import --------- Co-authored-by: yonigozlan <yoni.gozlan@huggingface.co> Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com>

AlphaOrOmega and others added 7 commits October 4, 2025 17:53

Default implementation - no time improvement

3d6bbe1

Improved implementation - apparently 2 times faster with only simple …

875a617

…function refactor

elementary torch first approach, still need further implementation of…

04ea589

… torch first method

torch-first approach finished

e887f45

refactor processor

25198d4

refactor test

1076191

Merge branch 'main' into fast_image_processing_superglue_implementation

d76b423

Merge branch 'main' into fast_image_processing_superglue_implementation

1576b17

yonigozlan reviewed Oct 9, 2025

View reviewed changes

AlphaOrOmega and others added 7 commits October 10, 2025 13:23

Merge branch 'main' into fast_image_processing_superglue_implementation

f3299a2

partial doc update

83d976c

EfficientLoFTRImageProcessorFast based implementation

0628fd8

Merge branch 'huggingface:main' into fast_image_processing_superglue_…

430197a

…implementation

Merge remote-tracking branch 'origin/fast_image_processing_superglue_…

4514ba2

…implementation' into fast_image_processing_superglue_implementation

EfficientLoFTRImageProcessorFast based implementation

7651579

Logic checked - Test Passed - Validated execution speed

d4e6371

yonigozlan added 2 commits October 16, 2025 17:40

Merge remote-tracking branch 'upstream/main' into fast_image_processi…

5f352f0

…ng_superglue_implementation

use modular for efficientloftr

649590e

yonigozlan approved these changes Oct 16, 2025

View reviewed changes

yonigozlan and others added 3 commits October 16, 2025 20:03

Merge branch 'main' into fast_image_processing_superglue_implementation

94ae524

fix import

ba663a7

Merge branch 'fast_image_processing_superglue_implementation' of http…

f283936

…s://github.com/AlphaOrOmega/transformers into fast_image_processing_superglue_implementation

yonigozlan enabled auto-merge (squash) October 16, 2025 18:08

AlphaOrOmega and others added 4 commits October 16, 2025 20:48

Merge branch 'main' into fast_image_processing_superglue_implementation

d22b309

nit

af60ead

Merge branch 'fast_image_processing_superglue_implementation' of http…

69417aa

…s://github.com/AlphaOrOmega/transformers into fast_image_processing_superglue_implementation

remove comment

4774877

yonigozlan merged commit 354567d into huggingface:main Oct 16, 2025
22 checks passed

yonigozlan mentioned this pull request Nov 4, 2025

[Contributions Welcome] Add Fast Image Processors #36978

Closed

81 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding superglue fast image processing#41394

Adding superglue fast image processing#41394
yonigozlan merged 24 commits intohuggingface:mainfrom
AlphaOrOmega:fast_image_processing_superglue_implementation

AlphaOrOmega commented Oct 6, 2025 •

edited

Loading

Uh oh!

Rocketknight1 commented Oct 7, 2025

Uh oh!

yonigozlan left a comment

Uh oh!

AlphaOrOmega commented Oct 10, 2025

Uh oh!

yonigozlan left a comment

Uh oh!

HuggingFaceDocBuilderDev commented Oct 16, 2025

Uh oh!

github-actions Bot commented Oct 16, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

AlphaOrOmega commented Oct 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Test Performed

Before submitting

Who can review?

Uh oh!

Rocketknight1 commented Oct 7, 2025

Uh oh!

yonigozlan left a comment

Choose a reason for hiding this comment

Uh oh!

AlphaOrOmega commented Oct 10, 2025

Uh oh!

yonigozlan left a comment

Choose a reason for hiding this comment

Uh oh!

HuggingFaceDocBuilderDev commented Oct 16, 2025

Uh oh!

github-actions Bot commented Oct 16, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

AlphaOrOmega commented Oct 6, 2025 •

edited

Loading