
[EfficientLoFTR] LRU Cached embedding computation at inference #40329

Merged
qubvel merged 6 commits into huggingface:main from sbucaille:eloftr-dynamic-rope on Aug 27, 2025

Conversation

@sbucaille
Contributor

What does this PR do?

Reverted EfficientLoFTR's RoPE computation back to inference time, to allow using different image sizes
Decorated it with an LRU cache, similarly to the DINOv3 implementation
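A minimal sketch of the caching pattern described above (the function body follows the snippets quoted later in this review; the `inv_freq` construction and cache size are illustrative assumptions, not the exact transformers implementation):

```python
from functools import lru_cache

import torch


@lru_cache(maxsize=32)
def compute_embeddings(embed_height: int, embed_width: int, hidden_size: int) -> torch.Tensor:
    """Compute 2D rotary position embeddings once per embedding grid size.

    Because the arguments are plain ints, lru_cache reuses the cached tensor
    on every later inference call with the same image/grid size, while still
    supporting arbitrary sizes (unlike precomputing at init time).
    """
    # Illustrative inverse-frequency vector (assumption, not the exact model code).
    inv_freq = 1.0 / (10000 ** (torch.arange(0, hidden_size // 4).float() / (hidden_size // 4)))
    # Row/column position indices, shape (H, W, 1).
    i_indices = torch.ones(embed_height, embed_width).cumsum(0).float().unsqueeze(-1)
    j_indices = torch.ones(embed_height, embed_width).cumsum(1).float().unsqueeze(-1)
    # Interleave row and column frequencies into the embedding.
    emb = torch.zeros(1, embed_height, embed_width, hidden_size // 2)
    emb[..., 0::2] = i_indices * inv_freq
    emb[..., 1::2] = j_indices * inv_freq
    return emb
```

Since `lru_cache` returns the identical tensor object on a cache hit, repeated calls with the same grid size cost nothing beyond a dictionary lookup.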

Who can review?

@qubvel

Contributor

@qubvel qubvel left a comment


Thanks, just a question, otherwise looks good 👍

Comment on lines +74 to +75
i_indices = torch.ones(embed_height, embed_width).cumsum(0).float().unsqueeze(-1)
j_indices = torch.ones(embed_height, embed_width).cumsum(1).float().unsqueeze(-1)
Contributor


device / dtype is also needed, no?

Contributor Author


compute_embeddings is only used inside the torch.autocast context manager, so I assume we don't need it?

i_indices = torch.ones(embed_height, embed_width).cumsum(0).float().unsqueeze(-1)
j_indices = torch.ones(embed_height, embed_width).cumsum(1).float().unsqueeze(-1)

emb = torch.zeros(1, embed_height, embed_width, hidden_size // 2)
Contributor


same here for zeros

@sbucaille
Contributor Author

@qubvel gentle bump! I answered your comment

@qubvel
Contributor

qubvel commented Aug 26, 2025

Thanks for the ping! Missed it.

I still think we should pass inv_freq.device and inv_freq.dtype when creating the tensors, to make this function safe to use even outside of the autocast block.
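A sketch of what this suggestion amounts to (a hedged illustration assuming `inv_freq` is a tensor available to the function; the signature and `inv_freq` handling are illustrative, not the exact code merged in the PR):

```python
import torch


def compute_embeddings(
    inv_freq: torch.Tensor, embed_height: int, embed_width: int, hidden_size: int
) -> torch.Tensor:
    """Build the embedding tensors directly on inv_freq's device and dtype.

    This keeps the function correct even when it is called outside of a
    torch.autocast block, e.g. on a CUDA model in full precision.
    """
    device, dtype = inv_freq.device, inv_freq.dtype
    # Factory kwargs replace the later .float() cast from the original snippet.
    i_indices = torch.ones(embed_height, embed_width, device=device, dtype=dtype).cumsum(0).unsqueeze(-1)
    j_indices = torch.ones(embed_height, embed_width, device=device, dtype=dtype).cumsum(1).unsqueeze(-1)
    emb = torch.zeros(1, embed_height, embed_width, hidden_size // 2, device=device, dtype=dtype)
    emb[..., 0::2] = i_indices * inv_freq
    emb[..., 1::2] = j_indices * inv_freq
    return emb
```

Without the factory kwargs, `torch.ones` and `torch.zeros` default to CPU float32, which would silently break (device mismatch) or downcast when the rest of the model lives elsewhere.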

@sbucaille
Contributor Author

Addressed in 3fa6b94

@qubvel
Contributor

qubvel commented Aug 26, 2025

run-slow: efficientloftr

@github-actions
Contributor

This comment contains run-slow, running the specified jobs:

models: ['models/efficientloftr']
quantizations: [] ...

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@sbucaille
Contributor Author

You can rerun the slow tests; it should be fixed now. I forgot that the features were aggregated before the attention, so the embeddings need to be of the correct size.

@qubvel
Contributor

qubvel commented Aug 27, 2025

run-slow: efficientloftr

@github-actions
Contributor

This comment contains run-slow, running the specified jobs:

models: ['models/efficientloftr']
quantizations: [] ...

@qubvel qubvel self-requested a review August 27, 2025 13:41
@github-actions
Contributor

[For maintainers] Suggested jobs to run (before merge)

run-slow: efficientloftr

  if self.hidden_size != self.out_features[-1]:
      raise ValueError(
-         f"hidden_size should be equal to the last value in out_features. hidden_size = {self.hidden_size}, out_features = {self.stage_out_channels}"
+         f"hidden_size should be equal to the last value in out_features. hidden_size = {self.hidden_size}, out_features = {self.out_features[-1]}"
Contributor


fix attribute name

Comment on lines +53 to +61
out_features: list[int] = [32, 32, 128],
stage_stride: list[int] = [2, 1, 2],
q_aggregation_kernel_size: int = 1,
kv_aggregation_kernel_size: int = 1,
q_aggregation_stride: int = 1,
kv_aggregation_stride: int = 1,
num_attention_layers: int = 2,
num_attention_heads: int = 8,
- hidden_size: int = 64,
+ hidden_size: int = 128,
Contributor


that's for FA2 tests to pass

Contributor Author


How did I not catch this one before? 🤔 But thanks for taking care of it!

Contributor


That's skipped locally in case you don't have FA2 installed 😄

@qubvel
Contributor

qubvel commented Aug 27, 2025

run-slow: efficientloftr

@github-actions
Contributor

This comment contains run-slow, running the specified jobs:

models: ['models/efficientloftr']
quantizations: [] ...

@qubvel qubvel merged commit 52aaa3f into huggingface:main Aug 27, 2025
20 checks passed
@sbucaille sbucaille deleted the eloftr-dynamic-rope branch September 2, 2025 03:14

Labels: None yet
Projects: None yet
3 participants