🚨 [Kernels] Fix kernel function registration #45420
Conversation
Cyrilvallez left a comment
Nice! Mostly nits to avoid relying on internals too much!
```python
Mode.TRAINING: FuncRepository(
    repo_id="kernels-community/rotary", func_name="apply_rotary_transformers"
),
```
Not sure why this wasn't included for training before, but it runs with DeepSpeed just fine + https://github.com/Dao-AILab/flash-attention/blob/b65ae6b175f2438de55601695b6a21971fc5e429/flash_attn/layers/rotary.py#L38-L90
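(For context: a repo_id/func_name pair like the one above can be resolved through the kernels library's public `get_kernel` API; the `FuncRepository` plumbing itself is transformers-internal, so this is just a rough sketch.)

```python
from kernels import get_kernel

# Load the kernel repo from the Hub, then grab the function by name.
rotary = get_kernel("kernels-community/rotary")
apply_rotary = getattr(rotary, "apply_rotary_transformers")
```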
```python
def attach_hidden_kernels(module):
    for name, fn in getattr(module, "_hidden_kernels", {}).items():
        if name not in dict(module.named_children()):
            module.register_module(name, fn)


def detach_hidden_kernels(module):
    for name in getattr(module, "_hidden_kernels", {}):
        delattr(module, name)
```
Removed the internal structure and now rely on native APIs instead, as suggested.
```python
self.apply(attach_hidden_kernels)
try:
```
Would put the apply inside the try as well, but not a big deal at all!
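Something like this, i.e. so that a failure during the attach is also cleaned up (the surrounding structure is a guess):

```python
try:
    # Attaching inside the try means a failure here still hits the cleanup.
    self.apply(attach_hidden_kernels)
    ...  # work that needs the kernels temporarily visible as submodules
finally:
    self.apply(detach_hidden_kernels)
```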
* fix attempt
* proper fix - also works with deepspeed
* rely less on internals and add rotary to training
* move under the try as well
Breaking change
🚨 Slightly breaking change: we no longer register the hidden `rotary_fn`. Users shouldn't have relied on it, but marking it in any case: e.g. `self.rotary_fn(...)` within the Attention module does not work anymore, as the reference is deleted from now on.
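Concretely, a hedged before/after sketch (module path and names are hypothetical):

```python
attn = model.model.layers[0].self_attn  # hypothetical path to an Attention module

# Before this PR, the exchanged kernel was a registered child module:
"rotary_fn" in dict(attn.named_children())  # previously True, now False

# The hidden registration is deleted, so relying on self.rotary_fn(...)
# inside the module no longer works; per the description below, only a
# pre-existing self reference survives, and only as a plain attribute.
```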
Description

As per title, we do not want proper nn.Modules to be registered for kernel-exchanged functions - they are not proper modules (and they are never called as such)! They act as an exchange format for kernels, but functionally they should stay pure functions only.

The exact reasons are numerous, but one recent example is DeepSpeed ZeRO-3, which cannot handle this: the module is never properly called in the forward on the module directly (untraceable), and it changes the module structure after model construction (fixable by changing the order of inits, tbh).
This PR changes the core functionality to register the modules only temporarily under the parent module, discover the exchangeable functions, and then delete them from the visible interface. For BC purposes, we still keep a self reference that already exists (now as a simple attribute, not a module).
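A minimal sketch of that flow, assuming hypothetical helper names and that the exchanged functions are wrapped as nn.Module instances (which `register_module` requires):

```python
import torch.nn as nn

def swap_in_kernel_functions(module: nn.Module, kernel_fns: dict):
    # 1) Temporarily register each function under the parent module so it
    #    can be discovered like any other child module.
    for name, fn in kernel_fns.items():
        module.register_module(name, fn)

    # 2) Discover the exchangeable functions via the native API.
    discovered = {n: m for n, m in module.named_children() if n in kernel_fns}

    # 3) Delete them from the visible nn.Module interface (removes them
    #    from _modules, state_dict traversal, hook application, ...).
    for name, fn in discovered.items():
        delattr(module, name)
        # 4) Keep a plain attribute for BC; bypassing nn.Module.__setattr__
        #    avoids re-registering the value as a submodule.
        object.__setattr__(module, name, fn)
```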