Fix maybe_autocast crashing on meta device tensors#44984

Merged
Cyrilvallez merged 1 commit into huggingface:main from Butanium:fix/maybe-autocast-meta-device
Mar 25, 2026

Conversation

Contributor

@Butanium Butanium commented Mar 25, 2026

What does this PR do?

`maybe_autocast` calls `torch.is_autocast_enabled(device_type)`, which raises a RuntimeError when `device_type` is `"meta"`:

```
RuntimeError: unknown device type for autocast in get_autocast_dispatch_key_from_device_type
```

This breaks any code that runs a forward pass on meta tensors — for example, nnsight's .scan() context, which traces the computational graph using meta tensors without materializing weights.

The error path is: `LlamaRotaryEmbedding.forward` → `maybe_autocast(device_type="meta", enabled=False)` → `torch.is_autocast_enabled("meta")` → 💥

Since autocast is meaningless on meta tensors (they don't compute anything), this PR returns nullcontext() early when device_type == "meta".

This affects all 20+ model files that use maybe_autocast (via RoPE or directly), so fixing it at the source in maybe_autocast is preferable to patching each callsite.

Reproduction

```python
import torch
torch.is_autocast_enabled("meta")  # RuntimeError

# With nnsight / any meta-tensor forward pass:
from nnsight import LanguageModel
model = LanguageModel("meta-llama/Llama-3.1-70B")
with model.scan("test"):  # crashes in RoPE forward
    pass
```

Fix

```python
def maybe_autocast(device_type, dtype=None, enabled=True, cache_enabled=None):
    if device_type == "meta":
        return nullcontext()
    ...
```

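For illustration, the guard can be sketched as a self-contained function. This is a torch-free sketch, not the actual `transformers` implementation (which wraps `torch.autocast` and additionally consults `torch.is_autocast_enabled`); the deferred `torch` import is an assumption made here so the meta path runs without torch installed:

```python
from contextlib import nullcontext

def maybe_autocast(device_type, dtype=None, enabled=True, cache_enabled=None):
    """Sketch of the patched helper: skip autocast entirely on the meta device."""
    # Meta tensors carry only shape/dtype metadata and never run real kernels,
    # so autocast has nothing to do; bail out before torch.is_autocast_enabled
    # can raise "unknown device type for autocast".
    if device_type == "meta":
        return nullcontext()
    import torch  # deferred so the meta path needs no torch (sketch only)
    return torch.autocast(device_type=device_type, dtype=dtype,
                          enabled=enabled, cache_enabled=cache_enabled)

# The meta path now yields a harmless no-op context manager:
with maybe_autocast("meta", enabled=False):
    pass  # previously: RuntimeError from torch.is_autocast_enabled("meta")
```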
Note on AI usage

🤖 Generated with Claude Code
👨 reviewed by the human:
This bug came up in a mech interp class; I maintain the nnterp library that was used there and encountered this issue. Claude helped me trace the issue back to transformers, and the proposed fix resolves our issue and is clean and minimal.

`torch.is_autocast_enabled("meta")` raises a RuntimeError because
torch does not support autocast for the meta device. This breaks any
code that runs a forward pass on meta tensors (e.g. nnsight's `.scan()`
for tracing without materializing weights).

Since autocast is meaningless on meta tensors, return `nullcontext()`
early when `device_type == "meta"`.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@Butanium
Contributor Author

@scienceetonnante it would be nice if this could be merged and included in the next transformers release for nnterp! This came up in @gsarti's mech interp class with this nnterp snippet:

```python
model = StandardizedTransformer(
    "meta-llama/Llama-3.1-70B",
    device_map="auto",
    remote=True)
```

I'll also push a fix for nnterp to fall back to remote execution for the renaming check if scan fails, but that's a bit annoying.

@Rocketknight1
Member

This LGTM but I should check with the people who touched this code recently - cc @Cyrilvallez @hmellor for final review!

Member

@hmellor hmellor left a comment

Seems reasonable

Member

@Cyrilvallez Cyrilvallez left a comment

Weird usage pattern but alright!

@Cyrilvallez Cyrilvallez merged commit c17877c into huggingface:main Mar 25, 2026
18 of 20 checks passed
@Butanium
Contributor Author

Thanks for the quick review and merge!

zucchini-nlp pushed a commit to zucchini-nlp/transformers that referenced this pull request Mar 27, 2026
NielsRogge pushed a commit to NielsRogge/transformers that referenced this pull request Mar 30, 2026
4 participants