
Fix to tuple conversion with config #39257

Open
qubvel wants to merge 1 commit into huggingface:main from qubvel:fix-return-tuple

Conversation

@qubvel
Contributor

@qubvel qubvel commented Jul 7, 2025

What does this PR do?

Setting return_dict=False via the config fails for models whose sub-models are wrapped with can_return_tuple or check_model_inputs:

import torch
from transformers import LlamaConfig, LlamaForCausalLM

config = LlamaConfig(vocab_size=256, hidden_size=128, num_hidden_layers=2, num_attention_heads=4, intermediate_size=256)
model = LlamaForCausalLM(config)

# default: ModelOutput
input_ids = torch.tensor([[0, 1, 2, 3]])
with torch.no_grad():
    output = model(input_ids)

print(output)


# passing return_dict=False as a kwarg 
input_ids = torch.tensor([[0, 1, 2, 3]])
with torch.no_grad():
    output = model(input_ids, return_dict=False)

print(output)


# ERROR: setting return_dict=False in the config
model.config.return_dict = False
with torch.no_grad():
    output = model(input_ids)

print(output)

# Traceback (most recent call last):
#   File "/home/ubuntu/projects/transformers/test_llama_small.py", line 17, in <module>
#     output = model(input_ids)
#   File "/home/ubuntu/projects/transformers/.venv/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1751, in _wrapped_call_impl
#     return self._call_impl(*args, **kwargs)
#   File "/home/ubuntu/projects/transformers/.venv/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1762, in _call_impl
#     return forward_call(*args, **kwargs)
#   File "/home/ubuntu/projects/transformers/src/transformers/utils/generic.py", line 962, in wrapper
#     output = func(self, *args, **kwargs)
#   File "/home/ubuntu/projects/transformers/src/transformers/models/llama/modeling_llama.py", line 506, in forward
#     hidden_states = outputs.last_hidden_state
# AttributeError: 'tuple' object has no attribute 'last_hidden_state'

        if return_dict_passed is not None:
            return_dict = return_dict_passed
-       output = func(self, *args, **kwargs)
+       output = func(self, *args, **kwargs, return_dict=True)
Contributor Author

This way, it works as long as **kwargs (TransformersKwargs) are properly propagated from the top module down to each wrapped submodule.

For context: previously, we recursively set the module attribute _is_top_module to avoid passing **kwargs everywhere.
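For illustration, the decorator's contract after this change can be sketched framework-free. Everything below (SimpleOutput, DummyModel, the decorator body) is a hypothetical stand-in, not the actual transformers implementation: the wrapper resolves the user's return_dict intent, always calls the inner forward with return_dict=True so nested wrapped modules return structured outputs, and converts to a tuple only at its own boundary.

```python
# Hypothetical sketch of a can_return_tuple-style wrapper; not the real
# transformers code. SimpleOutput stands in for ModelOutput.
from dataclasses import astuple, dataclass
from functools import wraps


@dataclass
class SimpleOutput:
    last_hidden_state: list

    def to_tuple(self):
        return astuple(self)


def can_return_tuple(func):
    @wraps(func)
    def wrapper(self, *args, **kwargs):
        # an explicit return_dict kwarg wins; otherwise fall back to the config
        return_dict_passed = kwargs.pop("return_dict", None)
        return_dict = (
            return_dict_passed
            if return_dict_passed is not None
            else getattr(self.config, "return_dict", True)
        )
        # always request a structured output from the inner forward, so any
        # wrapped sub-model also sees return_dict=True ...
        output = func(self, *args, **kwargs, return_dict=True)
        # ... and convert to a tuple only at this boundary
        return output if return_dict else output.to_tuple()

    return wrapper


class DummyConfig:
    return_dict = True


class DummyModel:
    def __init__(self):
        self.config = DummyConfig()

    @can_return_tuple
    def forward(self, input_ids, return_dict=True):
        return SimpleOutput(last_hidden_state=[h * 2 for h in input_ids])


model = DummyModel()
print(model.forward([1, 2]))   # SimpleOutput(last_hidden_state=[2, 4])
model.config.return_dict = False
print(model.forward([1, 2]))   # ([2, 4],)
```

With this shape, a sub-model wrapped the same way never sees a tuple from its parent, which is exactly the failure mode in the traceback above.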

Comment on lines -926 to -946
def set_attribute_for_modules(module: "torch.nn.Module", key: str, value: Any):
    """
    Set a value to a module and all submodules.
    """
    setattr(module, key, value)
    for submodule in module.children():
        set_attribute_for_modules(submodule, key, value)


def del_attribute_from_modules(module: "torch.nn.Module", key: str):
    """
    Delete a value from a module and all submodules.
    """
    # because we might remove it previously in case it's a shared module, e.g. activation function
    if hasattr(module, key):
        delattr(module, key)

    for submodule in module.children():
        del_attribute_from_modules(submodule, key)


Contributor Author

no longer needed
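To illustrate what the removed helpers did, here is a torch-free sketch; Node is a hypothetical stand-in for torch.nn.Module exposing a children() iterator. The hasattr guard matters because a shared submodule (e.g. a shared activation) is visited more than once during deletion.

```python
# Torch-free sketch of the removed recursion helpers. `Node` is a
# hypothetical stand-in for torch.nn.Module.
class Node:
    def __init__(self, *children):
        self._children = list(children)

    def children(self):
        return iter(self._children)


def set_attribute_for_modules(module, key, value):
    # set the attribute on the module and, recursively, all submodules
    setattr(module, key, value)
    for submodule in module.children():
        set_attribute_for_modules(submodule, key, value)


def del_attribute_from_modules(module, key):
    # hasattr guard: a shared submodule may already have been visited
    # and had the attribute deleted
    if hasattr(module, key):
        delattr(module, key)
    for submodule in module.children():
        del_attribute_from_modules(submodule, key)


shared = Node()  # appears twice in the tree, like a shared activation
root = Node(Node(shared), shared)
set_attribute_for_modules(root, "_is_top_module", False)
print(shared._is_top_module)            # False
del_attribute_from_modules(root, "_is_top_module")
print(hasattr(root, "_is_top_module"))  # False
```

The kwargs-propagation approach in this PR makes this whole recursive bookkeeping unnecessary, which is why both helpers are deleted.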

@qubvel qubvel marked this pull request as ready for review July 7, 2025 16:15
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
