
mllama outputs refactor #39643

Merged
itazap merged 9 commits into main from mllama_new_outputs on Jul 28, 2025
Conversation

@itazap (Collaborator) commented on Jul 24, 2025

Refactor using the latest outputs merge.
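
For context, here is a minimal, self-contained toy sketch of the hook-based recording pattern this refactor moves to. It is a hypothetical simplification, not transformers' actual implementation: the real machinery is `OutputRecorder` and `@check_model_inputs` (used in the diff below), whose exact signatures may differ.

```python
# Toy re-implementation of the recording idea (hypothetical, simplified).
import torch
from torch import nn

class TinyAttention(nn.Module):
    def forward(self, hidden_states):
        # By convention, index 0 of the return tuple is hidden states
        # and index 1 is attention weights.
        attn_weights = torch.softmax(
            hidden_states @ hidden_states.transpose(-1, -2), dim=-1
        )
        return attn_weights @ hidden_states, attn_weights

class TinyModel(nn.Module):
    # Declarative map: which submodule classes produce which recorded
    # outputs, and at which position of their return tuple.
    _can_record_outputs = {
        "hidden_states": [(TinyAttention, 0)],
        "attentions": [(TinyAttention, 1)],
    }

    def __init__(self, num_layers=2):
        super().__init__()
        self.layers = nn.ModuleList(TinyAttention() for _ in range(num_layers))

    def forward(self, hidden_states, record=False):
        records = {name: [] for name in self._can_record_outputs} if record else None
        hooks = []
        if record:
            # Attach forward hooks so submodule outputs are recorded
            # transparently, instead of threading output_* flags through
            # every layer's signature and return value.
            for name, targets in self._can_record_outputs.items():
                for cls, index in targets:
                    for module in self.modules():
                        if isinstance(module, cls):
                            hooks.append(module.register_forward_hook(
                                lambda m, args, out, n=name, i=index: records[n].append(out[i])
                            ))
        for layer in self.layers:
            hidden_states, _ = layer(hidden_states)
        for handle in hooks:
            handle.remove()
        return hidden_states, records

out, recorded = TinyModel()(torch.randn(1, 4, 8), record=True)
print({k: len(v) for k, v in recorded.items()})  # {'hidden_states': 2, 'attentions': 2}
```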

@HuggingFaceDocBuilderDev commented:
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@itazap force-pushed the mllama_new_outputs branch from e0beb91 to d409f10 on July 28, 2025 08:07
@ArthurZucker (Collaborator) left a comment:

Nice cleanup!

Five outdated comment threads on src/transformers/models/mllama/modeling_mllama.py
Comment on lines +766 to +770
"hidden_states": [
OutputRecorder(MllamaTextSelfAttention, index=0),
OutputRecorder(MllamaTextCrossAttention, index=0),
],
"attentions": [
OutputRecorder(MllamaTextSelfAttention, index=1, layer_name="self_attn"),
OutputRecorder(MllamaTextSelfAttention, index=1, layer_name="cross_attn"),
OutputRecorder(MllamaTextCrossAttention, index=1, layer_name="cross_attn"),
],
Collaborator commented:

Index can be optional; by default, hidden states are at index 0 and attentions at index 1.

Collaborator (Author) replied:

Updated `hidden_states` to `"hidden_states": [MllamaSelfAttentionDecoderLayer, MllamaCrossAttentionDecoderLayer]`, but `attentions` still uses explicit recorders on the attention layers (see the sketch below).
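
For reference, a sketch of what the updated mapping looks like after this change. This is illustrative only: the `hidden_states` entry follows the comment above, while the `attentions` entries are carried over from the earlier snippet and may not match the merged code exactly.

```python
_can_record_outputs = {
    # Plain layer classes suffice here: hidden states sit at the default index 0.
    "hidden_states": [MllamaSelfAttentionDecoderLayer, MllamaCrossAttentionDecoderLayer],
    # Attention weights sit at index 1 and need layer_name disambiguation,
    # so explicit recorders are kept.
    "attentions": [
        OutputRecorder(MllamaTextSelfAttention, index=1, layer_name="self_attn"),
        OutputRecorder(MllamaTextSelfAttention, index=1, layer_name="cross_attn"),
        OutputRecorder(MllamaTextCrossAttention, index=1, layer_name="cross_attn"),
    ],
}
```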

Comment thread src/transformers/models/mllama/modeling_mllama.py Outdated
@itazap force-pushed the mllama_new_outputs branch from af0e94c to 3bd4c0a on July 28, 2025 12:20
@itazap requested a review from ArthurZucker on July 28, 2025 12:41
@itazap force-pushed the mllama_new_outputs branch from f7811d4 to 4ebcc20 on July 28, 2025 12:42
@ArthurZucker (Collaborator) left a comment:

The decoder layer should only need to return hidden states, never the attention weights, no?
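
A toy illustration of the suggestion, assuming hook-based recorders (hypothetical class names, not the actual mllama code): the attention module still returns its weights at index 1 for a recorder to pick up, but the decoder layer itself returns only hidden states.

```python
import torch
from torch import nn

class SelfAttention(nn.Module):
    # Still returns (hidden_states, attn_weights); a recorder can read index 1.
    def forward(self, hidden_states):
        attn_weights = torch.softmax(
            hidden_states @ hidden_states.transpose(-1, -2), dim=-1
        )
        return attn_weights @ hidden_states, attn_weights

class DecoderLayer(nn.Module):
    # Returns only hidden states: attention weights are no longer threaded
    # through the layer's return value.
    def __init__(self, dim=8):
        super().__init__()
        self.self_attn = SelfAttention()
        self.mlp = nn.Linear(dim, dim)

    def forward(self, hidden_states):
        hidden_states, _ = self.self_attn(hidden_states)
        return self.mlp(hidden_states)  # a single tensor, not a tuple
```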

Four more outdated comment threads on src/transformers/models/mllama/modeling_mllama.py
@github-actions (Contributor) commented:

[For maintainers] Suggested jobs to run (before merge)

run-slow: mllama

@itazap force-pushed the mllama_new_outputs branch from b71d470 to fa870ff on July 28, 2025 13:38
@itazap merged commit da823fc into main on Jul 28, 2025 (20 checks passed)
@itazap deleted the mllama_new_outputs branch on July 28, 2025 13:59
@itazap restored the mllama_new_outputs branch on July 30, 2025 10:10
zaristei pushed a commit to zaristei/transformers that referenced this pull request on Sep 9, 2025:
* mllama outputs refactor

* forgot kwargs

* fix output

* add can_record_outputs

* correct @check_model_inputs placement

* ruff and copies

* rebase

* feedback

* only return hidden_states

---------

Co-authored-by: ita.zaporozhets@huggingface.co <ita_zaporozhets@ip-26-0-161-153.ec2.internal>
Co-authored-by: ita.zaporozhets@huggingface.co <ita_zaporozhets@ip-26-0-162-14.ec2.internal>
zaristei pushed the same commit to zaristei/transformers six more times on Sep 9, 2025, with an identical commit message each time.

Labels: none yet · Projects: none yet · 3 participants