[XLM] Refactor output tracing to align with capture_outputs standardized architecture by aman-coder03 · Pull Request #44101 · huggingface/transformers

aman-coder03 · 2026-02-17T17:15:06Z

What does this PR do?

This PR refactors XLM's output tracing to align with the standardized output capturing patterns used across the codebase.

Key changes:

Refactors transformer blocks into a dedicated XLMLayer module to enable structured output capture
Integrates the capture_outputs decorator with _can_record_outputs to automatically capture attention weights from each transformer layer
Removes manual attention propagation logic and relies on the standardized output capturing infrastructure
Implements explicit hidden_states collection in XLMModel.forward to ensure embedding outputs and all intermediate layer outputs are correctly returned
Ensures proper handling and propagation of output_attentions, output_hidden_states, and return_dict flags
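
The changes listed above can be illustrated with a dependency-free toy. This is only a sketch of the general idea, not the transformers implementation: `ToyLayer`, `ToyModel`, and `capture_outputs_sketch` are hypothetical names, and the real `capture_outputs` decorator and `_can_record_outputs` attribute may work differently (for example, through module hooks). The sketch assumes each layer returns a `(hidden_state, attention_weights)` pair. The decorator records the attention weights, while `forward` collects hidden states explicitly, starting with the embedding output.

```python
import functools


class ToyLayer:
    """Hypothetical stand-in for a transformer block such as XLMLayer."""

    def __init__(self, scale):
        self.scale = scale

    def __call__(self, hidden):
        new_hidden = [h * self.scale for h in hidden]
        attn = [1.0 / len(hidden)] * len(hidden)  # uniform toy "attention"
        return new_hidden, attn


def capture_outputs_sketch(forward):
    """Hypothetical decorator: records the second output of the layer class
    named under "attentions" in the model's _can_record_outputs mapping."""

    @functools.wraps(forward)
    def wrapper(self, *args, output_attentions=False, **kwargs):
        layer_cls = self._can_record_outputs.get("attentions")
        if not output_attentions or layer_cls is None:
            result = forward(self, *args, **kwargs)
            result["attentions"] = None
            return result

        recorded = []
        original_call = layer_cls.__call__

        def traced_call(layer, *a, **kw):
            out = original_call(layer, *a, **kw)
            recorded.append(out[1])  # keep the attention weights
            return out

        layer_cls.__call__ = traced_call  # temporarily trace every layer call
        try:
            result = forward(self, *args, **kwargs)
        finally:
            layer_cls.__call__ = original_call  # always restore the class
        result["attentions"] = tuple(recorded)
        return result

    return wrapper


class ToyModel:
    # Assumption: maps an output name to the layer class to trace.
    _can_record_outputs = {"attentions": ToyLayer}

    def __init__(self, num_layers=3):
        self.layers = [ToyLayer(scale=2.0) for _ in range(num_layers)]

    @capture_outputs_sketch
    def forward(self, embeddings, output_hidden_states=False):
        hidden = list(embeddings)
        # Explicit collection: the embedding output comes first.
        all_hidden = (tuple(hidden),) if output_hidden_states else None
        for layer in self.layers:
            hidden, _ = layer(hidden)  # no manual attention propagation
            if output_hidden_states:
                all_hidden += (tuple(hidden),)
        return {"last_hidden_state": hidden, "hidden_states": all_hidden}


model = ToyModel(num_layers=3)
out = model.forward([1.0, 2.0], output_attentions=True, output_hidden_states=True)
print(len(out["attentions"]), len(out["hidden_states"]))  # prints: 3 4
```

With three layers, the toy returns three attention entries and four hidden states (the embedding output plus one per layer), which is the shape of result the description says XLMModel.forward should return.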

Why is this needed?

This aligns XLM with the standardized output tracing architecture introduced in #43979, ensuring consistent behavior across models and compatibility with the shared output capturing infrastructure.

github-actions · 2026-02-19T07:58:21Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: flaubert, xlm

aman-coder03 added 6 commits February 17, 2026 22:41

Refactor XLM output tracing using capture_outputs decorator

5162dda

Sync Flaubert copies with XLM after output tracing refactor

f439a1c

fix tensor

c849a72

tensor not defined correction

4a56546

Fix Flaubert copy inconsistencies after XLM output tracing refactor

1979931

Refactor Flaubert to use capture_outputs decorator for attention tracing

ea3a0a2

[XLM] Refactor output tracing using capture_outputs decorator and fix…

e5d3f39

… attentions propagation

This was referenced Apr 29, 2026

Cumulative feature and defect updates from recent Transformers PRs evalstate/transformers#42

Open

Cumulative defect fixes from recent Transformers PRs evalstate/transformers#43

Open

aman-coder03 wants to merge 7 commits into huggingface:main from aman-coder03:refactor-xlm-output-capture
