[ViLT] Refactor output handling to align with standardized patterns by aman-coder03 · Pull Request #44098 · huggingface/transformers

aman-coder03 · 2026-02-17T16:32:34Z

What does this PR do?

This PR refactors ViLT's output handling to align with the standardized patterns used across the codebase.

Key changes:

Removes manual hidden_states/attentions propagation and passes output_attentions, output_hidden_states, and return_dict cleanly through the encoder call in ViltModel.forward
Adds **kwargs forwarding to all child model self.vilt(...) calls (ViltForMaskedLM, ViltForQuestionAnswering, ViltForImageAndTextRetrieval, ViltForTokenClassification) so output flags are correctly propagated from the top-level forward call down to the base model
Fixes ViltForTokenClassification to handle the inputs_embeds path correctly when computing text_input_size
Fixes ViltForImagesAndTextClassification to return None instead of empty lists for hidden_states and attentions when not requested, ensuring correct output length counting

Why is this needed?

This aligns ViLT with the new output handling patterns introduced in #43979.

github-actions · 2026-02-17T16:33:37Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: vilt

[ViLT] Refactor output tracing using capture_outputs decorator

64715d4

aman-coder03 changed the title ~~[ViLT] Refactor output tracing using capture_outputs decorator~~ [ViLT] Refactor output handling to align with standardized patterns Feb 17, 2026

fix lint issues

c26d726

This was referenced Apr 29, 2026

Cumulative feature and defect updates from recent Transformers PRs evalstate/transformers#42

Open

Cumulative defect fixes from recent Transformers PRs evalstate/transformers#43

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ViLT] Refactor output handling to align with standardized patterns#44098

[ViLT] Refactor output handling to align with standardized patterns#44098
aman-coder03 wants to merge 2 commits intohuggingface:mainfrom
aman-coder03:refactor-vilt-output-capture

aman-coder03 commented Feb 17, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented Feb 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

aman-coder03 commented Feb 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Why is this needed?

Uh oh!

github-actions Bot commented Feb 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

aman-coder03 commented Feb 17, 2026 •

edited

Loading