Fix missing property access for multimodal models by albertvillanova · Pull Request #966 · linkedin/Liger-Kernel

albertvillanova · 2025-12-04T14:28:01Z

Summary

This PR fixes access to missing attributes for multimodal models in src/liger_kernel/transformers/monkey_patch.py. The main change is to consistently access attributes (like language_model, vision_tower, and visual) through the submodel .model attribute of the parent model, rather than directly from the parent model itself.

This fixes AttributeError after this PR was merged in transformers:

🚨 Generalize get_decoder() for multimodal and delete redundant code 🔪 huggingface/transformers#42156

See associated issue in TRL:

CI fails with dev dependencies: AttributeError: 'Qwen2_5_VLForConditionalGeneration' object has no attribute 'language_model' huggingface/trl#4601

Fix #960.

Details

Fix: Consistent attribute access via .model

Updated all references to submodules such as language_model, vision_tower, and visual to use the .model attribute (e.g., model.model.language_model instead of model.language_model) across all kernel application functions for models including LLava, Mllama, Gemma3, PaliGemma, Qwen2 VL, Qwen2.5 VL, Qwen3 VL, Qwen3 VL MoE, GLM4V, GLM4V MoE, and InternVL.

Normalization and patching logic updates

Adjusted normalization and patching calls to operate on submodels accessed via .model, ensuring that layer normalization and RMS normalization are consistently applied to the correct components.

These changes make the codebase more maintainable and robust against future changes in model class implementations.

Testing Done

Hardware Type:
run make test to ensure correctness
run make checkstyle to ensure code style
run make test-convergence to ensure convergence

Tcc0403 · 2025-12-04T18:12:44Z

src/liger_kernel/transformers/monkey_patch.py

            # Note: language_model and visual properties can be accessed throught conditional class for BC.
            # Not sure if it is subject to changes in the future.
            # Reference: https://github.com/huggingface/transformers/blob/v4.52.4/src/transformers/models/qwen2_vl/modeling_qwen2_vl.py#L1698


Could you help me remove this comment? Thanks!

Tcc0403 · 2025-12-04T18:21:50Z

src/liger_kernel/transformers/monkey_patch.py

We also need to update this condition.

model.model.language for XXXForConditionalGeneration, model.language_model for XXXVLModel

Good catch!

src/liger_kernel/transformers/monkey_patch.py

Tcc0403 · 2025-12-05T15:59:35Z

There still exist some missing attribute error

MllamaForConditionalGeneration
Qwen2VLForConditionalGeneration
Qwen2_5_VLForConditionalGeneration
InternVLForConditionalGeneration
Glm4vForConditionalGeneration
Glm4vMoeForConditionalGeneration

FAILED test/transformers/test_monkey_patch.py::test_apply_liger_kernel_to_instance_for_mllama_for_conditional_generation - AttributeError: 'MllamaForConditionalGeneration' object has no attribute 'language_model'
FAILED test/transformers/test_monkey_patch.py::test_apply_liger_kernel_to_instance_for_qwen2_vl_for_conditional_generation - AttributeError: 'Qwen2VLForConditionalGeneration' object has no attribute 'language_model'
FAILED test/transformers/test_monkey_patch.py::test_apply_liger_kernel_to_instance_for_qwen2_5_vl_for_conditional_generation - AttributeError: 'Qwen2_5_VLForConditionalGeneration' object has no attribute 'language_model'
FAILED test/transformers/test_monkey_patch.py::test_apply_liger_kernel_to_instance_for_internvl - AttributeError: 'InternVLForConditionalGeneration' object has no attribute 'language_model'
FAILED test/transformers/test_monkey_patch.py::test_apply_liger_kernel_to_instance_for_glm4v - AttributeError: 'Glm4vForConditionalGeneration' object has no attribute 'language_model'
FAILED test/transformers/test_monkey_patch.py::test_apply_liger_kernel_to_instance_for_glm4v_moe - AttributeError: 'Glm4vMoeForConditionalGeneration' object has no attribute 'language_model'

Similar errors are also listed in #960 (comment). It's just a reminder for myself, not necessarily have to fix all of them in this PR! We can focus on handling all language_model property in this PR.

Tcc0403

Thanks!

Fix missing property access for multimodal models

d24880a

Tcc0403 requested changes Dec 4, 2025

View reviewed changes

albertvillanova added 8 commits December 5, 2025 09:42

Fix Qwen2VLModel

e976f4b

Fix Qwen2_5_VLModel

26f7c8d

Fix Glm4vModel

26843e0

Fix Glm4vMoeModel

1848e2f

Fix Qwen3VLModel

931575a

Fix Qwen3VLMoeModel

b85f91a

Fix InternVLModel

55f1235

Fix qwen2_5_vl vision_model

ea3f78c

albertvillanova commented Dec 5, 2025

View reviewed changes

src/liger_kernel/transformers/monkey_patch.py Show resolved Hide resolved

albertvillanova commented Dec 5, 2025

View reviewed changes

src/liger_kernel/transformers/monkey_patch.py Show resolved Hide resolved

albertvillanova commented Dec 5, 2025

View reviewed changes

src/liger_kernel/transformers/monkey_patch.py Show resolved Hide resolved

albertvillanova added 10 commits December 10, 2025 19:40

Fix test for mllama_for_conditional_generation

4104d56

Fix test for qwen2_vl_for_conditional_generation

f500a8e

Fix test for qwen2_5_vl_for_conditional_generation

3482d0b

Fix test for internvl

82379a5

Fix test for glm4v

cab64b1

Fix test for glm4v_moe

88548a6

Fix test for qwen3_vl_for_conditional_generation

dec6f0d

Fix test for qwen3_vl_moe_for_conditional_generation

8e5cca3

Fix test for paligemma

372a6d0

Fix test for gemma3_conditional_generation

b48abb3

albertvillanova requested a review from Tcc0403 December 15, 2025 06:30

Tcc0403 approved these changes Dec 15, 2025

View reviewed changes

Tcc0403 mentioned this pull request Dec 15, 2025

Transformers v5 compatibility #978

Closed

12 tasks

Merge branch 'main' into fix-960-trl-4601

1a09b04

lancerts merged commit 153d226 into linkedin:main Dec 15, 2025
3 checks passed

albertvillanova mentioned this pull request Jan 6, 2026

CI fails with dev dependencies: AttributeError: 'Qwen2_5_VLForConditionalGeneration' object has no attribute 'language_model' huggingface/trl#4601

Closed

albertvillanova mentioned this pull request Jan 6, 2026

Hotfix CI with dev dependencies: xfail test_training_vlm_and_liger huggingface/trl#4777

Merged

qgallouedec mentioned this pull request Jan 26, 2026

Transformers v5 release: extend xfail condition for TestGRPOTrainer.test_training_vlm_and_liger and update version checks huggingface/trl#4898

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix missing property access for multimodal models#966

Fix missing property access for multimodal models#966
lancerts merged 20 commits intolinkedin:mainfrom
albertvillanova:fix-960-trl-4601

albertvillanova commented Dec 4, 2025

Uh oh!

Tcc0403 Dec 4, 2025

Uh oh!

albertvillanova Dec 5, 2025

Uh oh!

Tcc0403 Dec 4, 2025

Uh oh!

albertvillanova Dec 5, 2025

Uh oh!

albertvillanova Dec 5, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Tcc0403 commented Dec 5, 2025 •

edited

Loading

Uh oh!

Tcc0403 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments

Conversation

albertvillanova commented Dec 4, 2025

Summary

Details

Testing Done

Uh oh!

Tcc0403 Dec 4, 2025

Choose a reason for hiding this comment

Uh oh!

albertvillanova Dec 5, 2025

Choose a reason for hiding this comment

Uh oh!

Tcc0403 Dec 4, 2025

Choose a reason for hiding this comment

Uh oh!

albertvillanova Dec 5, 2025

Choose a reason for hiding this comment

Uh oh!

albertvillanova Dec 5, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Tcc0403 commented Dec 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Tcc0403 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments

Tcc0403 commented Dec 5, 2025 •

edited

Loading