Skip to content

[GLM-4.7] GLM-4.7-Flash Model Supoort#43031

Merged
ArthurZucker merged 20 commits intohuggingface:mainfrom
zRzRzRzRzRzRzR:glm
Jan 13, 2026
Merged

[GLM-4.7] GLM-4.7-Flash Model Supoort#43031
ArthurZucker merged 20 commits intohuggingface:mainfrom
zRzRzRzRzRzRzR:glm

Conversation

@zRzRzRzRzRzRzR
Copy link
Copy Markdown
Contributor

@zRzRzRzRzRzRzR zRzRzRzRzRzRzR commented Dec 24, 2025

GLM-Lite is the class updated for GLM-4.7-Flash, which adopts a different model architecture from GLM-4.7. The intermediate state was named GLM-Lite and has now been renamed to GLM-4.7-Flash. At the same time, a PR has been submitted to improve the documentation.

@HuggingFaceDocBuilderDev
Copy link
Copy Markdown

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@zRzRzRzRzRzRzR
Copy link
Copy Markdown
Contributor Author

@ArthurZucker

Copy link
Copy Markdown
Collaborator

@ArthurZucker ArthurZucker left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Very nice! Almost nothing to say but that we might want to go explicit for mlp type maybe? not a must its more for use @Cyrilvallez and @vasqu ?

Comment thread src/transformers/conversion_mapping.py
Comment thread src/transformers/models/glm4_moe_lite/configuration_glm4_moe_lite.py Outdated
Comment thread src/transformers/models/glm4_moe_lite/modular_glm4_moe_lite.py
@zRzRzRzRzRzRzR zRzRzRzRzRzRzR changed the title GLM Model Tester(Lite) [GLM-4.7] GLM-Lite Supoort Jan 9, 2026
Comment thread src/transformers/models/glm4_moe_lite/modular_glm4_moe_lite.py Outdated
@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Jan 9, 2026

[For maintainers] Suggested jobs to run (before merge)

run-slow: auto, glm4_moe, glm4_moe_lite

Copy link
Copy Markdown
Collaborator

@ArthurZucker ArthurZucker left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks!

@ArthurZucker ArthurZucker merged commit 76732b4 into huggingface:main Jan 13, 2026
23 checks passed
@zRzRzRzRzRzRzR zRzRzRzRzRzRzR deleted the glm branch January 13, 2026 11:20
Comment thread docs/source/en/_toctree.yml
@zRzRzRzRzRzRzR zRzRzRzRzRzRzR changed the title [GLM-4.7] GLM-Lite Supoort [GLM-4.7] GLM-4.7-Flash Model Supoort Jan 18, 2026
SangbumChoi pushed a commit to SangbumChoi/transformers that referenced this pull request Jan 23, 2026
* draft

* update

* Delete test_text.py

* update

* ignore

* Delete model.safetensors.index_30.json

* update for test

* Update tokenization_auto.py

* update test

* Update test_modeling_glm4_moe_lite.py

* update

* update

* using GradientCheckpointingLayer

* 2

* 3

* remove first_k_dense_replace

* 1
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants