Avoid hard failure for gpt-oss GGUF architecture by falling back to g… by TheSanjBot · Pull Request #43757 · huggingface/transformers

TheSanjBot · 2026-02-05T07:56:19Z

What does this PR do?

This PR avoids a hard failure when loading GGUF models that declare the
gpt-oss architecture.

Currently, such models raise a ValueError during GGUF config loading.
This change maps gpt-oss to the closest supported architecture
(gpt-neox) and emits a clear warning to communicate current limitations.

The goal is to allow GGUF checkpoints using gpt-oss to be loaded without
crashing, enabling downstream tools (e.g. vLLM) to proceed.

Notes / Limitations

This does not implement full GPT-OSS support.
MoE layers are not supported and inference correctness is not guaranteed.
This is a best-effort fallback to avoid hard failure, not a claim of correctness.

Fixes #43366

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

No tests were added as GGUF integration tests require large binary artifacts
and this change only affects architecture handling and error prevention.

Who can review?

cc @SunMarc @Rocketknight1

…pt-neox

…ssor

TheSanjBot · 2026-02-05T09:27:08Z

CI failure seems unrelated to this PR.

The failing test (test_sample_generate_dict_output in GLM Image) is marked as FLAKY
and fails inside modeling_glm_image.py, which is not touched here.

Could you please rerun CI?

github-actions · 2026-02-05T17:42:11Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: grounding_dino

…upersedes huggingface#43757) latest (huggingface#45506) * Add GPT-OSS GGUF support with YaRN rope scaling reconstruction * Add GGUF loading test suite for GPT‑OSS * docs: add GGUF loading section to gpt_oss.md * fix: correct import of GptOssTensorProcessor in test; remove from model __all__ * Finalize GPT‑OSS GGUF support: move test, adjust config reconstruction * fixed docs not closing example bracket * Fix lint: remove trailing whitespace * Fix tensor construction consistency * reverting to original docs

TheSanjBot added 3 commits February 5, 2026 13:16

Avoid hard failure for gpt-oss GGUF architecture by falling back to g…

d205ef7

…pt-neox

Declare pixel_mask in GroundingDinoProcessor model_input_names

de90f48

Fix duplicate valid_processor_kwargs definition in GroundingDinoProce…

37a7c31

…ssor

Merge branch 'main' into gguf-gpt-oss-support

a17da9b

This was referenced Mar 30, 2026

Add full GGUF loading support for GPT‑OSS (fixes #43366) #45116

Closed

Add full GGUF loading support for GPT‑OSS (fixes #43366, supersedes #43757) #45118

Closed

This was referenced Apr 18, 2026

Add full GGUF loading support for GPT‑OSS (fixes #43366, supersedes #43757) latest #45500

Closed

Add full GGUF loading support for GPT‑OSS (fixes #43366, supersedes #43757) latest #45506

Merged

evalstate mentioned this pull request Apr 21, 2026

[mergeability] Cluster cluster-43366-4: merged 1 PRs evalstate/transformers#2

Closed

evalstate mentioned this pull request Apr 28, 2026

Cumulative defect fixes from recent Transformers PRs evalstate/transformers#41

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Avoid hard failure for gpt-oss GGUF architecture by falling back to g…#43757

Avoid hard failure for gpt-oss GGUF architecture by falling back to g…#43757
TheSanjBot wants to merge 4 commits intohuggingface:mainfrom
TheSanjBot:gguf-gpt-oss-support

TheSanjBot commented Feb 5, 2026

Uh oh!

TheSanjBot commented Feb 5, 2026

Uh oh!

github-actions Bot commented Feb 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

TheSanjBot commented Feb 5, 2026

What does this PR do?

Notes / Limitations

Before submitting

Who can review?

Uh oh!

TheSanjBot commented Feb 5, 2026

Uh oh!

github-actions Bot commented Feb 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant