
[mergeability] Cluster cluster-43366-4: merged 1 PR #2

Closed
evalstate wants to merge 10 commits into main from
merge-cluster-cluster-43366-4-20260421165257

Conversation

@evalstate
Owner

Cluster: cluster-43366-4
Base: origin/main

Merged:

Skipped:

Failed:

  • None.

Notes:

Next steps:

evalstate pushed a commit that referenced this pull request Apr 23, 2026
* Support the new privacy model (second try)

This time I'm building the conversion scripts and everything from the main branch, rather than building on top of the existing GPT-OSS support (#2).

Tested by converting multiple checkpoints and comparing logits and predictions on several texts.
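The logits-and-predictions check described above can be sketched roughly as follows. This is a minimal illustration only; the function name, tolerance, and toy data are assumptions, not the PR's actual test script:

```python
# Hedged sketch: comparing logits of a converted checkpoint against the
# upstream model. Names and the atol value are illustrative assumptions.

def compare_logits(ref_logits, new_logits, atol=1e-4):
    """Return (max_abs_diff, ok) for two logit matrices.

    ref_logits / new_logits: per-token logit rows (list[list[float]]).
    ok is True when numeric drift stays under atol AND the argmax
    prediction for every token row is unchanged.
    """
    max_diff = 0.0
    preds_match = True
    for ref_row, new_row in zip(ref_logits, new_logits):
        for r, n in zip(ref_row, new_row):
            max_diff = max(max_diff, abs(r - n))
        # prediction = argmax over the vocabulary row
        ref_pred = max(range(len(ref_row)), key=ref_row.__getitem__)
        new_pred = max(range(len(new_row)), key=new_row.__getitem__)
        if ref_pred != new_pred:
            preds_match = False
    return max_diff, (max_diff <= atol and preds_match)

# Toy example: identical predictions, tiny numeric drift
ref = [[0.1, 2.0, -1.0], [3.0, 0.5, 0.2]]
new = [[0.1, 2.00005, -1.0], [3.00002, 0.5, 0.2]]
diff, ok = compare_logits(ref, new)
```

In practice the same check would run over several input texts and multiple converted checkpoints, as the commit message describes.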

Signed-off-by: Mihai Maruseac <mihaimaruseac@openai.com>

* Keep the existing gpt_oss converter and duplicate it rather than DRY

Signed-off-by: Mihai Maruseac <mihaimaruseac@openai.com>

* Implement reviewer feedback

This is a partial implementation, as we have also simplified the model since then. For example, there is no longer a need for scale tensors.

Documentation and some integration tests are still to do, but I wanted a next round of review.

Signed-off-by: Mihai Maruseac <mihaimaruseac@openai.com>

* Add an integration test

Signed-off-by: Mihai Maruseac <mihaimaruseac@openai.com>

* Add a stub of documentation

Signed-off-by: Mihai Maruseac <mihaimaruseac@openai.com>

* More modular and interface-friendly

* fixup

* fix

* fixup

* Make export still match logits and predictions with upstream model

Signed-off-by: Mihai Maruseac <mihaimaruseac@openai.com>

* alignments

* Migrate to newest model checkpoint

Signed-off-by: Mihai Maruseac <mihaimaruseac@openai.com>

* Remove some of the code that is no longer needed

Signed-off-by: Mihai Maruseac <mihaimaruseac@openai.com>

* Remove some of the code that is no longer needed

Signed-off-by: Mihai Maruseac <mihaimaruseac@openai.com>

* sync with latest changes

* Keep sinks in fp32; adjust FA and Flex to cast into half (less precise)

Needs MoE to be fixed.
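The precision concern flagged here (fp32 sink values cast to half at the attention-kernel boundary) can be illustrated with a stdlib round-trip through IEEE 754 half precision. The variable names are illustrative, not the PR's actual tensors:

```python
# Hedged sketch: emulate the fp32 -> fp16 cast that FA/Flex kernels would
# apply to attention "sink" values, using struct's 'e' (binary16) format.
import struct

def to_half(x: float) -> float:
    """Round-trip a float through IEEE 754 half precision."""
    return struct.unpack('<e', struct.pack('<e', x))[0]

sink_fp32 = 0.1234567            # value kept in fp32 by the model
sink_half = to_half(sink_fp32)   # what the kernel sees after the cast

precision_loss = abs(sink_fp32 - sink_half)
```

Keeping the stored tensor in fp32 and casting only at the kernel boundary confines this rounding error to the attention computation instead of baking it into the checkpoint.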

* Try to fix MoE accumulation

* push to sync

* default to eager moe

* Let's re-add this after the merge, just for the signature

* last nits

* Apply suggestions from code review

Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com>

* Migrate to the proper name

Signed-off-by: Mihai Maruseac <mihaimaruseac@openai.com>

* Sort

Signed-off-by: Mihai Maruseac <mihaimaruseac@openai.com>

* Sort

Signed-off-by: Mihai Maruseac <mihaimaruseac@openai.com>

* Finalize documentation

Signed-off-by: Mihai Maruseac <mihaimaruseac@openai.com>

* small fixes

* style

---------

Signed-off-by: Mihai Maruseac <mihaimaruseac@openai.com>
Co-authored-by: Mihai Maruseac <mihaimaruseac@openai.com>
@evalstate evalstate closed this Apr 23, 2026
