[fix] mistral 4 docs #44776

Merged
stevhliu merged 1 commit into huggingface:main from stevhliu:mistral-fix
Mar 16, 2026
Conversation

@stevhliu
Member

The doc-builder is breaking because it can't find `Mistral4ForQuestionAnswering`, which doesn't appear to exist.
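
For readers who want to catch this kind of failure before the doc build runs, here is a minimal sketch of a pre-check. It assumes the model doc page references classes with `[[autodoc]] ClassName` lines and reports any whose name is not an attribute of the package. The helper `missing_autodoc_targets`, the stand-in namespace `fake_pkg`, and the class names placed in it are hypothetical and only for illustration; this is not how doc-builder itself resolves names.

```python
import re
from types import SimpleNamespace

# Matches lines such as "[[autodoc]] SomeClass" or "[[autodoc]] SomeClass.method".
AUTODOC_RE = re.compile(r"^\[\[autodoc\]\]\s+([A-Za-z_][\w.]*)", re.MULTILINE)


def missing_autodoc_targets(markdown: str, module) -> list[str]:
    """Return autodoc targets whose top-level name is not an attribute of `module`."""
    missing = []
    for target in AUTODOC_RE.findall(markdown):
        top_level = target.split(".")[0]  # "Class.method" -> "Class"
        if not hasattr(module, top_level):
            missing.append(target)
    return missing


# Hypothetical stand-in for the real package, so the sketch runs without it installed.
fake_pkg = SimpleNamespace(Mistral4Model=object, Mistral4ForCausalLM=object)

doc = """
## Mistral4Model

[[autodoc]] Mistral4Model

## Mistral4ForQuestionAnswering

[[autodoc]] Mistral4ForQuestionAnswering
"""

print(missing_autodoc_targets(doc, fake_pkg))
```

In practice, one would pass the imported package in place of `fake_pkg` and the contents of the model's doc file in place of `doc`.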

@stevhliu stevhliu requested review from ArthurZucker and vasqu March 16, 2026 20:53
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Contributor

@vasqu vasqu left a comment

Thanks!

@stevhliu stevhliu added this pull request to the merge queue Mar 16, 2026
Merged via the queue into huggingface:main with commit 75c836b Mar 16, 2026
17 checks passed
@stevhliu stevhliu deleted the mistral-fix branch March 16, 2026 21:11
NielsRogge pushed a commit to NielsRogge/transformers that referenced this pull request Mar 17, 2026
vasqu added a commit that referenced this pull request Mar 25, 2026
* First draft

* [Videomt] Extend query-stage parity checks to 3-frame inputs

* [Videomt] Add full-model parity check against EoMT reference

* [Videomt] Compare conversion against official GitHub reference

* [Videomt] Simplify conversion to checkpoint-based HF mapping

* [Videomt] Add --verify mode against upstream GitHub implementation

* [Videomt] Improve --verify diagnostics with key remapping and layer checks

* [Videomt] Improve verify backbone candidate fallback and remapping

* [Videomt] Add DINOv3 verify compatibility patch and progress logging

* [Videomt] Extend verify diagnostics with MLP/head parity checks

* [Videomt] Make --verify succeed for converted weight mapping scope

* [videomt] Improve verify adapters and candidate traceback diagnostics

* [videomt] Adapt verify _pos_embed output for DINOv3 candidates

* [videomt] Enable DINOv3 verify candidate by adapting EVA head_dim

* [videomt] Add pre-query layer diagnostics to verify flow

* [videomt] Add deterministic verify probes and deeper pre-query diffs

* [videomt] Penalize skipped keys in verify candidate scoring

* [videomt] Add no-rope A/B diagnostics to verify pre-query layers

* [videomt] Add branch-level pre-query diagnostics to verify

* [videomt] Add fine-grained MLP diagnostics to verify

* [videomt] Verify layer-scale mapping parity in --verify

* [videomt] Validate MLP diagnostic decomposition in verify

* [videomt] Add token-group diagnostics for layer-4 MLP divergence

* [VidEoMT] Add temporal query updater path and re-verify yt_2019_vit_small

* [VidEoMT] Refine 5D execution order and re-check small checkpoint parity

* Simplify conversion script and convert all dinov2 checkpoints

* Add id2label mappings

* Fix all tests

* Add to auto mapping

* Simplify verify_conversion_against_github_reference

* Update absolute tolerance

* Update date

* Revert AGENTS.md

* Address comments

* Add circleci skill, fix circleci

* Fix CI

* Remove skills from git

* Address comments

* Address more comments

* Address comment

* Add docstrigns

* Restore AGENTS.md

* Address comment

* fix this one

* Address comments

* [fix] mistral 4 docs (#44776)

fix

* Address comment

* add expectations

* Update date

* Make fix-repo

* fix multi gpu

* fix with changes on main

* fix date

---------

Co-authored-by: vasqu <antonprogamer@gmail.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com>
zucchini-nlp pushed a commit to zucchini-nlp/transformers that referenced this pull request Mar 27, 2026
NielsRogge added a commit to NielsRogge/transformers that referenced this pull request Mar 30, 2026