
docs: add Mistral Medium 3.5 VLM coverage and fine-tuning guide#2091

Merged
HuiyingLi merged 1 commit into main from huiyingl/docs-mistral-medium-3-5
Apr 29, 2026

Conversation

@HuiyingLi
Contributor

@HuiyingLi HuiyingLi commented Apr 29, 2026

Summary

  • Adds the public-facing documentation for Mistral AI's Mistral Medium 3.5 (a 128B FP8-native dense VLM with Pixtral vision tower + Ministral-3 text backbone, same architecture lineage as Devstral-2-123B).
  • Sibling implementation PR: feat: add mistral medium 3.5 #2090 — that PR ships the loader (Mistral3FP8VLMForConditionalGeneration), TP plan, PP forward, and 8-node MedPix recipe. This PR is documentation-only.
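The TP plan itself ships in the sibling implementation PR, not here, but the basic idea of tensor-parallel weight sharding can be sketched. This is an illustrative simulation only: the function name and the column-parallel split are assumptions for the sketch, not the actual NeMo AutoModel code path.

```python
import numpy as np

def shard_linear_weight(weight: np.ndarray, tp_rank: int, tp_size: int) -> np.ndarray:
    """Column-parallel shard of a linear layer's weight for one TP rank.

    Hypothetical helper for illustration; the real TP plan lives in the
    sibling implementation PR (#2090).
    """
    out_features = weight.shape[0]
    assert out_features % tp_size == 0, "out_features must divide tp_size evenly"
    chunk = out_features // tp_size
    # Each rank keeps a contiguous slice of output rows.
    return weight[tp_rank * chunk:(tp_rank + 1) * chunk, :]

# With TP=8 (as in the 8-node recipe), each rank holds 1/8 of the rows,
# and concatenating all shards reconstructs the full weight.
w = np.arange(64, dtype=np.float32).reshape(16, 4)
shards = [shard_linear_weight(w, r, 8) for r in range(8)]
reassembled = np.concatenate(shards, axis=0)
```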

🤖 Generated with Claude Code

@copy-pr-bot

copy-pr-bot Bot commented Apr 29, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@HuiyingLi HuiyingLi added the docs-only label Apr 29, 2026
@HuiyingLi
Contributor Author

/ok to test c500cc7

Mistral AI's new flagship 128B dense VLM is now supported in NeMo
AutoModel via the Mistral3FP8VLMForConditionalGeneration custom class
(Pixtral vision tower + Ministral-3 dense decoder, FP8 on disk).
Mistral Medium 3.5 merges Mistral Medium 3.1, Magistral Medium, and
Devstral 2 into a single checkpoint with a configurable reasoning mode
and a 256k context window — open-weights under a modified MIT license.
Architecturally it shares the dense Ministral-3 text backbone with
mistralai/Devstral-2-123B-Instruct-2512.

This commit adds the documentation for that model:

- docs/model-coverage/vlm/mistralai/mistral-medium-3-5.md: model
  coverage page covering architecture (dense, 88-layer Ministral-3
  text backbone + Pixtral vision tower), strengths/trade-offs, use
  cases, recipe, install/run snippets. Modeled on mistral-small-4.md.
- docs/guides/vlm/mistral-medium-3-5.md: end-to-end fine-tuning guide
  on MedPix-VQA with the 8-node TP=8 PP=8 recipe, including a
  walkthrough of the FP8 dequantize-on-load path. Modeled on the
  Qwen3.5-VL guide.
- docs/model-coverage/vlm/index.md: add Mistral Medium 3.5 row to
  supported models table and toctree.
- docs/model-coverage/latest-models.md: prepend release row.
- docs/index.md: add Mistral Medium 3.5 VL to the guides toctree.
- README.md: add news bullet.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
Signed-off-by: HuiyingLi <willwin.lee@gmail.com>
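The guide's FP8 dequantize-on-load walkthrough is not reproduced in this PR thread, but the core arithmetic of per-tensor scaled dequantization can be sketched. This is a minimal dependency-free simulation under stated assumptions: the E4M3 scale convention and both function names are illustrative, not the shipped NeMo AutoModel loader.

```python
import numpy as np

FP8_E4M3_MAX = 448.0  # max representable magnitude in FP8 E4M3

def quantize_per_tensor(weights: np.ndarray):
    """Simulate per-tensor FP8 quantization: scale values into the FP8
    range and keep one float scale per tensor. (The 'FP8' values stay
    float32 here to avoid dependencies; real checkpoints store true FP8.)"""
    scale = np.abs(weights).max() / FP8_E4M3_MAX
    return weights / scale, scale

def dequantize_on_load(q: np.ndarray, scale: float) -> np.ndarray:
    """Recover higher-precision weights at load time: q * scale."""
    return (q * scale).astype(np.float32)

w = np.array([-1.5, 0.25, 3.0], dtype=np.float32)
q, s = quantize_per_tensor(w)
restored = dequantize_on_load(q, s)
```

Because this simulation skips the actual FP8 rounding step, the roundtrip is exact; a real FP8 cast would introduce small quantization error.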
@HuiyingLi HuiyingLi force-pushed the huiyingl/docs-mistral-medium-3-5 branch from c500cc7 to e5aeebb on April 29, 2026 15:23
@HuiyingLi
Contributor Author

/ok to test e5aeebb

@HuiyingLi HuiyingLi enabled auto-merge (squash) April 29, 2026 15:27
@HuiyingLi HuiyingLi disabled auto-merge April 29, 2026 15:31
@HuiyingLi HuiyingLi merged commit fab3f81 into main Apr 29, 2026
33 of 34 checks passed
@HuiyingLi HuiyingLi deleted the huiyingl/docs-mistral-medium-3-5 branch April 29, 2026 15:31

2 participants