feat: add Qwen3.6-35B-A3B VLM finetune recipe by HuiyingLi · Pull Request #1882 · NVIDIA-NeMo/Automodel

HuiyingLi · 2026-04-16T18:53:14Z

Summary

Adds examples/vlm_finetune/qwen3_5_moe/qwen3_6_35b.yaml — a MedPix-VQA fine-tuning recipe for Qwen/Qwen3.6-35B-A3B (next-gen Qwen3 MoE, same qwen3_5_moe arch).
Adds news bullet in README.md, a row in docs/model-coverage/latest-models.md, and entries in docs/model-coverage/vlm/qwen/qwen3-5-vl.md (available models + example recipes).
Follows the docs pattern established by feat: minimax m27 #1785.

Test plan

100-step run on 8×H100 (ep=8, FSDP2) completes cleanly: loss 1.86 → ~1.5, peak mem 64 GiB/GPU.
wandb logging verified: https://wandb.ai/Nemo-automodel/qwen3_6_35b/runs/afaoc6s7
Docs build passes in CI
Reviewer confirms news date / model-coverage entries render correctly

Note: requires the collate_fn fix in #1799 for the default MedPix recipe (local_batch_size: 1, max_length: 2048) to avoid occasional batches where the only sample exceeds max_length.

🤖 Generated with Claude Code

copy-pr-bot · 2026-04-16T18:53:18Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

Adds a ready-to-run MedPix-VQA fine-tuning recipe for `Qwen/Qwen3.6-35B-A3B` under the existing `qwen3_5_moe` architecture (same custom model impl). Verified on 8×H100: 100 steps complete, loss 1.86 → ~1.5, peak mem 64 GiB/GPU. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> Signed-off-by: HuiyingLi <willwin.lee@gmail.com>

HuiyingLi · 2026-04-16T18:55:10Z

/ok to test f6a4f99

Adds a ready-to-run MedPix-VQA fine-tuning recipe for `Qwen/Qwen3.6-35B-A3B` under the existing `qwen3_5_moe` architecture (same custom model impl). Verified on 8×H100: 100 steps complete, loss 1.86 → ~1.5, peak mem 64 GiB/GPU. Signed-off-by: HuiyingLi <willwin.lee@gmail.com> Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

HuiyingLi requested review from ZhiyuLi-Nvidia, adil-a, akoumpa, hemildesai, jgerh, pthombre and snowmanwwg as code owners April 16, 2026 18:53

HuiyingLi force-pushed the huiyingl/add_qwen3_6_35b_config branch from 7a16517 to f6a4f99 Compare April 16, 2026 18:54

HuiyingLi added the docs-only With great power comes great responsibility. label Apr 16, 2026

copy-pr-bot Bot temporarily deployed to nemo-ci April 16, 2026 18:55 Inactive

hemildesai approved these changes Apr 16, 2026

View reviewed changes

snowmanwwg approved these changes Apr 16, 2026

View reviewed changes

snowmanwwg merged commit 906ecae into main Apr 16, 2026
32 checks passed

snowmanwwg deleted the huiyingl/add_qwen3_6_35b_config branch April 16, 2026 19:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add Qwen3.6-35B-A3B VLM finetune recipe#1882

feat: add Qwen3.6-35B-A3B VLM finetune recipe#1882
snowmanwwg merged 1 commit intomainfrom
huiyingl/add_qwen3_6_35b_config

HuiyingLi commented Apr 16, 2026

Uh oh!

copy-pr-bot Bot commented Apr 16, 2026

Uh oh!

HuiyingLi commented Apr 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

HuiyingLi commented Apr 16, 2026

Summary

Test plan

Uh oh!

copy-pr-bot Bot commented Apr 16, 2026

Uh oh!

HuiyingLi commented Apr 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants