Added data generation & training support for qwen3 vl model #228

shanjiaz · 2025-12-18T21:00:18Z

vl model support:

Extract text_config from multimodal model configs in vocab mapping, training, and model initialization
Added kwargs to support vision language parameters like deepstack_input_embeds in custom worker for Qwen3-VL's multimodal processing
Use vLLM's native get_language_model() method to access the text model in vision-language architectures
Detect base model structure for vision-language models

Util changes:

Add suffix pattern matching for weight keys (e.g., "embed_tokens.weight" matches "model.embed_tokens.weight")
Support single-file safetensors models (no index.json) by building virtual weight maps
Run e2e test for a qwen 3 vl model

Signed-off-by: shanjiaz <zsjwpianpian@gmail.com>

github-actions · 2025-12-18T21:00:55Z

📦 Build Artifacts Available
The build artifacts (`.whl` and `.tar.gz`) have been successfully generated and are available for download: https://github.com/vllm-project/speculators/actions/runs/20759905084/artifacts/5041230401.
They will be retained for up to 30 days.
Commit: 9c5031d

Signed-off-by: shanjiaz <zsjwpianpian@gmail.com>

github-actions · 2025-12-18T22:06:44Z

Summary

Status	Count
🔍 Total	158
✅ Successful	155
⏳ Timeouts	0
🔀 Redirected	2
👻 Excluded	0
❓ Unknown	0
🚫 Errors	1
⛔ Unsupported	0

Errors per input

Errors in docs/developer/code-of-conduct.md

[ERROR] https://www.contributor-covenant.org/translations/ | Network error: Connection reset by server. Server forcibly closed connection

Redirects per input

Redirects in CONTRIBUTING.md

[200] https://docs.pytest.org/ | Redirect: Followed 1 redirect resolving to the final status of: OK. Redirects: https://docs.pytest.org/ --> https://docs.pytest.org/en/stable/

Redirects in docs/developer/contributing.md

[200] https://docs.pytest.org/ | Redirect: Followed 1 redirect resolving to the final status of: OK. Redirects: https://docs.pytest.org/ --> https://docs.pytest.org/en/stable/
Full Github Actions output

Signed-off-by: shanjiaz <zsjwpianpian@gmail.com>

fynnsu

Looks good!

src/speculators/data_generation/custom_worker.py

Signed-off-by: shanjiaz <zsjwpianpian@gmail.com>

rahul-tuli

LGTM! great job

Added data generation & training support for qwen3 vl model

28c8c28

Signed-off-by: shanjiaz <zsjwpianpian@gmail.com>

shanjiaz added 2 commits December 18, 2025 21:02

style

d961226

Signed-off-by: shanjiaz <zsjwpianpian@gmail.com>

style

8125605

Signed-off-by: shanjiaz <zsjwpianpian@gmail.com>

shanjiaz force-pushed the support-qwen3-vl branch from 661718b to 8125605 Compare December 18, 2025 22:07

shanjiaz and others added 7 commits December 18, 2025 22:16

fix types

511cec3

Signed-off-by: shanjiaz <zsjwpianpian@gmail.com>

format

dbd724d

Signed-off-by: shanjiaz <zsjwpianpian@gmail.com>

Merge branch 'main' into support-qwen3-vl

0daa978

cleanup

d62aebc

Signed-off-by: shanjiaz <zsjwpianpian@gmail.com>

pre commit

c37cdb1

Signed-off-by: shanjiaz <zsjwpianpian@gmail.com>

comments

0ee1ca3

Signed-off-by: shanjiaz <zsjwpianpian@gmail.com>

Merge branch 'main' into support-qwen3-vl

3a2c5a2

fynnsu previously approved these changes Jan 5, 2026

View reviewed changes

src/speculators/data_generation/custom_worker.py Outdated Show resolved Hide resolved

shanjiaz and others added 2 commits January 5, 2026 13:15

Merge branch 'main' into support-qwen3-vl

8aa15cf

use kwargs instead

8f7336c

Signed-off-by: shanjiaz <zsjwpianpian@gmail.com>

shanjiaz dismissed fynnsu’s stale review via 8f7336c January 5, 2026 18:17

shanjiaz marked this pull request as ready for review January 5, 2026 18:17

shanjiaz requested review from MeganEFlynn, fynnsu and rahul-tuli January 5, 2026 18:53

fynnsu approved these changes Jan 6, 2026

View reviewed changes

Merge branch 'main' into support-qwen3-vl

9c5031d

rahul-tuli approved these changes Jan 7, 2026

View reviewed changes

shanjiaz merged commit 8374278 into main Jan 7, 2026
14 checks passed

shanjiaz deleted the support-qwen3-vl branch January 7, 2026 16:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Added data generation & training support for qwen3 vl model #228

Added data generation & training support for qwen3 vl model #228

shanjiaz commented Dec 18, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Dec 18, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Dec 18, 2025

Uh oh!

fynnsu left a comment

Uh oh!

Uh oh!

rahul-tuli left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Added data generation & training support for qwen3 vl model #228

Added data generation & training support for qwen3 vl model #228

Conversation

shanjiaz commented Dec 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Dec 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Dec 18, 2025

Summary

Errors per input

Errors in docs/developer/code-of-conduct.md

Redirects per input

Redirects in CONTRIBUTING.md

Redirects in docs/developer/contributing.md

Uh oh!

fynnsu left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

rahul-tuli left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

shanjiaz commented Dec 18, 2025 •

edited

Loading

github-actions bot commented Dec 18, 2025 •

edited

Loading