Skip to content

Enabled Qwen3-VL embedding model#923

Open
quic-amitraj wants to merge 1 commit intoquic:mainfrom
quic-amitraj:qwen3_vl_embedding_only_pr
Open

Enabled Qwen3-VL embedding model#923
quic-amitraj wants to merge 1 commit intoquic:mainfrom
quic-amitraj:qwen3_vl_embedding_only_pr

Conversation

@quic-amitraj
Copy link
Copy Markdown
Contributor

@quic-amitraj quic-amitraj commented Apr 20, 2026

Adds embedding-model support for Qwen/Qwen3-VL-Embedding-8B.
[MAD] CPU vs AI100 mean=1.585330e-05, max=3.049895e-04

@quic-amitraj quic-amitraj self-assigned this Apr 20, 2026
Signed-off-by: Amit Raj <amitraj@qti.qualcomm.com>
@quic-amitraj quic-amitraj force-pushed the qwen3_vl_embedding_only_pr branch from 8c05c08 to 3eb106a Compare April 20, 2026 10:27
@quic-amitraj quic-amitraj marked this pull request as ready for review April 20, 2026 10:33
@quic-rishinr quic-rishinr requested a review from qcdipankar April 24, 2026 05:41

CONFIG_PATH = "tests/configs/image_text_model_configs.json"

DEFAULT_MAD_MAX = 1e-3
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can we move this to constants file

Comment on lines +98 to +100
config.text_config.num_hidden_layers = model_cfg["num_layers"]
config.vision_config.depth = model_cfg["vision_depth"]
config.vision_config.deepstack_visual_indexes = [model_cfg["deepstack_index"]]
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this we can comment if we are planning to run full layer?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants