feat: implement sequential prefill fallback for multimodal runner and add support for Qwen3.5 with vision encoder by barhanc · Pull Request #1102 · software-mansion/react-native-executorch

barhanc · 2026-04-27T13:31:21Z

Description

Implements sequential prefill fallback for multimodal runner required to support Qwen3.5 with vision capabilities.
Adds symbols for Qwen3.5 VL model
Adds model picker component to multimodal screen in LLM example app, similar to LLM screen

NOTE: Support for Qwen3.5 VL model is experimental right now since due to architectural constraints (namely the GatedDeltaNet implementation) model requires sequential prefill fallback which results in very slow prefill. Additionally the model output right now is not very good (repetition, etc.) which probably can be fixed by applying #1099 .

The best action for now is probably to wait until the ExecuTorch team adds better support for Qwen3.5 architecture.

Introduces a breaking change?

Yes
No

Type of change

Bug fix (change which fixes an issue)
New feature (change which adds functionality)
Documentation update (improves or adds clarity to existing documentation)
Other (chores, tests, code style improvements etc.)

Tested on

iOS
Android

Testing instructions

Run example app and test new VLM models

Screenshots

Related issues

Checklist

I have performed a self-review of my code
I have commented my code, particularly in hard-to-understand areas
I have updated the documentation accordingly
My changes generate no new warnings

Additional notes

Qwen3.5 VLM model was exported using https://github.com/barhanc/executorch/tree/rne-v1.2.0-export-qwen-3.5 .

msluszniak · 2026-04-29T14:29:58Z

For now you can rebase and:

Resolve conflicts
Test if merged feat(llm): min_p and repetition_penalty sampling, per-model defaults, letterbox vision #1099 actually fixed repetition. This will help us detect if the problem is model base or was caused by the incorrect configuration.

…to multimodal screen

…multimodal screen Co-authored-by: Copilot <copilot@github.com>

…odels

…n 3.5 VL 0.8B and 2B models

barhanc · 2026-04-30T12:52:43Z

Based on my limited testing it looks like the proper configuration actually somewhat helped. However, the models themselves aren't particularly good, especially compared to something like LFM2.5. And as of right now, they aren't really usable due to slow prefill.

ExecuTorch team merged the PR with custom op for GatedDeltaNet, but they reverted it later (pytorch/executorch#19178), so I guess we still have to wait.

barhanc self-assigned this Apr 27, 2026

barhanc added model Issues related to exporting, improving, fixing ML models blocked Issue blocked by some problems (but not other issue, use relationship -> blocker instead) feature PRs that implement a new feature labels Apr 27, 2026

barhanc and others added 6 commits April 30, 2026 13:30

feat(llm): implement sequential prefill fallback for multimodal runner

a2b26ae

feat(llm): add Qwen 3.5 VL support and model selection functionality …

2dfcc95

…to multimodal screen

fix(llm): update model selection from Qwen 3.5 VL to LFM2 VL 450M in …

4946797

…multimodal screen Co-authored-by: Copilot <copilot@github.com>

feat(llm): add generation configuration for Qwen 3.5 0.8B and 2B models

86ebbc9

feat(llm): add generation configuration for Qwen 3.5 VL 0.8B and 2B m…

c04c27b

…odels

feat(llm): add repetition penalty to generation configuration for Qwe…

97257ad

…n 3.5 VL 0.8B and 2B models

barhanc force-pushed the @bh/add-qwen3.5-vl branch from b006379 to 97257ad Compare April 30, 2026 12:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: implement sequential prefill fallback for multimodal runner and add support for Qwen3.5 with vision encoder#1102

feat: implement sequential prefill fallback for multimodal runner and add support for Qwen3.5 with vision encoder#1102
barhanc wants to merge 6 commits intomainfrom
@bh/add-qwen3.5-vl

barhanc commented Apr 27, 2026 •

edited

Loading

Uh oh!

msluszniak commented Apr 29, 2026

Uh oh!

barhanc commented Apr 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

barhanc commented Apr 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Introduces a breaking change?

Type of change

Tested on

Testing instructions

Screenshots

Related issues

Checklist

Additional notes

Uh oh!

msluszniak commented Apr 29, 2026

Uh oh!

barhanc commented Apr 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

barhanc commented Apr 27, 2026 •

edited

Loading