[Model] support other model #3786

zhang-prog · 2025-09-01T15:34:09Z

No description provided.

paddle-bot · 2025-09-01T15:34:14Z

Thanks for your contribution!

Copilot

Pull Request Overview

This PR adds support for a new multimodal model called "QF VL" (QFVLForConditionalGeneration) to the FastDeploy framework. The implementation includes comprehensive multimodal processing capabilities for handling both images and videos alongside text inputs.

Key changes include:

Addition of the QF VL model architecture with SigLIP vision transformer backbone
Implementation of multimodal input processing pipeline for text, images, and videos
Integration of the new model type into the existing model registry and preprocessing workflow

Reviewed Changes

Copilot reviewed 13 out of 13 changed files in this pull request and generated 4 comments.

Show a summary per file

File	Description
fastdeploy/worker/gpu_model_runner.py	Adds QF model-specific RoPE embedding handling and vision feature extraction
fastdeploy/multimodal/utils.py	Removes unused import and adds formatting cleanup
fastdeploy/multimodal/registry.py	Registers the new QFVLForConditionalGeneration model type
fastdeploy/model_executor/models/qf_vl/siglip.py	Implements SigLIP vision transformer with rotary embeddings and attention mechanisms
fastdeploy/model_executor/models/qf_vl/qf_vl.py	Defines the main QF VL model architecture and weight management
fastdeploy/model_executor/models/qf_vl/projector.py	Implements vision-to-text feature projection layer
fastdeploy/model_executor/models/qf_vl/config.py	Defines configuration classes for the QF VL model
fastdeploy/model_executor/models/qf_vl/init.py	Module initialization file
fastdeploy/input/qf_vl_processor/qf_vl_processor.py	Main processor for handling QF VL multimodal inputs
fastdeploy/input/qf_vl_processor/process.py	Core data processing logic for tokenization and multimodal handling
fastdeploy/input/qf_vl_processor/image_processor.py	Image and video preprocessing implementation
fastdeploy/input/qf_vl_processor/init.py	Module initialization for QF VL processor
fastdeploy/input/preprocess.py	Integrates QF VL processor into the preprocessing pipeline

Comments suppressed due to low confidence (1)

fastdeploy/input/qf_vl_processor/process.py:1

The dtype parameter should be passed to np.concatenate as a separate argument, not as a keyword argument. The correct syntax is np.concatenate([...]).astype(np.int64).

"""

Copilot · 2025-09-02T02:31:05Z

fastdeploy/worker/gpu_model_runner.py

+                        )
+                    else:


The conditional logic for QF model handling should include the original code inside the else block. The current structure suggests the else block is empty, which may break existing functionality for non-QF models.

Copilot · 2025-09-02T02:31:06Z

fastdeploy/model_executor/models/qf_vl/siglip.py

+            patch_embeds = self.patch_embedding(pixel_values.to(dtype=target_dtype))  # shape = [*, width, grid, grid]
+            embeddings = patch_embeds.flatten(-2).squeeze(-1)
+            embeddings = rearrange(embeddings, "(b l) d -> b l d", b=batch_size, l=squence_len)
+            # todo: not dubug


The comment contains a typo. 'dubug' should be 'debug'.

Suggested change

# todo: not dubug

# todo: not debug

Copilot · 2025-09-02T02:31:06Z

fastdeploy/model_executor/models/qf_vl/qf_vl.py

+                    **({extra.value: True} if extra else {}),
+                }
+
+                if "lm_head.weight" or "" in weight_name:


This condition will always evaluate to True because 'lm_head.weight' is a non-empty string. The intended logic appears to be checking if weight_name contains 'lm_head.weight' or is empty.

Suggested change

if "lm_head.weight" or "" in weight_name:

if "lm_head.weight" in weight_name or weight_name == "":

Copilot · 2025-09-02T02:31:06Z

fastdeploy/input/qf_vl_processor/qf_vl_processor.py

+        tool_parser_obj=None,
+    ):
+        """
+        Initialize QwenVLProcessor instance.


The docstring incorrectly refers to 'QwenVLProcessor' instead of 'QFVLProcessor'.

Suggested change

Initialize QwenVLProcessor instance.

Initialize QFVLProcessor instance.

Jiang-Jia-Jun · 2025-09-02T02:32:29Z

fastdeploy/model_executor/models/qf_vl/projector.py

@@ -0,0 +1,107 @@
+"""
+# Copyright (c) 2024 PaddlePaddle Authors. All Rights Reserved.


zhang-prog · 2026-01-21T09:03:05Z

In #4396

root and others added 2 commits September 1, 2025 23:32

init

afff30f

update code

f35aa8e

Jiang-Jia-Jun requested a review from Copilot September 2, 2025 02:30

Copilot AI reviewed Sep 2, 2025

View reviewed changes

Jiang-Jia-Jun requested changes Sep 2, 2025

View reviewed changes

zhang-prog added 6 commits September 8, 2025 17:40

fix code style & disable thinking

ec2a801

adapt for common_engine.update_mm_requests_chunk_size

6abc457

Merge remote-tracking branch 'origin/develop' into feat/qfvl

3c320de

use 3d rope

54cc597

use flash_attn_unpadded

da7c3a3

opt siglip

65c3b3f

zhang-prog force-pushed the feat/qfvl branch from 4ddac23 to 65c3b3f Compare September 24, 2025 09:49

zhang-prog added 4 commits September 24, 2025 18:15

Merge remote-tracking branch 'official/develop' into feat/qfvl

fd77bac

update to be compatible with the latest codebase

f816077

fix typo

a8cdc6e

rename

6bfb15e

zhang-prog closed this Jan 21, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Model] support other model #3786

[Model] support other model #3786

Uh oh!

zhang-prog commented Sep 1, 2025

Uh oh!

paddle-bot bot commented Sep 1, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Sep 2, 2025

Uh oh!

Copilot AI Sep 2, 2025

Uh oh!

Copilot AI Sep 2, 2025

Uh oh!

Copilot AI Sep 2, 2025

Uh oh!

Jiang-Jia-Jun Sep 2, 2025

Uh oh!

zhang-prog commented Jan 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	if "lm_head.weight" or "" in weight_name:
	if "lm_head.weight" in weight_name or weight_name == "":

	Initialize QwenVLProcessor instance.
	Initialize QFVLProcessor instance.

		@@ -0,0 +1,107 @@
		"""
		# Copyright (c) 2024 PaddlePaddle Authors. All Rights Reserved.

[Model] support other model #3786

[Model] support other model #3786

Uh oh!

Conversation

zhang-prog commented Sep 1, 2025

Uh oh!

paddle-bot bot commented Sep 1, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Copilot AI Sep 2, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Sep 2, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Sep 2, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Sep 2, 2025

Choose a reason for hiding this comment

Uh oh!

Jiang-Jia-Jun Sep 2, 2025

Choose a reason for hiding this comment

Uh oh!

zhang-prog commented Jan 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants