VLM Pipeline for Model Onboarding through QEff #261
qcdipankar wants to merge 4 commits into quic:main
Conversation
Signed-off-by: Dipankar Sarkar <quic_dipankar@quicinc.com>
Signed-off-by: Dipankar Sarkar <quic_dipankar@quicinc.com>
Signed-off-by: Dipankar Sarkar <quic_dipankar@quicinc.com>
import torch.nn.functional as F
import torch.utils.checkpoint
import transformers
from einops import rearrange
Not needed, please remove
# if repl_module := cls._module_mapping.get(type(module)):
if repl_module := cls._module_mapping.get(module.__class__.__name__):
    module.__class__ = repl_module
    # Handling the __init__ calls in the models
    if hasattr(module, "__qeff_init__"):
        module.__qeff_init__()
    transformed = True
create a new transform named like:

from typing import Dict, Type

import torch.nn as nn


class ModuleMappingViaStringAndClassMatchTransform:
    _module_mapping_via_class: Dict[Type[nn.Module], Type[nn.Module]]
    _module_mapping_via_string: Dict[str, Type[nn.Module]]

    @classmethod
    def apply(cls, model):
        transformed = False
        for module in model.modules():
            if repl_module := cls._module_mapping_via_class.get(type(module)):
                # replace the class here
                module.__class__ = repl_module
                transformed = True
            elif repl_module := cls._module_mapping_via_string.get(module.__class__.__name__):
                # replace the class here
                module.__class__ = repl_module
                transformed = True
        return model, transformed

Create two different dicts basically.
And write a test that makes sure the keys of the two dicts don't overlap.
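A minimal sketch of such a test, assuming the two dicts are class attributes exactly as named above (the test name is illustrative):

def test_module_mapping_keys_do_not_overlap():
    # Class-keyed entries are nn.Module subclasses; string-keyed entries are class names.
    # The same module must not be reachable through both mappings.
    class_key_names = {
        klass.__name__
        for klass in ModuleMappingViaStringAndClassMatchTransform._module_mapping_via_class
    }
    string_keys = set(ModuleMappingViaStringAndClassMatchTransform._module_mapping_via_string)
    assert class_key_names.isdisjoint(string_keys)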
def get_num_layers_vlm(config):
    if hasattr(config, "architectures") and "LlavaForConditionalGeneration" in config.architectures:
        num_layers = config.text_config.num_hidden_layers
        return num_layers
Can't we reuse the existing method named get_num_layers_from_config and pass model.config.text_config to it?
In some models it is text_config, in others it is llm_config, txt_config, etc. Hence adding it as a new function for the VLM architectures.
Let's keep it separate to avoid cluttering the existing function with multiple conditions?
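For illustration, a hedged sketch of keeping the per-architecture lookup inside the new VLM helper instead of extending get_num_layers_from_config; the llm_config branch is an assumption based on the comment above, not code from this PR:

def get_num_layers_vlm(config):
    # Each VLM family nests its language-model config under a different attribute name.
    if hasattr(config, "text_config"):   # e.g. LlavaForConditionalGeneration
        return config.text_config.num_hidden_layers
    if hasattr(config, "llm_config"):    # e.g. InternVL-style configs (assumed name)
        return config.llm_config.num_hidden_layers
    raise ValueError(f"Unsupported VLM config: {type(config).__name__}")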
def get_padding_shape_vlm(config, batch_size=1):
    if hasattr(config, "architectures") and "LlavaForConditionalGeneration" in config.architectures:
        n_heads = config.text_config.num_key_value_heads
        d_head = config.text_config.hidden_size // config.text_config.num_attention_heads
        padding_shape = [batch_size, n_heads, Constants.CTX_LEN_VLM, d_head]
        return padding_shape
Same comment as above: is this new method required? Can't we reuse the existing one?
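For context, a hedged sketch of how such a padding shape is typically consumed when pre-allocating KV-cache inputs; the exact call sites in this PR may differ:

import torch

padding_shape = get_padding_shape_vlm(config, batch_size=1)  # [batch, n_heads, CTX_LEN_VLM, d_head]
past_key_values = [
    (torch.zeros(padding_shape), torch.zeros(padding_shape))  # one (key, value) pair per decoder layer
    for _ in range(get_num_layers_vlm(config))
]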
# InternVL
"InternVLChatModel": QEffInternVLChatModel,
"InternVisionEmbeddings": QEffInternVisionEmbeddings,
Please create a different transform as mentioned above and separate out this dict.
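A hedged sketch of the suggested split, reusing the entries from the diff above (the transformers-side example in the class-keyed dict is a placeholder, not code from this PR):

_module_mapping_via_string = {
    # Models not shipped in transformers are matched by class name.
    "InternVLChatModel": QEffInternVLChatModel,
    "InternVisionEmbeddings": QEffInternVisionEmbeddings,
}

_module_mapping_via_class = {
    # Modules importable from transformers keep the existing type-keyed mapping,
    # e.g. LlamaAttention: QEffLlamaAttention.
}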
model_config["n_layer_text"] = 1
model_config["n_layer_vision"] = 1
This should not go in library code; it is allowed only in tests.
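A minimal sketch of moving the single-layer override into a test, assuming a pytest-style fixture (the fixture and base-config names are hypothetical):

import pytest

@pytest.fixture
def tiny_vlm_model_config(base_vlm_model_config):
    model_config = dict(base_vlm_model_config)
    model_config["n_layer_text"] = 1    # shrink the text tower so tests stay fast
    model_config["n_layer_vision"] = 1  # shrink the vision tower so tests stay fast
    return model_config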
Signed-off-by: Dipankar Sarkar <quic_dipankar@quicinc.com>
Already addressed in #267
Features Added
1. Original modeling files removed for Intern; generic solution for models that are not part of transformers.
2. Used a model wrapper inside the modeling files to hold the generate_inputs functions; calls are dispatched based on the model loaded from pretrained (a hedged sketch follows this list).
3. Constants file updated.
4. Removed PyTorch generate from modeling_auto.
5. General clean-up of the code.
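A minimal sketch of the wrapper idea from item 2, assuming name-based dispatch similar to the string-keyed module mapping discussed above; the class, method, and builder names are illustrative, not the PR's actual API:

class ModelWrapper:
    """Wraps a loaded pretrained model and exposes model-specific input generation."""

    def __init__(self, model):
        self.model = model

    def generate_inputs(self, processor, prompt, image):
        # Dispatch on the wrapped model's class name.
        arch = self.model.__class__.__name__
        builder = getattr(self, f"_inputs_{arch.lower()}", None)
        if builder is None:
            raise NotImplementedError(f"No input generator registered for {arch}")
        return builder(processor, prompt, image)

    def _inputs_llavaforconditionalgeneration(self, processor, prompt, image):
        # Standard HF processor call for LLaVA-style models.
        return processor(images=image, text=prompt, return_tensors="pt")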
Tested and Verified on
TODO