Add ONNX Sub Functions Export Feature for AutoModelForCausalLM by abhishek-singh591 · Pull Request #621 · quic/efficient-transformers

abhishek-singh591 · 2025-11-17T14:28:00Z

ONNX Functions Export Support

Overview

This PR introduces support for exporting ONNX modules as functions, enabling more efficient model compilation and execution on hardware.

Key Changes

Added a new flag use_onnx_subfunctions to control ONNX function export behavior.
Integrated ONNX function export capability into the inference pipeline.

How to Enable ONNX Function Export

Set the flag before running inference (either during export or compile):

model.export(tmp_path, use_onnx_subfunctions=True)

Backward Compatibility

This feature is opt-in and requires an explicit environment variable. Existing workflows remain unaffected when the flag is disabled.

vbaddi · 2025-11-18T04:44:23Z

Let's keep it uniform. Can we rename use_subfunctions to use_onnx_subfunctions?

abhishek-singh591 · 2025-11-18T05:57:04Z

Let's keep it uniform. Can we rename use_subfunctions to use_onnx_subfunctions?

done.

vbaddi · 2025-11-18T08:01:31Z

Let's keep it uniform. Can we rename use_subfunctions to use_onnx_subfunctions?

done.

Please modify the PR commit message and desp. accordingly. thanks

ochougul

review WIP.

Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com>

Fix for this JIRA from Imagine team Signed-off-by: Ann Kuruvilla <akuruvil@qti.qualcomm.com> Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com>

Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com>

quic-rishinr · 2025-11-19T07:51:22Z

@abhishek-singh591 please rebase the PR

vbaddi

LGTM, thanks 👍

Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com>

ochougul

Approving. Add todo for CustomOpTransform and merge once CI is passing.

Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com>

) This PR introduces support for exporting ONNX modules as **functions**, enabling more efficient model compilation and execution on hardware. - Added a new flag **`use_onnx_subfunctions`** to control ONNX function export behavior. - Integrated ONNX function export capability into the inference pipeline. Set the flag before running inference (either during export or compile): ```bash model.export(tmp_path, use_onnx_subfunctions=True) ``` This feature is **opt-in** and requires an explicit environment variable. Existing workflows remain unaffected when the flag is disabled. --------- Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com> Signed-off-by: Ann Kuruvilla <akuruvil@qti.qualcomm.com> Co-authored-by: quic-akuruvil <quic_akuruvil@quicinc.com> Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com>

) # ONNX Functions Export Support ## Overview This PR introduces support for exporting ONNX modules as **functions**, enabling more efficient model compilation and execution on hardware. ## Key Changes - Added a new flag **`use_onnx_subfunctions`** to control ONNX function export behavior. - Integrated ONNX function export capability into the inference pipeline. ## How to Enable ONNX Function Export Set the flag before running inference (either during export or compile): ```bash model.export(tmp_path, use_onnx_subfunctions=True) ``` ## Backward Compatibility This feature is **opt-in** and requires an explicit environment variable. Existing workflows remain unaffected when the flag is disabled. --------- Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com> Signed-off-by: Ann Kuruvilla <akuruvil@qti.qualcomm.com> Co-authored-by: quic-akuruvil <quic_akuruvil@quicinc.com> Signed-off-by: Dhiraj Kumar Sah <dhirajku@qti.qualcomm.com>

abhishek-singh591 requested review from ochougul, quic-amitraj, quic-hemagnih and quic-rishinr as code owners November 17, 2025 14:28

abhishek-singh591 force-pushed the add_subfunction branch from 92f320c to acec54f Compare November 17, 2025 14:30

vbaddi mentioned this pull request Nov 18, 2025

WIP: Feat: Add ONNX Sub Functions Export Feature #613

Closed

vbaddi assigned vbaddi and abhishek-singh591 Nov 18, 2025

vbaddi added 1.21.0 enhancement New feature or request labels Nov 18, 2025

quic-rishinr requested changes Nov 18, 2025

View reviewed changes

ochougul requested changes Nov 18, 2025

View reviewed changes

abhishek-singh591 and others added 5 commits November 19, 2025 06:52

pushed all changes for incoperating subfunction in CausalLM

aafe400

Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com>

Changed flag name from use_subfunctions to use_onnx_subfunctions

f0413d6

Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com>

Minor fixes

01a9696

Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com>

Fix for <end_of_turn> token during inference (quic#622)

219230a

Fix for this JIRA from Imagine team Signed-off-by: Ann Kuruvilla <akuruvil@qti.qualcomm.com> Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com>

Addressed all the comments

6daa209

Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com>

abhishek-singh591 force-pushed the add_subfunction branch from 65d24bc to 6daa209 Compare November 19, 2025 06:52

abhishek-singh591 requested review from ochougul and quic-rishinr November 19, 2025 07:01

ochougul requested changes Nov 19, 2025

View reviewed changes

Comment thread QEfficient/base/modeling_qeff.py Outdated

Comment thread QEfficient/base/modeling_qeff.py Outdated

Comment thread QEfficient/transformers/models/modeling_auto.py Outdated

Comment thread QEfficient/base/onnx_transforms.py

quic-rishinr requested changes Nov 19, 2025

View reviewed changes

Comment thread QEfficient/base/modeling_qeff.py Outdated

Merge branch 'quic:main' into add_subfunction

d798d1a

vbaddi approved these changes Nov 19, 2025

View reviewed changes

abhishek-singh591 added 2 commits November 19, 2025 10:02

Rebased and some other fixes

50fda72

Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com>

Rebased and some other fixes

f75a764

Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com>

ochougul approved these changes Nov 19, 2025

View reviewed changes

quic-rishinr approved these changes Nov 19, 2025

View reviewed changes

abhishek-singh591 added 2 commits November 19, 2025 10:50

Changed Custom_ops transform logic now adding all custom_ops proto.

13fe095

Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com>

Made Minor fixes

50a2917

Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com>

ochougul merged commit 30c334b into quic:main Nov 19, 2025
5 checks passed

ochougul mentioned this pull request Nov 20, 2025

Diffusers support #604

Merged

Conversation

abhishek-singh591 commented Nov 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

ONNX Functions Export Support

Overview

Key Changes

How to Enable ONNX Function Export

Backward Compatibility

Uh oh!

vbaddi commented Nov 18, 2025

Uh oh!

abhishek-singh591 commented Nov 18, 2025

Uh oh!

vbaddi commented Nov 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ochougul left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

quic-rishinr commented Nov 19, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

vbaddi left a comment

Choose a reason for hiding this comment

Uh oh!

ochougul left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

abhishek-singh591 commented Nov 17, 2025 •

edited

Loading

vbaddi commented Nov 18, 2025 •

edited

Loading