Add SplitTensorsTransform to QEFFAutoModel to prevent >2GB protobuf export issue by quic-rishinr · Pull Request #950 · quic/efficient-transformers

quic-rishinr · 2026-04-28T08:02:44Z

Add SplitTensorsTransform to QEFFAutoModel to prevent >2GB protobuf exports

FP16ClipTransform inlines external weights, causing large embedding
models (e.g. BAAI/bge-reranker-v2-m3) to exceed the 2GB ModelProto
parser limit in the AIC compiler.

Adding SplitTensorsTransform to _onnx_transforms spills large
initializers to sidecar *.onnx.data files, so the serialized ModelProto
stays under that limit. Existing tests are updated, and regression tests
are added to verify the external-data spilling behavior.
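
To make the size accounting concrete, here is a minimal sketch in plain Python. It is not the SplitTensorsTransform implementation from this PR: the `Initializer` class, `plan_split`, `inline_bytes`, and all tensor names and sizes are hypothetical and chosen only for illustration. It assumes a byte threshold at or above which a tensor is moved to a sidecar file, similar in spirit to the `size_threshold` argument of `onnx.save_model(..., save_as_external_data=True)`.

```python
from dataclasses import dataclass

# Serialized protobuf messages must stay below 2 GiB.
PROTOBUF_LIMIT_BYTES = 2 * 1024**3


@dataclass
class Initializer:
    """Hypothetical stand-in for an ONNX initializer: a name and its raw size."""
    name: str
    nbytes: int


def plan_split(initializers, size_threshold=1024):
    """Partition initializers into (inline, external) lists.

    Tensors whose raw data is >= size_threshold bytes are marked for a
    sidecar file; smaller ones stay inside the model file.
    """
    inline, external = [], []
    for init in initializers:
        if init.nbytes >= size_threshold:
            external.append(init)
        else:
            inline.append(init)
    return inline, external


def inline_bytes(initializers):
    """Total raw bytes that would be embedded in the model file."""
    return sum(init.nbytes for init in initializers)


# Hypothetical sizes, not measured from any real model.
weights = [
    Initializer("embeddings.weight", 1_200_000_000),
    Initializer("encoder.weights", 1_000_000_000),
    Initializer("classifier.bias", 8),
]

# With everything inlined, the payload alone exceeds the limit.
all_inline_fits = inline_bytes(weights) < PROTOBUF_LIMIT_BYTES

# After the split, only the tiny tensor stays in the model file.
inline, external = plan_split(weights)
split_fits = inline_bytes(inline) < PROTOBUF_LIMIT_BYTES

print(f"all inline fits: {all_inline_fits}, after split fits: {split_fits}")
print("spilled to sidecar:", [init.name for init in external])
```

The sketch only models which tensors would be spilled and how much data remains inline; writing the sidecar files and rewriting tensor references is left to the real ONNX tooling.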

Signed-off-by: Rishin Raj <rishinr@qti.qualcomm.com>

quic-rishinr · 2026-04-29T10:41:20Z

CI-Ready

vbaddi

@quic-rishinr as discussed, let's disable the split transform and fix the onnx.save issue. Thanks.

Commit 32c3d3c: Add SplitTensorsTransform to QEFFAutoModel to prevent >2GB protobuf exports

Signed-off-by: Rishin Raj <rishinr@qti.qualcomm.com>

quic-rishinr requested review from asmigosw and vbaddi April 28, 2026 08:02

vbaddi requested changes Apr 30, 2026

quic-rishinr wants to merge 1 commit into quic:main from quic-rishinr:auto_model_proto_export
