Updated reduce sum calculation to use einsum for gpt_oss by asmigosw · Pull Request #754 · quic/efficient-transformers

asmigosw · 2026-01-23T08:39:04Z

The decode-only GPT-OSS model was failing when executing subfunctions because the reduce-sum calculation was apparently picking up a dynamic dimension value. This caused incorrect tensor reduction and resulted in compilation errors.
The fix replaces the reduction logic with an einsum-based computation, so the summation stays stable and deterministic regardless of the dimension's shape.

Signed-off-by: asmigosw <asmigosw@qti.qualcomm.com>
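
For readers unfamiliar with the pattern, here is a minimal sketch of what swapping a multiply-then-reduce-sum for an einsum contraction can look like. It is not the code from this PR: the function names, the tensor layout (experts, tokens, hidden) and the use of NumPy are assumptions made for illustration. `torch.einsum` accepts the same subscript string.

```python
import numpy as np


def weighted_sum_reduce(expert_out, routing_weights):
    """Multiply, then reduce over the expert axis with an explicit sum.

    expert_out:      (num_experts, num_tokens, hidden)  -- hypothetical layout
    routing_weights: (num_tokens, num_experts)          -- hypothetical layout
    """
    # Broadcast the weights to (num_experts, num_tokens, 1) before scaling.
    scaled = expert_out * routing_weights.T[:, :, None]
    # The reduction axis is passed explicitly to sum().
    return scaled.sum(axis=0)


def weighted_sum_einsum(expert_out, routing_weights):
    """Same contraction written as a single einsum.

    The index 'e' appears in both inputs but not in the output, so it is
    summed out; no separate reduction axis argument is involved.
    """
    return np.einsum("eth,te->th", expert_out, routing_weights)


rng = np.random.default_rng(0)
expert_out = rng.standard_normal((4, 3, 8))
routing_weights = rng.standard_normal((3, 4))

reduced = weighted_sum_reduce(expert_out, routing_weights)
contracted = weighted_sum_einsum(expert_out, routing_weights)

# Both formulations produce a (num_tokens, hidden) result with the same values.
assert reduced.shape == contracted.shape == (3, 8)
assert np.allclose(reduced, contracted)
```

In the einsum form the summed index is named in the subscript string rather than supplied as a separate `dim`/`axis` argument, which is one plausible reason it behaves more predictably when shapes are traced during export.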


asmigosw requested review from ochougul, quic-amitraj, quic-hemagnih and quic-rishinr as code owners January 23, 2026 08:39

Updated reduce sum calculation to use einsum

6f6ddc2

Signed-off-by: asmigosw <asmigosw@qti.qualcomm.com>

asmigosw force-pushed the gpt_oss_fix branch from 44a9db1 to 6f6ddc2 on January 23, 2026 08:46

Ruff format

b164735

Signed-off-by: asmigosw <asmigosw@qti.qualcomm.com>

vbaddi approved these changes Jan 23, 2026

ochougul merged commit 742b7bd into quic:main Jan 27, 2026
4 checks passed

ochougul merged 2 commits into quic:main from asmigosw:gpt_oss_fix