fix eval_llama_qnn custom annotation#15953
Conversation
Hi @cccclai, the previous PR #15807 moved all LLM quantization-related configs into the quantization recipe. This fix updates eval_llama_qnn to retrieve the custom annotation from the quantization recipe accordingly. Please have a look, thanks!
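The change described above follows a common refactoring pattern: configuration that used to live in a separate, free-floating config now hangs off the quantization recipe object, and the eval script reads it from there. The sketch below illustrates that pattern only; `QuantRecipe`, `custom_annotations`, and `run_eval` are hypothetical names for illustration, not the actual executorch API.

```python
# Minimal sketch of "retrieve custom annotation from the quantization recipe".
# All names here are illustrative stand-ins, not real executorch symbols.
from dataclasses import dataclass, field
from typing import Callable, List


def annotate_linear(node_name: str) -> str:
    # Placeholder for a custom annotation hook (e.g. marking linear
    # layers for 16a4w quantization).
    return f"annotated:{node_name}"


@dataclass
class QuantRecipe:
    # Stand-in for the quantization recipe that now owns the
    # LLM quantization configs, including custom annotations.
    ptq: str = "16a4w"
    custom_annotations: List[Callable[[str], str]] = field(default_factory=list)


def run_eval(recipe: QuantRecipe) -> List[str]:
    # The fix: read custom annotations off the recipe instead of a
    # separate config that no longer exists after the refactor.
    return [annotate(node) for annotate in recipe.custom_annotations
            for node in ("linear",)]


recipe = QuantRecipe(custom_annotations=[annotate_linear])
print(run_eval(recipe))  # -> ['annotated:linear']
```

The point is simply that callers no longer pass the annotation separately; they hand over the recipe and the eval path pulls everything it needs from it.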
@pytorchbot label "release notes: qualcomm"
lint is failing, can you fix it? |
Done, thanks!
Thank you |
### Summary
Fix eval_llama_qnn: retrieve custom annotation from quantization recipe

### Test plan
```bash
python -m executorch.examples.qualcomm.oss_scripts.llama.eval_llama_qnn --decoder_model qwen2_5-0_5b --quant_linear_only --max_seq_length 1024 --ptq 16a4w
```