Arm backend: Add 16A8W support and test for cat operation #13800
facebook-github-bot merged 8 commits into gh/Ninja91/13/base
Conversation
Add 16A8W quantization support and test for the cat operation in ExecutorTorch ARM backend. This follows the pattern established for linear, mul, sigmoid, tanh, slice, and view/transpose operations, extending int16 support to cat operations.

Changes:
- Add test_cat_tensor_16a8w_tosa_INT test function
- Enable test_cat.py in test targets configuration

The 16A8W configuration uses 16-bit activations with 8-bit weights, enabling higher precision for activations while maintaining weight efficiency.

Differential Revision: [D80511455](https://our.internmc.facebook.com/intern/diff/D80511455/)
See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/13800. Note: links to docs will display an error until the docs builds have been completed.

As of commit 6df79e2 with merge base 1d37845: 4 new failures, 7 unrelated failures.

- NEW FAILURES: the following jobs have failed.
- BROKEN TRUNK: the following jobs failed but were also present on the merge base. Rebase onto the `viable/strict` branch to avoid these failures.

This comment was automatically generated by Dr. CI and updates every 15 minutes.
This pull request was exported from Phabricator. Differential Revision: D80511455
Merged commit f9e3133 into gh/Ninja91/13/base.
This PR was created by the merge bot to help merge the original PR into the main branch.

ghstack PR number: #13800 by @Ninja91 (please use this as the source of truth for the PR details, comments, and reviews)
ghstack PR base: https://github.com/pytorch/executorch/tree/gh/Ninja91/13/base
ghstack PR head: https://github.com/pytorch/executorch/tree/gh/Ninja91/13/head
Merge bot PR base: https://github.com/pytorch/executorch/tree/gh/Ninja91/12/orig
Merge bot PR head: https://github.com/pytorch/executorch/tree/gh/Ninja91/13/orig

@diff-train-skip-merge

Co-authored-by: Nitin Jain <jainnitin@meta.com>
Stack from ghstack (oldest at bottom):
Add 16A8W quantization support and test for the cat operation in ExecutorTorch ARM backend.
This follows the pattern established for linear, mul, sigmoid, tanh, slice, and view/transpose operations, extending int16 support to cat operations.
Changes:
- Add test_cat_tensor_16a8w_tosa_INT test function
- Enable test_cat.py in test targets configuration
The 16A8W configuration uses 16-bit activations with 8-bit weights, enabling higher precision for activations while maintaining weight efficiency.
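To make the precision trade-off concrete, here is a minimal, self-contained sketch of 16A8W-style quantization arithmetic in plain PyTorch. It is not the ExecuTorch/TOSA quantizer API; `quantize_symmetric` is a hypothetical helper, and the code only illustrates why a 16-bit activation grid round-trips with far less error than an 8-bit weight grid.

```python
# Illustrative sketch of 16A8W quantization arithmetic: 16-bit activations,
# 8-bit weights. Hypothetical helper, NOT the ExecuTorch quantizer API.
import torch

torch.manual_seed(0)

def quantize_symmetric(x: torch.Tensor, n_bits: int):
    """Symmetric per-tensor quantization onto a signed n_bits integer grid."""
    qmax = 2 ** (n_bits - 1) - 1          # 127 for int8, 32767 for int16
    scale = (x.abs().max().clamp(min=1e-8) / qmax).item()
    q = torch.clamp(torch.round(x / scale), -qmax - 1, qmax)
    return q, scale

weights = torch.randn(4, 4)
activations = torch.randn(4, 4)

w_q, w_scale = quantize_symmetric(weights, n_bits=8)       # 8-bit weights
a_q, a_scale = quantize_symmetric(activations, n_bits=16)  # 16-bit activations

# Round-trip (dequantize) error is bounded by ~scale/2, so the finer
# 16-bit activation grid yields a much smaller worst-case error.
w_err = (w_q * w_scale - weights).abs().max()
a_err = (a_q * a_scale - activations).abs().max()
```

With typical inputs, `a_err` is orders of magnitude smaller than `w_err`, which is the motivation for spending the extra bits on activations while keeping weights at 8 bits.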
Differential Revision: D80511455
cc @digantdesai @freddan80 @per @zingo @oscarandersson8218
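One subtlety that makes quantized `cat` worth testing: the inputs may carry different quantization scales, so they must be rescaled onto a common output grid before concatenation. The sketch below shows that idea in plain PyTorch with int16 grids; `requantize` is a hypothetical helper, and this is not ExecuTorch's actual lowering.

```python
# Hedged sketch: quantized `cat` over int16 inputs with different scales.
# Each operand is requantized to a shared output scale before concatenation.
# Hypothetical helper names; NOT the ExecuTorch/TOSA implementation.
import torch

torch.manual_seed(0)
QMAX16 = 2 ** 15 - 1  # 32767

def requantize(q: torch.Tensor, in_scale: float, out_scale: float, qmax: int):
    """Map integer values from one quantization grid onto another."""
    return torch.clamp(torch.round(q * (in_scale / out_scale)), -qmax - 1, qmax)

x = torch.randn(2, 3)
y = torch.randn(2, 3) * 5.0   # larger dynamic range -> larger scale

sx = x.abs().max().item() / QMAX16
sy = y.abs().max().item() / QMAX16
qx = torch.round(x / sx)
qy = torch.round(y / sy)

# The output scale must cover both inputs; requantize each operand to it.
s_out = max(sx, sy)
out_q = torch.cat([requantize(qx, sx, s_out, QMAX16),
                   requantize(qy, sy, s_out, QMAX16)], dim=0)

# Dequantized result stays within about one quantization step of the
# float reference.
ref = torch.cat([x, y], dim=0)
err = (out_q * s_out - ref).abs().max()
```

The test added in this PR exercises exactly this path at the TOSA INT level, checking that cat's int16 outputs stay numerically faithful to the float reference.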