[ACL] Prevent offloading of per-channel quantized operators #14484

NicolaLancellotti · 2023-04-04T13:28:38Z

Currently, only per-layer quantization is supported in the Arm Compute Library runtime. However, there is no check that prevents the offloading of per-channel quantized operators, as a consequence, tvm fails during inference if such operators are found.

This patch adds a check that prevents offloading per-channel quantized operators.

tvm-bot · 2023-04-04T13:28:43Z

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

cc @ashutosh-arm, @leandron, @lhutton1 _{See #10317 for details}

_{Generated by tvm-bot}

Currently, only per-layer quantization is supported in the Arm Compute Library runtime. However, there is no check that prevents the offloading of per-channel quantized operators, as a consequence, tvm fails during inference if such operators are found.

lhutton1

Thanks @NicolaLancellotti, LGTM!

lhutton1 · 2023-04-05T12:32:46Z

tests/python/contrib/test_arm_compute_lib/test_dense.py

    test_qnn_dense()
    test_codegen_dense()
    test_codegen_qnn_dense()
+    test_codegen_qnn_dense_per_channel_quantization()


No need to add now, we should probably convert to the:

if __name__ == "__main__": tvm.testing.main()

format when we get a another chance

lhutton1 · 2023-04-05T14:32:57Z

Thanks @NicolaLancellotti!

NicolaLancellotti changed the title ~~[ACL] Prevent offloading of per-channel quantized operators.~~ [ACL] Prevent offloading of per-channel quantized operators Apr 4, 2023

NicolaLancellotti force-pushed the acl/prevent-per-channel-quantizatio-offload branch from a16b15d to 21454d3 Compare April 4, 2023 13:32

NicolaLancellotti force-pushed the acl/prevent-per-channel-quantizatio-offload branch from 21454d3 to d5fe3be Compare April 5, 2023 07:35

lhutton1 approved these changes Apr 5, 2023

View reviewed changes

lhutton1 merged commit e51ba29 into apache:main Apr 5, 2023

ysh329 mentioned this pull request Apr 17, 2023

[Release] v0.12.0 Release Candidate Notes #14645

Closed

NicolaLancellotti deleted the acl/prevent-per-channel-quantizatio-offload branch May 25, 2023 18:20

NicolaLancellotti restored the acl/prevent-per-channel-quantizatio-offload branch October 13, 2023 17:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ACL] Prevent offloading of per-channel quantized operators #14484

[ACL] Prevent offloading of per-channel quantized operators #14484

Uh oh!

NicolaLancellotti commented Apr 4, 2023 •

edited

Loading

Uh oh!

tvm-bot commented Apr 4, 2023

Uh oh!

lhutton1 left a comment

Uh oh!

lhutton1 Apr 5, 2023

Uh oh!

lhutton1 commented Apr 5, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[ACL] Prevent offloading of per-channel quantized operators #14484

[ACL] Prevent offloading of per-channel quantized operators #14484

Uh oh!

Conversation

NicolaLancellotti commented Apr 4, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tvm-bot commented Apr 4, 2023

Uh oh!

lhutton1 left a comment

Choose a reason for hiding this comment

Uh oh!

lhutton1 Apr 5, 2023

Choose a reason for hiding this comment

Uh oh!

lhutton1 commented Apr 5, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

NicolaLancellotti commented Apr 4, 2023 •

edited

Loading