Skip to content

Conversation

@NicolaLancellotti
Copy link
Contributor

@NicolaLancellotti NicolaLancellotti commented Apr 4, 2023

Currently, only per-layer quantization is supported in the Arm Compute Library runtime. However, there is no check that prevents the offloading of per-channel quantized operators, as a consequence, tvm fails during inference if such operators are found.

This patch adds a check that prevents offloading per-channel quantized operators.

@tvm-bot
Copy link
Collaborator

tvm-bot commented Apr 4, 2023

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

Generated by tvm-bot

@NicolaLancellotti NicolaLancellotti changed the title [ACL] Prevent offloading of per-channel quantized operators. [ACL] Prevent offloading of per-channel quantized operators Apr 4, 2023
@NicolaLancellotti NicolaLancellotti force-pushed the acl/prevent-per-channel-quantizatio-offload branch from a16b15d to 21454d3 Compare April 4, 2023 13:32
Currently, only per-layer quantization is supported in the
Arm Compute Library runtime. However, there is no check that
prevents the offloading of per-channel quantized operators,
as a consequence, tvm fails during inference if such operators
are found.
@NicolaLancellotti NicolaLancellotti force-pushed the acl/prevent-per-channel-quantizatio-offload branch from 21454d3 to d5fe3be Compare April 5, 2023 07:35
Copy link
Contributor

@lhutton1 lhutton1 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @NicolaLancellotti, LGTM!

test_qnn_dense()
test_codegen_dense()
test_codegen_qnn_dense()
test_codegen_qnn_dense_per_channel_quantization()
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No need to add now, we should probably convert to the:

if __name__ == "__main__":
    tvm.testing.main()

format when we get a another chance

@lhutton1 lhutton1 merged commit e51ba29 into apache:main Apr 5, 2023
@lhutton1
Copy link
Contributor

lhutton1 commented Apr 5, 2023

Thanks @NicolaLancellotti!

@NicolaLancellotti NicolaLancellotti deleted the acl/prevent-per-channel-quantizatio-offload branch May 25, 2023 18:20
@NicolaLancellotti NicolaLancellotti restored the acl/prevent-per-channel-quantizatio-offload branch October 13, 2023 17:57
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants