[QNN Quant] Ensure 16bit tensor quant overrides set MS domain by adrianlizarraga · Pull Request #19684 · microsoft/onnxruntime

adrianlizarraga · 2024-02-28T01:19:51Z

Description

Ensures that DQ and Q ops use the MSFT domain (`com.microsoft`) if tensor quantization overrides specify 16-bit integer types.

Motivation and Context

ONNX does not yet support 16-bit integer types for the QuantizeLinear and DequantizeLinear ops (coming soon). For now, 16-bit DQ/Q ops must use the MSFT domain.

We must also check whether tensor quantization overrides force the use of 16-bit quantization types. If so, the domain for Q/DQ ops must be set accordingly.
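
The check described above can be sketched roughly as follows. This is a minimal illustration, not the code changed in this PR: the `QuantType` enum is a local stand-in for the one in `onnxruntime.quantization` (its numeric values here are arbitrary), the helper names `overrides_use_16bit` and `select_qdq_domain` are hypothetical, and the overrides are assumed to be a dict mapping each tensor name to a list of dicts that may carry a `quant_type` key.

```python
from enum import Enum


class QuantType(Enum):
    # Local stand-in for onnxruntime.quantization.QuantType (values are arbitrary).
    QInt8 = "int8"
    QUInt8 = "uint8"
    QInt16 = "int16"
    QUInt16 = "uint16"


ONNX_DOMAIN = ""              # default ONNX operator domain
MS_DOMAIN = "com.microsoft"   # MSFT domain for contrib ops

SIXTEEN_BIT_TYPES = {QuantType.QInt16, QuantType.QUInt16}


def overrides_use_16bit(tensor_quant_overrides):
    """Return True if any per-tensor override requests a 16-bit quant type."""
    for per_tensor_overrides in tensor_quant_overrides.values():
        for override in per_tensor_overrides:
            if override.get("quant_type") in SIXTEEN_BIT_TYPES:
                return True
    return False


def select_qdq_domain(activation_type, weight_type, tensor_quant_overrides=None):
    """Pick the operator domain for Q/DQ nodes (hypothetical helper)."""
    # The globally configured types may already require the MSFT domain.
    if activation_type in SIXTEEN_BIT_TYPES or weight_type in SIXTEEN_BIT_TYPES:
        return MS_DOMAIN
    # Overrides can force 16-bit types even when the global types are 8-bit.
    if overrides_use_16bit(tensor_quant_overrides or {}):
        return MS_DOMAIN
    return ONNX_DOMAIN


# Global types are 8-bit, but one tensor is overridden to uint16.
overrides = {"conv_out": [{"quant_type": QuantType.QUInt16}]}
domain = select_qdq_domain(QuantType.QUInt8, QuantType.QUInt8, overrides)
```

In this sketch the override check runs only after the global types have been ruled out, so a model quantized entirely in 8 bits keeps the default ONNX domain.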

onnxruntime/python/tools/quantization/onnx_quantizer.py

snnn · 2024-02-28T01:40:56Z

/azp run Windows GPU TensorRT CI Pipeline

azure-pipelines · 2024-02-28T01:41:04Z

Azure Pipelines successfully started running 1 pipeline(s).

Ensure 16bit tensor quant overrides set MS domain

3aaa783

adrianlizarraga commented Feb 28, 2024

onnxruntime/python/tools/quantization/onnx_quantizer.py

adrianlizarraga requested a review from xadupre February 28, 2024 01:23

adrianlizarraga marked this pull request as ready for review February 28, 2024 01:23

Merge branch 'main' into adrianl/set-ms-domain-from-tensor-quant-over…

3b3468c

…rides

adrianlizarraga requested review from HectorSVC and jywu-msft February 28, 2024 16:33

xadupre approved these changes Feb 28, 2024

Merge latest main commits to fix pipeline

a5140c0

adrianlizarraga merged commit c1bf7fc into main Feb 29, 2024

adrianlizarraga deleted the adrianl/set-ms-domain-from-tensor-quant-overrides branch February 29, 2024 09:19

adrianlizarraga merged 3 commits into main from adrianl/set-ms-domain-from-tensor-quant-overrides

3 participants