[QNN Quant] Ensure 16bit tensor quant overrides set MS domain #19684

Merged
adrianlizarraga merged 3 commits into main from adrianl/set-ms-domain-from-tensor-quant-overrides
Feb 29, 2024

@adrianlizarraga adrianlizarraga commented Feb 28, 2024

Description

Ensures that DQ and Q ops use the MSFT domain (com.microsoft) if tensor quantization overrides specify 16-bit integer types.

Motivation and Context

ONNX does not yet support 16-bit integer types for the QuantizeLinear and DequantizeLinear ops (support is coming soon). For now, 16-bit DQ/Q ops must use the MSFT domain.

We must also check whether tensor quantization overrides force the use of 16-bit quantization types. If so, we must set the domain for Q/DQ ops accordingly.
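
The check described above can be sketched roughly as follows. This is a minimal, self-contained illustration and not the code merged in this PR: the `QuantType` enum is a stand-in for the quantizer's own type enum (its values here are arbitrary), the helper names `overrides_use_16bit` and `select_qdq_domain` are hypothetical, and the overrides are assumed to be a dict mapping tensor names to lists of dicts that may carry a `quant_type` key.

```python
from enum import Enum, auto
from typing import Any, Dict, List


class QuantType(Enum):
    # Stand-in for the quantizer's QuantType enum (assumption; values are arbitrary).
    QInt8 = auto()
    QUInt8 = auto()
    QInt16 = auto()
    QUInt16 = auto()


ONNX_DOMAIN = ""              # default ONNX operator domain
MS_DOMAIN = "com.microsoft"   # the MSFT domain referred to in this PR

SIXTEEN_BIT_TYPES = {QuantType.QInt16, QuantType.QUInt16}

# Assumed shape of the overrides: tensor name -> list of per-tensor/per-channel dicts.
TensorQuantOverrides = Dict[str, List[Dict[str, Any]]]


def overrides_use_16bit(overrides: TensorQuantOverrides) -> bool:
    """Return True if any override entry requests a 16-bit quantization type."""
    return any(
        entry.get("quant_type") in SIXTEEN_BIT_TYPES
        for entries in overrides.values()
        for entry in entries
    )


def select_qdq_domain(
    activation_type: QuantType,
    weight_type: QuantType,
    overrides: TensorQuantOverrides,
) -> str:
    """Pick the operator domain for the Q/DQ ops (hypothetical helper)."""
    needs_ms_domain = (
        activation_type in SIXTEEN_BIT_TYPES
        or weight_type in SIXTEEN_BIT_TYPES
        or overrides_use_16bit(overrides)
    )
    return MS_DOMAIN if needs_ms_domain else ONNX_DOMAIN


# The global types are 8-bit, but one tensor is overridden to 16-bit,
# so the Q/DQ ops need the MSFT domain.
overrides = {"conv_output": [{"quant_type": QuantType.QUInt16}]}
domain = select_qdq_domain(QuantType.QUInt8, QuantType.QUInt8, overrides)
```

The point of the sketch is that looking only at the global activation and weight types is not enough: a single 16-bit override on an otherwise 8-bit model still requires the MSFT domain.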

@adrianlizarraga adrianlizarraga marked this pull request as ready for review February 28, 2024 01:23

snnn commented Feb 28, 2024

/azp run Windows GPU TensorRT CI Pipeline

@azure-pipelines

Azure Pipelines successfully started running 1 pipeline(s).

@adrianlizarraga adrianlizarraga merged commit c1bf7fc into main Feb 29, 2024
@adrianlizarraga adrianlizarraga deleted the adrianl/set-ms-domain-from-tensor-quant-overrides branch February 29, 2024 09:19
zz002 pushed a commit to zz002/onnxruntime that referenced this pull request Mar 7, 2024
…oft#19684)
