Fix quantization dtypes after ORT PR #18043 by xadupre · Pull Request #881 · microsoft/Olive

xadupre · 2024-01-15T11:18:20Z

Describe your changes

PR microsoft/onnxruntime#18043 extends the onnxruntime quantization tools to support float16 weights. To do so, it requires scale and zero_point to be strongly typed (as numpy.array(single_value, dtype=dtype)). The scale type should always be the weight type, and the zero_point type should be the quantized weight type. This convention is checked throughout the quantization tools to make sure there is no loss of information. The change was made to avoid adding new arguments to many functions just to carry this information.
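The typing convention described above can be sketched with plain NumPy. This is only an illustration: `make_qparams` and `check_qparams` are hypothetical helper names introduced here, not functions from onnxruntime or Olive, and the float16/uint8 pairing is an assumed example.

```python
import numpy as np


def make_qparams(scale_value, zero_point_value, weight_dtype, quant_dtype):
    """Hypothetical helper: wrap single values as strongly typed 0-d arrays.

    scale carries the weight dtype, zero_point carries the quantized dtype.
    """
    scale = np.array(scale_value, dtype=weight_dtype)
    zero_point = np.array(zero_point_value, dtype=quant_dtype)
    return scale, zero_point


def check_qparams(scale, zero_point, weight_dtype, quant_dtype):
    """Hypothetical check: reject untyped or mistyped quantization parameters."""
    if not isinstance(scale, np.ndarray) or scale.dtype != np.dtype(weight_dtype):
        raise TypeError(f"scale must be a numpy array of dtype {np.dtype(weight_dtype)}")
    if not isinstance(zero_point, np.ndarray) or zero_point.dtype != np.dtype(quant_dtype):
        raise TypeError(f"zero_point must be a numpy array of dtype {np.dtype(quant_dtype)}")


# Assumed example: float16 weights quantized to uint8.
scale, zero_point = make_qparams(0.02, 128, np.float16, np.uint8)
check_qparams(scale, zero_point, np.float16, np.uint8)
```

Because the dtype travels with the array itself, a function receiving `scale` and `zero_point` can recover both the weight type and the quantized type without extra arguments. A plain Python float passed as `scale` would fail the check.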

Checklist before requesting a review

Add unit tests for this change.
Make sure all tests can pass.
Update documents if necessary.
Lint and apply fixes to your code by running lintrunner -a
Is this a user-facing change? If yes, give a description of this change to be included in the release notes.

(Optional) Issue link

jambayk · 2024-01-15T22:19:02Z

/azp run

azure-pipelines · 2024-01-15T22:19:14Z

Azure Pipelines successfully started running 1 pipeline(s).

olive/passes/onnx/vitis_ai/quant_utils.py

guotuofeng · 2024-01-16T08:48:06Z

/azp run

azure-pipelines · 2024-01-16T08:48:17Z

Azure Pipelines successfully started running 1 pipeline(s).

olive/passes/onnx/vitis_ai/quant_utils.py

guotuofeng · 2024-01-16T10:27:30Z

/azp run

azure-pipelines · 2024-01-16T10:27:43Z

Azure Pipelines successfully started running 1 pipeline(s).

guotuofeng · 2024-01-16T11:27:11Z

@sheng-xiao, would you please help take a look?

## Describe your changes

With PR #881, the CI pipeline for ORT 1.15 is broken. This PR is to fix the issue.

## Checklist before requesting a review

- [ ] Add unit tests for this change.
- [ ] Make sure all tests can pass.
- [ ] Update documents if necessary.
- [ ] Lint and apply fixes to your code by running `lintrunner -a`
- [ ] Is this a user-facing change? If yes, give a description of this change to be included in the release notes.

## (Optional) Issue link

Fix quantization dtypes after ORT PR #18043

115fcc5

guotuofeng reviewed Jan 16, 2024


olive/passes/onnx/vitis_ai/quant_utils.py (outdated)

fix for ort 1.16, 1.17

4659cfc

guotuofeng reviewed Jan 16, 2024


olive/passes/onnx/vitis_ai/quant_utils.py (outdated)

add a test with Overflow + fix the code

7c5ae4d

guotuofeng approved these changes Jan 17, 2024


guotuofeng merged commit aacc65c into microsoft:main Jan 17, 2024

guotuofeng mentioned this pull request Jan 18, 2024

fix vitis CI broken caused by calibrate API changes #886

Merged

5 tasks

guotuofeng merged 3 commits into microsoft:main from xadupre:qu

3 participants
