[QNN] Quantize - Fixing the sequence of lowering. #4316

anijain2305 · 2019-11-12T20:02:01Z

The Quantize op converts an FP32 tensor to an Int8 tensor. This PR delays the rounding and the final cast to integer for as long as possible. Previously, the cast happened before the zero point was added.

This does not change the output, since zero points are integers, but it is the right thing to do from a mathematical perspective. No new tests are required for this.
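To make the reordering concrete, here is a minimal sketch in plain Python rather than the actual Relay lowering. The function names, the scalar inputs, and the exact position of the clamp are assumptions made for illustration; the PR text only states that rounding and casting now happen after the zero point is added. The sample inputs stay away from exact .5 ties, where the result can depend on the rounding mode.

```python
def quantize_cast_early(x, scale, zero_point, qmin=-128, qmax=127):
    """Earlier order: round and cast to int first, then add the zero point."""
    q = int(round(x / scale)) + zero_point
    return max(qmin, min(qmax, q))


def quantize_cast_late(x, scale, zero_point, qmin=-128, qmax=127):
    """Order described in this PR: add the zero point in float, round and cast last."""
    shifted = x / scale + zero_point   # still a floating-point value
    q = round(shifted)                 # Python's built-in round (half-to-even), an assumption
    return int(max(qmin, min(qmax, q)))


# Hypothetical sample values; both orders agree on all of them.
samples = [0.3, -1.26, 2.71, 100.0]
for x in samples:
    assert quantize_cast_early(x, 0.1, 3) == quantize_cast_late(x, 0.1, 3)
```

Because the zero point is an integer, shifting by it before or after rounding lands on the same integer for these inputs, which is the behaviour the description relies on.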

This PR also allows int32 as an output type. This is common for quantized biases, which are int32 instead of int8.
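The following sketch illustrates why an int32 output type matters for a bias. It is plain Python, not the TVM API: the helper `quantize_to`, the `INT_RANGES` table, and the sample values are hypothetical. The choice of bias scale as input scale times weight scale with a zero point of 0 is a common convention in quantized models and is assumed here, not taken from this PR.

```python
# Representable ranges for the output types discussed (illustrative table).
INT_RANGES = {
    "int8": (-128, 127),
    "uint8": (0, 255),
    "int32": (-2**31, 2**31 - 1),
}


def quantize_to(values, scale, zero_point, out_dtype):
    """Quantize a list of floats, clamping to the range of out_dtype."""
    qmin, qmax = INT_RANGES[out_dtype]
    out = []
    for v in values:
        q = round(v / scale + zero_point)  # round after adding the zero point
        out.append(max(qmin, min(qmax, q)))
    return out


# Assumed convention: bias_scale = input_scale * weight_scale, zero point 0.
input_scale, weight_scale = 0.5, 0.0078125   # powers of two keep the arithmetic exact
bias_scale = input_scale * weight_scale      # 1 / 256
bias = [0.75, -1.5, 2.0]

bias_int32 = quantize_to(bias, bias_scale, 0, "int32")
bias_int8 = quantize_to(bias, bias_scale, 0, "int8")
```

With such a small scale the quantized bias values fall well outside the int8 range, so an int8 output would saturate while int32 keeps them intact.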

@zhiics

zhiics

LGTM

* [TOPI][OP] Support Faster-RCNN Proposal OP on CPU (apache#4297)
  * Support Proposal operator on CPU.
  * PyLint space issue
  * PyLint space issue
  * Pylint singleton-comparison issue
* [QNN][Legalize] Specialize for Platforms without any fast Int8 arithmetic units. (apache#4307)
* fix error when memory_id is VTA_MEM_ID_OUT (apache#4330)
* [CI][DOCKER] Add ONNX runtime dep (apache#4314)
  * [DOCKER] Add ONNX runtime dep
  * Improve ci script
* [QNN] Quantize - Fixing the sequence of lowering. (apache#4316)
* [QNN] Use Int16 upcast in Fallback Conv2D. Fix test names. (apache#4329)
* [doc][fix] fix sphinx parsing for pass infra tutorial (apache#4337)
* change ci image version (apache#4313)
* [Codegen] remove fp16 function override for cuda (apache#4331)
  * add volatile override back
  * [codegen] remove fp16 function override for cuda
* [CI] Set workspace to be per executor (apache#4336)
* [Build][Windows] Fix Windows build by including cctype (apache#4319)
  * Fix build
  * dummy change to retrigger CI
  * dummy change to retrigger ci
  * dummy change to retrigger ci
* Enable hipModuleGetGlobal() (apache#4321)
* [Relay][Pass] Add pass to remove unused functions in relay module (apache#4334)
  * [Relay][Pass] Add pass to remove unused functions in relay module
  * Add tests
  * Fix lint
  * Fix visit order
  * Add pass argument
  * Fix
* Add support for quant. mul operator in tflite frontend (apache#4283)
  A test for qnn_mul has to be added when the qnn elemwise tests (apache#4282) get merged.
* Add topi.nn.fifo_buffer to TVM doc (apache#4343)
* Solve custom model of prelu (apache#4326)
* Deprecate NNVM warning msg (apache#4333)
* [Contrib] Add MKL DNN option (apache#4323)
  * [Contrib] Add MKL DNN
  * update
  * update
* [Relay][Frontend][TF] Fix transpose when axes is not a param (apache#4327)
  * [Relay][Frontend][TF] Use _infer_value_simulated when axes is not a const to Transpose
  * uncomment tests
  * dummy change to retrigger ci
* [RUNTIME] Add device query for AMD GcnArch (apache#4341)
  * add gcnArch query
  * kGcnArch query for cuda is a no-op
* [Test][Relay][Pass] Add test case for lambda lift (apache#4317)
* [Relay][Frontend][ONNX] operator support: DepthToSpace, SpaceToDepth (apache#4271)
* imp module is deprecated (apache#4275)
* [VTA] Bug fix for padded load with large inputs (apache#4293)
  * bug fix for padded load with large inputs
  * Update TensorLoad.scala
  * Update test_vta_insn.py
* fix inconsistent tag name (apache#4134)
* [CodeGen] Add build config option disable_assert to control whether to generate assert (apache#4340)
* Bump up CUDA log version in tophub.py (apache#4347)
* Add check to ensure input file was successfully opened in NNVM deploy code demo (apache#4315)
* [COMMUNITY] Add DISCLAIMER, KEYS for ASF release (apache#4345)
  * [COMMUNITY] Add DISCLAIMER, KEYS for ASF release
  * Add file name spec
* [Relay][VM][Interpreter] Enable first-class constructors in VM and interpreter via eta expansion (apache#4218)
  * Fix constructor pretty printing
  * Make Module::HasDef name consistent with API
  * Add VM constructor compilation via eta expansion
  * Lint
  * Fix CI
  * Fix failing test
  * Address comment
  * Retrigger CI
  * Retrigger CI
* Update dmlc_tvm_commit_id.txt

[QNN] Quantize - Fixing the sequence of lowering.

9984f65

anijain2305 force-pushed the quantize branch from 544e3a7 to 9984f65 Compare November 13, 2019 19:36

zhiics approved these changes Nov 14, 2019

zhiics merged commit fed79b3 into apache:master Nov 14, 2019

anijain2305 deleted the quantize branch November 14, 2019 20:40

zxy844288792 pushed a commit to zxy844288792/tvm that referenced this pull request Nov 15, 2019

[QNN] Quantize - Fixing the sequence of lowering. (apache#4316)

ceea4af

zxy844288792 pushed a commit to zxy844288792/tvm that referenced this pull request Nov 15, 2019

[QNN] Quantize - Fixing the sequence of lowering. (apache#4316)

9ab329e
