[OpenVINO] NNCF Data-Aware Compression Algorithms Support for OVQuantizer by anzr299 · Pull Request #16002 · pytorch/executorch

anzr299 · 2025-11-27T08:17:32Z

Summary

This PR introduces the use of the `nncf.compress_pt2e()` API, which accepts a Torch FX model together with a quantizer object compatible with the Torch AO Quantizers API. It returns the model with weight-only compression applied, optionally combined with additional NNCF algorithms such as AWQ and Scale Estimation.
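To make "weight-only compression" concrete, here is a minimal pure-Python sketch of symmetric per-row int8 weight quantization. It is a toy illustration under stated assumptions, not NNCF's implementation: the helper names `quantize_row_int8` and `dequantize_row` and the sample weights are hypothetical, and NNCF itself is not called.

```python
from typing import List, Tuple


def quantize_row_int8(row: List[float]) -> Tuple[List[int], float]:
    """Symmetric int8 quantization of one weight row, so that w ~= q * scale."""
    max_abs = max(abs(w) for w in row)
    # Guard against an all-zero row, where any scale reproduces the weights.
    scale = max_abs / 127 if max_abs > 0 else 1.0
    q = [max(-127, min(127, round(w / scale))) for w in row]
    return q, scale


def dequantize_row(q: List[int], scale: float) -> List[float]:
    """Recover approximate floating-point weights from integers and a scale."""
    return [v * scale for v in q]


# Hypothetical 2x3 weight matrix; activations are left untouched (weight-only).
weights = [[0.5, -1.0, 0.25], [2.0, 0.1, -0.4]]
compressed = [quantize_row_int8(row) for row in weights]
restored = [dequantize_row(q, scale) for q, scale in compressed]
```

Only the weights are stored as integers plus one scale per row. With round-to-nearest, each restored weight differs from the original by at most half a quantization step.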

pytorch-bot · 2025-11-27T08:17:36Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/16002

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 6 New Failures

As of commit 0c82495 with merge base 1550f0c:

NEW FAILURES - The following jobs have failed:

pull / android / build-llm-demo / linux-job (gh)
At least one of the pre-conditions you specified did not hold
pull / test-samsung-models-linux / linux-job (gh)
RuntimeError: Command docker exec -t a67f38422e888657031e0ffcf3663b29de2f5f22b0d6e4829902f76f22dfea47 /exec failed with exit code 1
pull / test-samsung-quantmodels-linux / linux-job (gh)
RuntimeError: Command docker exec -t 4bd891c83232141a6516b3321d79d0077ba55d796a99d1d910650d9f326516f7 /exec failed with exit code 1
Test CUDA Builds / check-all-cuda-builds (gh)
Process completed with exit code 1.
Test CUDA Builds / test-executorch-cuda-build-12.9 / linux-job (gh)
RuntimeError: Command docker exec -t 2247b0899dff5e2644f1ad61d29e343dad7005da51f190e1025cafe7522aa57e /exec failed with exit code 1
Test CUDA Builds / test-models-cuda (resnet18) / linux-job (gh)
RuntimeError: Command docker exec -t ab7a4311bbd306ffef679e876ac997035f5f1a03808a5acd5b952a5deb2ce93a /exec failed with exit code 1

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2025-12-03T09:10:56Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

Copilot

Pull request overview

This PR adds support for NNCF data-aware compression algorithms (AWQ and Scale Estimation) to the OpenVINO quantizer. It refactors the quantizer configuration to enable more flexible compression options and introduces a new compression module for LLM calibration.
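As a rough illustration of what "data-aware" means here, the sketch below uses calibration activations to choose per-channel weight scales before quantization, in the spirit of AWQ. It is a simplified toy under stated assumptions, not NNCF's algorithm: the functions `quantize`, `output_error`, and `search_scales`, the 4-bit-like level count, the exponent grid, and the sample data are all hypothetical.

```python
from typing import List, Sequence, Tuple


def quantize(values: List[float], levels: int = 7) -> List[float]:
    """Round-to-nearest symmetric quantization, returned in dequantized form."""
    max_abs = max(abs(v) for v in values) or 1.0
    scale = max_abs / levels
    return [round(v / scale) * scale for v in values]


def output_error(weights: List[float], calib: List[List[float]],
                 scales: List[float]) -> float:
    """Mean squared error of y = w . x when w*s is quantized and x is divided by s."""
    wq = quantize([w * s for w, s in zip(weights, scales)])
    total = 0.0
    for x in calib:
        exact = sum(w * xi for w, xi in zip(weights, x))
        approx = sum(q * xi / s for q, xi, s in zip(wq, x, scales))
        total += (exact - approx) ** 2
    return total / len(calib)


def search_scales(weights: List[float], calib: List[List[float]],
                  grid: Sequence[float] = (0.0, 0.25, 0.5, 0.75, 1.0)
                  ) -> Tuple[float, float, List[float]]:
    """Pick the exponent alpha whose scales s_j = mean|x_j|**alpha minimize the error."""
    n = len(weights)
    act = [sum(abs(x[j]) for x in calib) / len(calib) for j in range(n)]
    best = None
    for alpha in grid:
        scales = [max(a, 1e-8) ** alpha for a in act]
        err = output_error(weights, calib, scales)
        if best is None or err < best[0]:
            best = (err, alpha, scales)
    return best


# Hypothetical weights and calibration activations for a single output neuron.
weights = [0.9, -0.05, 0.3, 0.02]
calib = [[0.1, 4.0, 0.2, 3.0], [0.2, 5.0, 0.1, 2.5], [0.15, 4.5, 0.3, 3.5]]
best_err, best_alpha, best_scales = search_scales(weights, calib)
```

Because the grid includes `alpha = 0.0` (all scales equal to 1, meaning plain quantization), the selected configuration is never worse on the calibration set than quantizing without calibration data.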

Key changes:

- Replaces the boolean `nncf_compression` field with two algorithm-specific flags: `openvino_awq` and `openvino_scale_estimation`
- Refactors `WEIGHTS_ONLY_COMPRESSION_MODES` from a tuple to a dictionary for cleaner mode mapping
- Adds new methods to expose weight compression configuration and parameters
- Introduces the `apply_nncf_data_aware_compression` function for data-aware LLM compression
- Updates the NNCF dependency to use the latest version instead of a specific commit
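The tuple-to-dictionary refactor mentioned above can be sketched as follows. This is an illustrative assumption about the shape of such a change, not the code from the PR: the `Mode` enum members, the `WeightConfig` type, and the `weight_config_for` helper are hypothetical, and only the name `WEIGHTS_ONLY_COMPRESSION_MODES` comes from the review summary.

```python
from enum import Enum
from typing import NamedTuple


class Mode(Enum):
    """Hypothetical stand-in for the quantizer's mode enum."""
    INT8_FULL = "int8_full"
    INT8WO_SYM = "int8wo_sym"
    INT8WO_ASYM = "int8wo_asym"
    INT4WO_SYM = "int4wo_sym"
    INT4WO_ASYM = "int4wo_asym"


class WeightConfig(NamedTuple):
    """Hypothetical per-mode settings."""
    bits: int
    symmetric: bool


# Before: a tuple can only answer "is this a weight-only mode?".
WEIGHTS_ONLY_MODES_TUPLE = (
    Mode.INT8WO_SYM, Mode.INT8WO_ASYM, Mode.INT4WO_SYM, Mode.INT4WO_ASYM,
)

# After: a dictionary answers the same question and carries the mapping,
# so no separate if/elif chain is needed to translate a mode into settings.
WEIGHTS_ONLY_COMPRESSION_MODES = {
    Mode.INT8WO_SYM: WeightConfig(bits=8, symmetric=True),
    Mode.INT8WO_ASYM: WeightConfig(bits=8, symmetric=False),
    Mode.INT4WO_SYM: WeightConfig(bits=4, symmetric=True),
    Mode.INT4WO_ASYM: WeightConfig(bits=4, symmetric=False),
}


def weight_config_for(mode: Mode) -> WeightConfig:
    """Look up the settings for a weight-only mode, rejecting other modes."""
    try:
        return WEIGHTS_ONLY_COMPRESSION_MODES[mode]
    except KeyError:
        raise ValueError(f"{mode.name} is not a weight-only compression mode") from None
```

Membership checks such as `mode in WEIGHTS_ONLY_COMPRESSION_MODES` keep working unchanged, since `in` on a dictionary tests its keys.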

Reviewed changes

Copilot reviewed 6 out of 6 changed files in this pull request and generated 11 comments.

| File | Description |
| --- | --- |
| `extension/llm/export/config/llm_config.py` | Replaces `nncf_compression` with `openvino_awq` and `openvino_scale_estimation` configuration fields |
| `examples/models/llama/export_llama_lib.py` | Adds CLI arguments for AWQ and scale estimation, integrates new compression function |
| `backends/openvino/requirements.txt` | Updates NNCF dependency to use latest version from main branch |
| `backends/openvino/quantizer/quantizer.py` | Refactors compression modes mapping, adds new methods for weight compression config exposure, adds check for null compression configs |
| `backends/openvino/quantizer/llm_compression.py` | New file implementing data-aware compression with calibration data generation |
| `backends/openvino/quantizer/__init__.py` | Exports new `apply_nncf_data_aware_compression` function |
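The CLI change described for `export_llama_lib.py` could look roughly like the sketch below. The flag spellings `--openvino_awq` and `--openvino_scale_estimation` are assumptions that mirror the configuration field names, and `build_parser` and `compression_options` are hypothetical helpers, not functions from the repository.

```python
import argparse
from typing import Dict, List


def build_parser() -> argparse.ArgumentParser:
    """Build a parser with two independent opt-in switches (assumed flag names)."""
    parser = argparse.ArgumentParser(description="Export sketch with OpenVINO options")
    parser.add_argument(
        "--openvino_awq",
        action="store_true",
        help="Enable AWQ during weight compression (assumed flag name).",
    )
    parser.add_argument(
        "--openvino_scale_estimation",
        action="store_true",
        help="Enable scale estimation during weight compression (assumed flag name).",
    )
    return parser


def compression_options(argv: List[str]) -> Dict[str, bool]:
    """Translate parsed flags into keyword-style options for a compression call."""
    args = build_parser().parse_args(argv)
    return {
        "awq": args.openvino_awq,
        "scale_estimation": args.openvino_scale_estimation,
    }


# Example: only AWQ requested on the command line.
options = compression_options(["--openvino_awq"])
```

Two separate `store_true` switches let each algorithm be enabled on its own, which a single boolean such as the former `nncf_compression` field could not express.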

backends/openvino/quantizer/quantizer.py

backends/openvino/quantizer/llm_compression.py

backends/openvino/quantizer/quantizer.py

backends/openvino/quantizer/llm_compression.py

backends/openvino/quantizer/quantizer.py

examples/models/llama/export_llama_lib.py

backends/openvino/quantizer/llm_compression.py

backends/openvino/quantizer/quantizer.py

backends/openvino/requirements.txt

Copilot

Pull request overview

Copilot reviewed 6 out of 6 changed files in this pull request and generated 13 comments.

backends/openvino/quantizer/quantizer.py

backends/openvino/quantizer/llm_compression.py

backends/openvino/quantizer/quantizer.py

backends/openvino/quantizer/llm_compression.py

examples/models/llama/export_llama_lib.py

Copilot

Pull request overview

Copilot reviewed 6 out of 6 changed files in this pull request and generated 6 comments.

backends/openvino/quantizer/quantizer.py

examples/models/llama/export_llama_lib.py

backends/openvino/quantizer/quantizer.py

backends/openvino/quantizer/llm_compression.py

examples/models/llama/export_llama_lib.py

Copilot

Pull request overview

Copilot reviewed 6 out of 6 changed files in this pull request and generated 6 comments.

backends/openvino/quantizer/quantizer.py

backends/openvino/quantizer/llm_compression.py

extension/llm/export/config/llm_config.py

backends/openvino/quantizer/llm_compression.py

Copilot

Pull request overview

Copilot reviewed 6 out of 6 changed files in this pull request and generated 1 comment.

Comments suppressed due to low confidence (1)

extension/llm/export/config/llm_config.py:459

The docstring incorrectly states 'Configures the QNN backend' when this is the OpenvinoConfig class. It should say 'Configures the OpenVINO backend.'

    """
    Configures the QNN backend.
    """

backends/openvino/quantizer/llm_compression.py

Copilot

Pull request overview

Copilot reviewed 6 out of 6 changed files in this pull request and generated 5 comments.

backends/openvino/quantizer/quantizer.py

backends/openvino/quantizer/llm_compression.py

backends/openvino/quantizer/quantizer.py

backends/openvino/quantizer/llm_compression.py

Copilot

Pull request overview

Copilot reviewed 10 out of 10 changed files in this pull request and generated 2 comments.

backends/openvino/quantizer/llm_compression.py

Copilot

Pull request overview

Copilot reviewed 10 out of 10 changed files in this pull request and generated 3 comments.

backends/openvino/quantizer/llm_compression.py

Copilot

Pull request overview

Copilot reviewed 10 out of 10 changed files in this pull request and generated 6 comments.

backends/openvino/quantizer/llm_compression.py

backends/openvino/quantizer/quantizer.py

backends/openvino/tests/quantizer/test_llm_compression.py

backends/openvino/quantizer/llm_compression.py

Copilot

Pull request overview

Copilot reviewed 10 out of 10 changed files in this pull request and generated 5 comments.

examples/models/llama/export_llama_lib.py

extension/llm/export/config/llm_config.py

examples/models/llama/export_llama_lib.py

backends/openvino/tests/quantizer/test_llm_compression.py

backends/openvino/quantizer/quantizer.py

anzr299 · 2026-02-24T14:52:36Z

@mergennachin The new version of NNCF has been released, and I have updated requirements.txt accordingly.

meta-cla bot added the CLA Signed label Nov 27, 2025

anzr299 mentioned this pull request Dec 3, 2025

NNCF Compress PT2E Support cavusmustafa/executorch#14

Closed

anzr299 marked this pull request as ready for review January 8, 2026 14:44

anzr299 requested a review from larryliu0820 as a code owner January 8, 2026 14:44

Copilot AI review requested due to automatic review settings January 8, 2026 14:44

anzr299 requested review from lucylq and mergennachin as code owners January 8, 2026 14:44

anzr299 marked this pull request as draft January 8, 2026 14:45

Copilot started reviewing on behalf of anzr299 January 8, 2026 14:45

Copilot AI reviewed Jan 8, 2026

anzr299 marked this pull request as ready for review January 13, 2026 12:49

Copilot AI review requested due to automatic review settings January 13, 2026 12:49

Copilot started reviewing on behalf of anzr299 January 13, 2026 12:49

Copilot AI reviewed Jan 13, 2026

Copilot AI review requested due to automatic review settings January 13, 2026 13:03

Copilot started reviewing on behalf of anzr299 January 13, 2026 13:04

Copilot AI reviewed Jan 13, 2026

Copilot AI review requested due to automatic review settings January 13, 2026 13:55

Copilot started reviewing on behalf of anzr299 January 13, 2026 13:56

Copilot AI reviewed Jan 13, 2026

Copilot AI review requested due to automatic review settings January 15, 2026 08:09

Copilot started reviewing on behalf of anzr299 January 15, 2026 08:10

Copilot AI reviewed Jan 15, 2026

backends/openvino/quantizer/llm_compression.py (outdated, resolved)

Copilot AI review requested due to automatic review settings January 16, 2026 16:50

Copilot started reviewing on behalf of anzr299 January 16, 2026 16:51

Copilot AI reviewed Jan 16, 2026

Copilot AI review requested due to automatic review settings January 16, 2026 17:50

Copilot started reviewing on behalf of anzr299 January 16, 2026 17:50

anzr299 added 3 commits January 22, 2026 00:06

lint

0d3d681

review changes

12efc70

comprae reference scale values in tests

1236dfc

Copilot AI review requested due to automatic review settings January 22, 2026 12:43

Copilot started reviewing on behalf of anzr299 January 22, 2026 12:43

remove dead code

019b2cc

Copilot AI reviewed Jan 22, 2026

backends/openvino/quantizer/llm_compression.py (resolved)

backends/openvino/quantizer/llm_compression.py (resolved)

anzr299 added 4 commits February 3, 2026 13:20

Merge branch 'main' into an/openvino/nncf_compress_pt2e

659a834

lint fixes

562261f

extend test for error

ecd5b8a

lint

d72466d

mergennachin approved these changes Feb 3, 2026

Merge branch 'main' into an/openvino/nncf_compress_pt2e

6e349c3

Copilot AI review requested due to automatic review settings February 4, 2026 07:13

Copilot started reviewing on behalf of anzr299 February 4, 2026 07:14

Copilot AI reviewed Feb 4, 2026

backends/openvino/quantizer/llm_compression.py (resolved)

backends/openvino/quantizer/llm_compression.py (outdated, resolved)

backends/openvino/quantizer/llm_compression.py (resolved)

anzr299 added 2 commits February 4, 2026 11:36

remove leading space in error message

83f0fb8

Merge branch 'pytorch:main' into an/openvino/nncf_compress_pt2e

42fc491

Copilot AI review requested due to automatic review settings February 6, 2026 13:35

Copilot started reviewing on behalf of anzr299 February 6, 2026 13:36

Copilot AI reviewed Feb 6, 2026

anzr299 added 2 commits February 23, 2026 10:35

Merge branch 'pytorch:main' into an/openvino/nncf_compress_pt2e

ba68d56

Merge branch 'pytorch:main' into an/openvino/nncf_compress_pt2e

b1b2fb2

Copilot AI review requested due to automatic review settings February 24, 2026 14:17

Copilot started reviewing on behalf of anzr299 February 24, 2026 14:18

update nncf version to 3.0.0

0093592

Copilot AI reviewed Feb 24, 2026

Merge branch 'main' into an/openvino/nncf_compress_pt2e

0c82495

mergennachin merged commit b9c3217 into pytorch:main Feb 25, 2026
238 of 249 checks passed
