Arm backend: Minimal example of pruning #15851
Pull Request Overview
This PR introduces a comprehensive tutorial on neural network pruning for the Arm Ethos-U NPU, demonstrating how to prune a model and measure the resulting performance improvements. The example uses a simple MNIST classifier and shows the complete workflow from training to deployment.
Key changes:
- Adds a Jupyter notebook tutorial demonstrating pruning workflow with PyTorch and ExecuTorch
- Updates performance monitoring to track MAC and Weight Decoder activity for Ethos-U85
- Shows 3x+ inference speedup and significant memory reduction through pruning
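To illustrate the idea behind the pruning step in the summary above, here is a minimal sketch of unstructured magnitude pruning in plain Python. It is not the notebook's code: the helper names `magnitude_prune` and `sparsity_of` and the toy weight matrix are hypothetical, and the notebook may use a different pruning method or schedule. In PyTorch, `torch.nn.utils.prune.l1_unstructured` provides comparable behaviour for a module's parameters.

```python
def magnitude_prune(weights, sparsity):
    """Return a copy of a 2-D weight matrix with the smallest-magnitude
    fraction `sparsity` of its entries set to zero (hypothetical helper)."""
    if not 0.0 <= sparsity <= 1.0:
        raise ValueError("sparsity must be in [0, 1]")
    flat = [w for row in weights for w in row]
    n_prune = int(len(flat) * sparsity)
    # Rank entries by absolute value and mark the smallest ones for removal.
    order = sorted(range(len(flat)), key=lambda i: abs(flat[i]))
    drop = set(order[:n_prune])
    pruned = [0.0 if i in drop else w for i, w in enumerate(flat)]
    cols = len(weights[0])
    return [pruned[r * cols:(r + 1) * cols] for r in range(len(weights))]


def sparsity_of(weights):
    """Fraction of entries that are exactly zero."""
    flat = [w for row in weights for w in row]
    return sum(1 for w in flat if w == 0.0) / len(flat)


# Toy 2x4 layer (made-up values), pruned to 50% sparsity.
layer = [[0.9, -0.05, 0.3, 0.01],
         [-0.7, 0.02, -0.4, 0.08]]
pruned = magnitude_prune(layer, 0.5)
print(pruned)
print(sparsity_of(pruned))
```

Zeroed weights only translate into faster inference when the target hardware can skip or compress them, which is what the notebook measures on the Ethos-U NPU.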
Reviewed Changes
Copilot reviewed 2 out of 2 changed files in this pull request and generated 7 comments.
| File | Description |
|---|---|
| examples/arm/pruning_minimal_example.ipynb | Complete tutorial notebook showing how to prune a neural network, quantize it, and deploy to Ethos-U NPU with performance analysis |
| examples/arm/executor_runner/arm_perf_monitor.cpp | Adds MAC_ACTIVE and WD_ACTIVE PMU counter tracking for Ethos-U85 to support detailed performance analysis |
Comments suppressed due to low confidence (1)
examples/arm/pruning_minimal_example.ipynb:1
- The flag '--debug-force-regor' appears to contain a typo. Verify if this should be '--debug-force-regor' or if it's a misspelling of a valid flag like '--debug-force-regen' or '--debug-force-reorder'.
@gggekov the label "topic: not user facing" is no longer used, so there is no need to set it. It was replaced with "release notes: arm" or "release notes: none".
Explain how to prune a NN and the associated uplift in performance when running on the Ethos-U NPU. Change-Id: Ib68513e5b4cb7ceef280b6fe089985e9948a8140
1f012a9 to 8119d6d
The failures are unrelated.
cc @freddan80 @per @zingo @oscarandersson8218 @digantdesai