Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
404 commits
Select commit Hold shift + click to select a range
049c9fc
tensor empty strided
Gasoonjia Sep 28, 2025
b5308f5
[Cadence] move ETDump to OSS (#14616)
pytorchbot Sep 29, 2025
b7ac647
Arm Backend: Expose Vela's Debug Database (#14511)
jmahbs Sep 29, 2025
d0f486a
Make _get_program_from_buffer work for bundled programs take 2 (#14503)
Erik-Lundell Sep 29, 2025
9602b2e
Arm backend: Add chronological dependencies for passes (#14578)
martinlsm Sep 29, 2025
ac2c073
Arm backend: Add TOSA dialect op for RESIZE (#14513)
oscarandersson8218 Sep 29, 2025
668e730
Update add-unanswered-to-project.yml to include github-token with nec…
nil-is-all Sep 29, 2025
d09dd79
[Backend Tester] Migrate to pytest (#14456)
GregoryComer Sep 29, 2025
2bf2de3
Bump tokenizers to b007644 (#14638)
GregoryComer Sep 29, 2025
9e6ad61
Fix buck cell for xplat builds.
hsharma35 Sep 29, 2025
6933889
update torchtune pin (#14662)
lucylq Sep 29, 2025
2d54207
Make extension_runner_util install relative (#14465)
AdrianLundell Sep 29, 2025
601ddda
[Docs] Update QNN SDK version and add new library pushes (#14659)
cccclai Sep 29, 2025
84f0c7d
Embedding quant unification
metascroy Sep 29, 2025
2b20016
[ET-VK] Statically quantized convolutions (#14668)
pytorchbot Sep 29, 2025
f5e049d
[ET-VK] AOT logic for quantized conv2d (#14669)
pytorchbot Sep 29, 2025
6f9c5b1
[ET-VK] Statically quantized add (#14670)
pytorchbot Sep 29, 2025
73b3303
Update LLMTests with new APIs (#14665)
shoumikhin Sep 29, 2025
181ed4d
delete qnnpack backend
JacobSzwejbka Sep 30, 2025
68b2d3c
Rope custom op
DrJessop Sep 30, 2025
ebf4c12
Replace conv_transpose optional bias with zero bias
ethansfng Sep 30, 2025
db8d04f
[multimodal] Allow generate and prefill to take move sematics (#14643)
larryliu0820 Sep 30, 2025
f7c009e
Rescale add int16 correctly (#14645)
3l1 Sep 30, 2025
87d31af
Support convolution
DrJessop Sep 30, 2025
f662cf5
Qualcomm AI Engine Direct - Fixed the wrong size for memmove in kv_ma…
shewu-quic Sep 30, 2025
5d29a7d
Add torchao conversion (#14545)
metascroy Sep 30, 2025
e45f680
tensor destroy (#14698)
pytorchbot Sep 30, 2025
52330d5
Update add-unanswered-to-project.yml to exclude updated list of contr…
nil-is-all Sep 30, 2025
9a03416
Migrate to pytorch_sphinx_theme2 (#14122)
svekars Sep 30, 2025
41379fb
aoti_torch_create_tensor_from_blob_v2 (#14700)
larryliu0820 Sep 30, 2025
eca4fc6
aoti_torch_create_tensor_from_blob_v2
Gasoonjia Sep 30, 2025
d4d24ec
Adding mixed quantization support
mgiordy Sep 30, 2025
79a0240
Update add-unanswered-to-project.yml to edit github username (#14702)
nil-is-all Sep 30, 2025
44972ad
update gcc gh runner (#14699)
lucylq Sep 30, 2025
696bf19
aoti_torch__reinterpret_tensor (#14688)
larryliu0820 Sep 30, 2025
7df7f5a
aoti_torch__reinterpret_tensor
Gasoonjia Sep 30, 2025
e852066
Move passes which replace the quant/dequant to optimized cadence ops …
eigen-k Sep 30, 2025
aeed916
Rescale sub int16 correctly
3l1 Sep 30, 2025
65100f6
aoti_torch_copy_ (#14689)
larryliu0820 Sep 30, 2025
8106204
Added DimorderOpsRevertPass to Openvino backend
suryasidd Sep 30, 2025
8f7d045
aoti_torch_copy_
Gasoonjia Sep 30, 2025
a731206
Adding avgpool2d
DrJessop Sep 30, 2025
04ca3f3
Merge remote-tracking branch 'cavus/main' into openvino_llama_support
suryasidd Sep 30, 2025
62f74a8
Merge branch 'main' into openvino_llama_support
suryasidd Sep 30, 2025
258bce3
use ET_CHECK macro for sanity checks in memory shim layer (#14690)
larryliu0820 Sep 30, 2025
0b17bd2
use ET_CHECK macro for sanity checks in memory shim layer
Gasoonjia Oct 1, 2025
943e34a
Update conv replacement pass + tests.
hsharma35 Oct 1, 2025
0ca5e75
Add Hifi compiled runtime
JakeStevens Oct 1, 2025
b100c95
Update EVal Payload to use pointers for larger objects
JakeStevens Oct 1, 2025
6a238e3
NXP backend: Add infrastructure for context dependant partitioning (#…
MartinPavella Oct 1, 2025
0786faa
[aoti-cuda] Directly pass user input placeholders to torch._inductor.…
larryliu0820 Oct 1, 2025
c1d0d3e
NXP backend: Remove residual onnx references (#14457)
MartinPavella Oct 1, 2025
ce6e2cf
NXP Backend: Update Neutron Software to version SDK_25.09 (#14591)
robert-kalmar Oct 1, 2025
421539e
Module support for multiple ptd files (#14713)
pytorchbot Oct 1, 2025
b265324
[ET-VK] Optimize conv2d s1p0 (#14187)
alexdean08 Oct 1, 2025
ffd27ca
NXP backend: Extend tests for linear addmm mm converters add mm quant…
roman-janik-nxp Oct 1, 2025
4a644b7
Arm backend: Add docstrings for operator_support/pool_2d_support.py (…
Sebastian-Larsson Oct 1, 2025
70bffae
Arm backend: Use default run_on_fvp=True unless explicitly set to False
SaoirseARM Oct 1, 2025
07dcd95
Arm Backend: Do not use None in Slice
digantdesai Oct 1, 2025
d95143e
refactor:(samsung backend): replace pkg_resources with importlib.reso…
onuralpszr Oct 1, 2025
eaf0e17
Fixed linter issues
suryasidd Oct 1, 2025
15f5e23
Merge branch 'main' into openvino_llama_support
suryasidd Oct 1, 2025
19be2a3
Try to get nightly wheel build work with qnn (#14633)
cccclai Oct 1, 2025
7ed9266
Move to ProxyValue instead of FakeTensor weights.
hsharma35 Oct 1, 2025
a4ac70d
Disable nxp tests (#14730)
abhinaykukkadapu Oct 1, 2025
649f92d
Arm backend: Correct type annotations in aot_arm_compiler (#14627)
martinlsm Oct 1, 2025
871fe39
Arm backend: Update full quantization annotation (#14585)
oscarandersson8218 Oct 1, 2025
0081bef
Arm backend: Add complie spec factories (#14376)
Erik-Lundell Oct 1, 2025
0cd8256
Arm backend: Add docstrings for operator_support/convolution_support.…
Sebastian-Larsson Oct 1, 2025
96dfa9c
Add pybindings for bpte and ptd file
lucylq Oct 1, 2025
b1309e7
Aoti support multi method (#14715)
larryliu0820 Oct 1, 2025
426b701
Arm backend: Backend test TOSA FP, INT and Ethos-U55/U85 (#14653)
zingo Oct 1, 2025
d4f208d
Android set different maven package names of flavors (#14674)
kirklandsign Oct 2, 2025
e608a21
[Backend Tester] Update README (#14739)
GregoryComer Oct 2, 2025
fb66fb3
NXP Backend: Add codeowner for the NXP Backend (#14723)
robert-kalmar Oct 2, 2025
baaaa86
Add transposed convolution
DrJessop Oct 2, 2025
9ab5592
support qnn mean (dim=None) (#14675)
cccclai Oct 2, 2025
f24351a
Update mul int16 test
3l1 Oct 2, 2025
499ce50
Arm backend: Add VGF tests to StableDiffusion module tests (#14655)
YufengShi-dudu Oct 2, 2025
edf6927
NXP backend: Improve Neutron targets handling (#14718)
StrycekSimon Oct 2, 2025
0145604
Arm Backend: Add tests for stack.default (#14623)
agrima1304 Oct 2, 2025
4372a14
Fix const prop pass when a const prop tensor has zero stride, make it…
abhinaykukkadapu Oct 2, 2025
3b358d5
Merge branch 'main' into openvino_llama_support
suryasidd Oct 2, 2025
0882c9b
Qualcomm AI Engine Direct - GA Static Gemma-2b-instruct (#14459)
DannyYuyang-quic Oct 2, 2025
deb42f2
update lama export DS specs to be more accurate.
laithsakka Oct 2, 2025
19258d2
update tokenizer pin (#14751)
JacobSzwejbka Oct 2, 2025
a1652f9
Fix pyproject.toml license classifier deprecation (#14592)
tmi Oct 2, 2025
53ccfd0
Fix cuda export test failures from #14715 (#14753)
larryliu0820 Oct 2, 2025
c997fe4
Remove explicit device arguments
navsud Oct 3, 2025
54bfd72
Fix Wav2Vec Replace Pass Bug
DrJessop Oct 3, 2025
822a711
Update addmm int16 for Ethos-U85
3l1 Oct 3, 2025
e652746
Use FusedMovingAvgObsFakeQuantize instead of FakeQuantize for faster QAT
navsud Oct 3, 2025
70ea661
Add Phi4 test and fix regex parsing.
shoumikhin Oct 3, 2025
05799c9
NXP backend: added aten.sub operator support (#14514)
novak-vaclav Oct 3, 2025
3557edf
Update MTK tool versions in documents (#14772)
neuropilot-captain Oct 3, 2025
c44c541
Runner support for multiple ptd files (#14758)
pytorchbot Oct 3, 2025
4d681cb
JNI support for multiple ptd files (#14769)
pytorchbot Oct 3, 2025
7116e0a
Tag mutated buffer for AOTI cuda partitioner (#14783)
larryliu0820 Oct 3, 2025
b021fd0
Support im2row
DrJessop Oct 3, 2025
7c7b729
Patch https://github.com/pytorch/executorch/pull/14754 (#14786)
lucylq Oct 3, 2025
0ee1160
Add transposed im2row
DrJessop Oct 4, 2025
0b5a4ab
Update linear -> conv2d int16 for Ethos
3l1 Oct 4, 2025
ca9fc06
[Release Only] Bugfix/fix nxp separable conv test (#14800)
pytorchbot Oct 4, 2025
3f0896a
[ET-VK] Miscellaneous fixes (#14801)
pytorchbot Oct 4, 2025
881915d
Add platforms for all operator library sub-targets.
hsharma35 Oct 4, 2025
3d8b8d1
fix test-huggingface-transformers-* tests (#14752)
cccclai Oct 4, 2025
3b16bc1
Summary: Use javaClassStatic() for class references stored in static …
psiddh Oct 6, 2025
f81e834
Add strict-flag to ExportSession (#14588)
Erik-Lundell Oct 6, 2025
75ebd05
Fix OpenVINO ci (#14784)
suryasidd Oct 6, 2025
9a7fb42
Arm backend: Fix torch.matmul() failures for 2D tensor inputs (#14624)
YufengShi-dudu Oct 6, 2025
ed3fdad
Update extension/llm/tokenizers (#14807)
shoumikhin Oct 6, 2025
815ae92
Update ReplaceSingleElementTensorArgumentsFromFullOpWithScalarPass to…
ethansfng Oct 6, 2025
8c434dd
[Windows] Enable LLM preset in CI (#14805)
GregoryComer Oct 6, 2025
563a5d2
Arm backend: Remove CheckNeedsDecomposition (#14512)
oscarandersson8218 Oct 6, 2025
8484aee
Arm backend: Backend test serializes and uses EthosUQuant on Ethos-U …
zingo Oct 6, 2025
b6bc421
Arm backend: Fix Arm tester issue for inplace ops (#14625)
mansnils Oct 6, 2025
6e7353f
Arm backend: Add 6D tensor and pixel shuffle/unshuffle support (#14626)
mansnils Oct 6, 2025
266cfd0
Arm backend: Add test for monitoring memory allocation (#14657)
perheld Oct 6, 2025
f174974
Arm backend: Remove hello_world in core_software (#14775)
perheld Oct 6, 2025
cf31475
Revert "[Windows] Enable LLM preset in CI (#14805)" (#14823)
GregoryComer Oct 6, 2025
a39866c
Fix op signature for avg_pool2d
DrJessop Oct 6, 2025
bc931e1
Update APP_PATH to point to mv3 directory (#14828)
shoumikhin Oct 6, 2025
270873f
Restructure ET documentation with 'Platform First' model (#14720)
psiddh Oct 6, 2025
d8a2126
Add Gemma 3 test.
shoumikhin Oct 6, 2025
c609f63
Fixed assumption on out_shift for quantized linear
DrJessop Oct 7, 2025
d36bf8c
Run ET-eager on message recall
derekxu Oct 7, 2025
0b748bf
oss et update to support SAR2230P
billmguo Oct 7, 2025
2c603e4
Arm backend: Move rescale ops out of node visitors (#14584)
martinlsm Oct 7, 2025
1b8d380
NXP backend: Add NXP backend tutorial page (#14850)
StrycekSimon Oct 7, 2025
d8e07bd
Add .ptd support to portable executor runner (#14833)
larryliu0820 Oct 7, 2025
0e74a17
Qualcomm AI Engine Direct - Suite Operator Test Support Part 2 (#14848)
winskuo-quic Oct 7, 2025
0bfb61e
Arm backend: Backend test call setup_path.sh (#14846)
zingo Oct 7, 2025
4ac04c5
Arm backend: Bump tosa version to remove mlplatform dependencies (#14…
ArmRyan Oct 7, 2025
8ac6300
Arm backend: Change input distribution on resnet18 test (#14815)
gggekov Oct 7, 2025
7d8da19
Arm backend: Mark test in test_bmm.py as flaky (#14748)
martinlsm Oct 7, 2025
e09abea
support argmax/argmin without dim kwargs and fix adaptive_max_pool3d …
cccclai Oct 7, 2025
351d82f
Sweep major CMake files for use of include/lib instead of CMAKE_INSTA…
swolchok Oct 7, 2025
740fe14
Back out "oss et update to support SAR2230P"
cccclai Oct 7, 2025
15a203b
Fix avg_pool2d replace ops pass
DrJessop Oct 7, 2025
5c4d214
link new vision kernel internally
zonglinpeng Oct 7, 2025
5dee222
[ez] Try to fix Samsung CI job (#14866)
SS-JIA Oct 7, 2025
fcd42bc
Update link for working with Large Language Models (#14863)
mergennachin Oct 7, 2025
697078b
[aoti-et] Add cuda delegate runtime code (#14827)
larryliu0820 Oct 7, 2025
bba9d26
Introduce public MergedDataMap
lucylq Oct 8, 2025
8efba17
Merge branch 'main' into openvino_llama_support
suryasidd Oct 8, 2025
fb87fa6
Including mixed quant Linear op in Jarvis
mgiordy Oct 8, 2025
229bbd2
Use defualt runner for OpenVINO backend as well
suryasidd Oct 8, 2025
400b2a5
[aoti-et] Add a voxtral runner and add CI (#14875)
larryliu0820 Oct 8, 2025
ab5fb84
Arm backend: fix meandim when dim = None (#14883)
Erik-Lundell Oct 8, 2025
45bf018
Arm backend: build with NAMED_DATA_MAP=ON for vgf (#14885)
Erik-Lundell Oct 8, 2025
9be3aaa
Arm backend: Support min/max with unset dim. (#14884)
Erik-Lundell Oct 8, 2025
7d2b8c6
Arm backend: Add correction for floor mode (#14776)
wwwind Oct 8, 2025
41b061e
NXP backend: Update user guide and docs Readme (#14852)
roman-janik-nxp Oct 8, 2025
a41cdef
refactor: ♻️ update YOLO12 example doc and code (#14771)
onuralpszr Oct 8, 2025
b88b09c
Arm backend: Add missing attribute in VisualizePass (#14847)
martinlsm Oct 8, 2025
5c25493
Arm backend: Add docstrings for tosa/partitioner.py (#14844)
Sebastian-Larsson Oct 8, 2025
bf3b66c
Arm backend: Add docstrings for operator_support/ethos_u55_support.py…
Sebastian-Larsson Oct 8, 2025
91f1769
Arm backend: Switch torch.tan to torch.max in test_multiple_delegates…
emmakujala Oct 8, 2025
5a6113f
Arm backend: Add TOSA dialect op for MATMUL (#14694)
oscarandersson8218 Oct 8, 2025
a9fe0b4
Cortex_m backend: Add script for building test runner (#14750)
AdrianLundell Oct 8, 2025
5af73eb
Qualcomm AI Engine Direct - Support floor_divide with int input in QN…
winskuo-quic Oct 8, 2025
7c148a7
Add constraints for split_copy test
ethansfng Oct 8, 2025
d677277
Enable named data map extension in CUDA build (#14898)
larryliu0820 Oct 8, 2025
ec56cfa
Gather common remove passes in one list.
eigen-k Oct 8, 2025
5246168
Group-quantized embedding op
DrJessop Oct 8, 2025
1da530d
Build pthreadpool with hidden visibility on Apple (#14838)
GregoryComer Oct 8, 2025
2672dd3
TransformerBlock: support attention skips
sxu Oct 8, 2025
c62cbfe
Arm backend: Remove out of date warning for ethos-u tutorial (#14897)
robell Oct 8, 2025
73c8d8c
Move cuda/runtime/shim/utils to cuda/runtime for better usibility. (#…
pytorchbot Oct 8, 2025
0142a1a
introduce CudaGuard and cudastreamguard (#14914)
pytorchbot Oct 8, 2025
f64c864
Revert D84020397: Group-quantized embedding op (#14915)
DrJessop Oct 8, 2025
0525d9c
Merge pull request #12 from suryasidd/runner_changes
cavusmustafa Oct 8, 2025
f32e9fc
Back FreeableBuffer with int64_t
lucylq Oct 8, 2025
24f67b6
Merge branch 'main' into openvino_llama_support
suryasidd Oct 8, 2025
a26412e
Reapply "Add EXECUTORCH_THREADPOOL_SIZE options, default to u… (#1430…
GregoryComer Oct 9, 2025
09c93d4
Read max context length from the correct ModelArgs field
sxu Oct 9, 2025
38b51aa
print bfloat16 tensor data (#14889)
manuelcandales Oct 9, 2025
6520e06
Make type of logits a template parameter
sxu Oct 9, 2025
698ea79
Qualcomm AI Engine Direct - docs fix (#14881)
DannyYuyang-quic Oct 9, 2025
29b4db8
Including mixed quant Conv1D op in Jarvis
mgiordy Oct 9, 2025
f7f97f7
introduce shim layers for cudaguard and cudastreamguard (#14925)
pytorchbot Oct 9, 2025
2eb8994
Add Voxtral test. (#14918)
shoumikhin Oct 9, 2025
8fbc42c
Arm backend: Unsqueeze rank 0 tensor at vgf runtime (#14856)
ArmRyan Oct 9, 2025
418c584
Use quantizable LSTM in test when flow has quantize=True (#14893)
Erik-Lundell Oct 9, 2025
dda2705
Arm backend: Decompose sub/add with alpha!=1 (#14932)
Erik-Lundell Oct 9, 2025
29b98c3
Arm backend: add new cmake line to vgf tutorial (#14935)
Erik-Lundell Oct 9, 2025
75f968d
Make determinism of channels_last more conservative
kimishpatel Oct 9, 2025
a509431
Update extension/llm/tokenizers to d710a0cf10cfa8cb7ffda33c4e61af6311…
shoumikhin Oct 9, 2025
bdc526b
Qualcomm AI Engine Direct - change the llama tutorial to static llama…
DannyYuyang-quic Oct 9, 2025
d4129b7
Arm backend: Updated how generic evaluator is handled (#14940)
Michiel-Olieslagers Oct 9, 2025
71c8031
Fix iOS demo app package resolution on CI (#14952)
shoumikhin Oct 9, 2025
64b0fd9
Make determinism of channels_last more conservative
kimishpatel Oct 9, 2025
bf977e0
Group-quantized embedding op
DrJessop Oct 9, 2025
84d060a
XNNPACK: Assert on unsupported pass through tensor args
digantdesai Oct 9, 2025
a5d7e5c
[ET-VK] Add Fusing for Conv/Binary Ops, Clamp/Binary Ops, and Clamp/C…
alexdean08 Oct 9, 2025
b6884df
Bump cortex-m size test (#14950)
lucylq Oct 9, 2025
f443ebb
Add overload to create atensor view from TensorPtr.
shoumikhin Oct 9, 2025
fc512fa
Fix typos in docs ahead of GA (#14964)
abhinaykukkadapu Oct 9, 2025
8d51b0f
[ET-VK] Show stack trace in Exception messages via boost if boost is …
pytorchbot Oct 9, 2025
d0827e5
Use merged data map in module (#14966)
pytorchbot Oct 9, 2025
66c3dea
Add a wav loader (#14923)
larryliu0820 Oct 9, 2025
9764269
Pass which replaces torch quantized embedding byte with cadence variant
DrJessop Oct 9, 2025
d39992f
Make HQQ default PTQ quantization in ExecuTorch
metascroy Oct 10, 2025
7f31fd8
Removed support for non-per-tensor quantized relu
DrJessop Oct 10, 2025
8b67236
Enable Exynos backend Quatization (#14464)
Jiseong-oh Oct 10, 2025
cf13b9a
[docs] Fix typo in instructions (#14968)
cccclai Oct 10, 2025
4bdd3df
Allow custom sizes, dim order and strides for tensor view. (#14944)
shoumikhin Oct 10, 2025
c979158
Cortex-M backend: Add mul and linear tests (#14746)
AdrianLundell Oct 10, 2025
7395999
Arm backend: add DeiTTiny evaluator and deterministic shuffled calibr…
tirwu01 Oct 10, 2025
caa0094
backends/cuda: use async malloc/free (#14976)
swolchok Oct 10, 2025
94c892c
Arm backend: Update MLSDK dependencies to use gitlab (#14989)
ArmRyan Oct 10, 2025
3591604
NXP backend: Extend NXP backend docs page, add partitioner and quanti…
roman-janik-nxp Oct 10, 2025
896178e
Fix MSVC ambiguity in make_tensor_ptr (#14991)
shoumikhin Oct 10, 2025
3247c15
Revert "[multimodal] Allow generate and prefill to take move sematics…
larryliu0820 Oct 10, 2025
9b03c13
Add extension_named_data_map to llava (#14973)
lucylq Oct 10, 2025
21557d0
set emulate_precision_casts as true for cuda backend for better accur…
pytorchbot Oct 10, 2025
e16cf17
More fixes to docs, fix broken links and more typos (#14975)
abhinaykukkadapu Oct 10, 2025
e26670b
Make image and audio variables const references (#14999)
larryliu0820 Oct 10, 2025
c4bd450
Revert "Arm backend: Add correction for floor mode" (#14998)
ArmRyan Oct 10, 2025
fca0f38
update etrecrod doc to cover new generation pipeline (#15012)
pytorchbot Oct 10, 2025
3bfd5e0
Promote pyproject beta to production/stable (#14777)
mergennachin Oct 10, 2025
7533df6
use reference link for html doc (#15029)
pytorchbot Oct 10, 2025
09eac16
[aoti-et] Enable multimodal runner for Voxtral on CUDA (#14980)
larryliu0820 Oct 11, 2025
4609cdb
Move RemovePermutesAroundElementwiseOps and RemoveSqueezeViewBeforeEl…
eigen-k Oct 11, 2025
1dc0e0e
Arm backend: Upgrade vela to 4.4.1
oscarandersson8218 Oct 11, 2025
cc6cb83
Add option to specify fake tensor mode for graph and program builders.
hsharma35 Oct 11, 2025
35d431b
Arm backend: Enable parallel building on MLSDK emulation layer (#14993)
ArmRyan Oct 11, 2025
019c8da
Qualcomm AI Engine Direct - increase index_put coverage (#14924)
haowhsu-quic Oct 11, 2025
50a10a2
Updating tests for 16A8W ops which are supported (#14945)
Ninja91 Oct 10, 2025
703d25a
Including mixed quant GRU op in Jarvis
mgiordy Oct 12, 2025
e69700b
Support for batched matmul
DrJessop Oct 12, 2025
f32cdc3
pin bump with better architecture (#15040)
pytorchbot Oct 12, 2025
afd98fe
NXP backend: Add conversion and quantization support for dim_order_op…
StrycekSimon Oct 13, 2025
d00279d
Minor update for Arm README.md (#15045)
robell Oct 13, 2025
82bc4c5
Merge branch 'main' into openvino_llama_support
suryasidd Oct 13, 2025
1a8acf6
Update top-level README.md file (#15049)
mergennachin Oct 13, 2025
f84c423
[Metal] Update aoti_common with additional AOTI functions needed by M…
manuelcandales Oct 13, 2025
626a7d1
Move RemoveCatFromSliceCopyPass to the common section. (#14972)
eigen-k Oct 13, 2025
9560800
Fix documentation link for Core ATen operators (#15050)
mergennachin Oct 13, 2025
6efddba
Support sine operator on XNNPACK (#14711)
GregoryComer Oct 13, 2025
a66ea20
msvc support 1/N (#14970)
JacobSzwejbka Oct 13, 2025
adc4889
Move tensor layout into exir (#14917)
pytorchbot Oct 13, 2025
f19882b
Handle uint types. (#15055)
shoumikhin Oct 13, 2025
b9451c9
Use new logo in ExecuTorch (#14782)
mergennachin Oct 13, 2025
23db0bc
Tensor view keeps original tensor alive. (#15056)
shoumikhin Oct 13, 2025
8876113
Ignore PRs that's empty (#15065)
mergennachin Oct 13, 2025
b9e8126
Export lora weights to sep file (#15061)
lucylq Oct 13, 2025
b18243b
Revert "[ET-VK] Add Fusing for Conv/Binary Ops, Clamp/Binary Ops, and…
JacobSzwejbka Oct 13, 2025
1428d81
Changed quantization scheme
suryasidd Oct 13, 2025
caba225
Merge branch 'main' into openvino_llama_support
suryasidd Oct 13, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
4 changes: 2 additions & 2 deletions .ci/docker/build.sh
Original file line number Diff line number Diff line change
Expand Up @@ -54,13 +54,13 @@ case "${IMAGE_NAME}" in
executorch-ubuntu-22.04-mediatek-sdk)
MEDIATEK_SDK=yes
CLANG_VERSION=12
ANDROID_NDK_VERSION=r27b
ANDROID_NDK_VERSION=r28c
;;
executorch-ubuntu-22.04-clang12-android)
LINTRUNNER=""
CLANG_VERSION=12
# From https://developer.android.com/ndk/downloads
ANDROID_NDK_VERSION=r27b
ANDROID_NDK_VERSION=r28c
;;
*)
echo "Invalid image name ${IMAGE_NAME}"
Expand Down
2 changes: 1 addition & 1 deletion .ci/docker/ci_commit_pins/optimum-executorch.txt
Original file line number Diff line number Diff line change
@@ -1 +1 @@
40b02a2dc61bbf901a2df91719f47c98d65368ec
44d8d54e38c0258357d4e92e1fefe21e845947a3
2 changes: 1 addition & 1 deletion .ci/docker/ci_commit_pins/pytorch.txt
Original file line number Diff line number Diff line change
@@ -1 +1 @@
4d4abec80f03cd8fdefe1d9cb3a60d3690cd777e
cf9d09490c7f6685ec68d5db3acf2e0d73c54d00
21 changes: 12 additions & 9 deletions .ci/docker/requirements-ci.txt
Original file line number Diff line number Diff line change
Expand Up @@ -16,18 +16,21 @@ hypothesis==6.84.2
parameterized==0.9.0

# Doc build requirements, same as https://github.com/pytorch/pytorch/blob/main/.ci/docker/requirements-docs.txt
sphinx==5.3.0
sphinx==7.2.6
sphinxcontrib.katex==0.9.10
breathe==4.36.0 # only if generating C++
exhale==0.3.7 # only if generating C++ docs
docutils==0.18.1,<0.21
sphinx-design==0.6.1
sphinxcontrib-mermaid==1.0.0
myst-parser==3.0.1 # if want to contribute in markdown
sphinx-gallery==0.14.0 # only if hosting interactive tutorials
sphinx-sitemap==2.7.1
sphinx-reredirects==0.1.4
sphinx-gallery==0.14.0
breathe==4.34.0
exhale==0.2.3
docutils==0.16
matplotlib>=3.9.4
sphinx-copybutton==0.5.2
# PyTorch Theme
-e git+https://github.com/pytorch/pytorch_sphinx_theme.git#egg=pytorch_sphinx_theme
myst-parser==0.18.1
sphinx_design==0.4.1
sphinx-copybutton==0.5.0
-e git+https://github.com/pytorch/pytorch_sphinx_theme.git@pytorch_sphinx_theme2#egg=pytorch_sphinx_theme2

# script unit test requirements
yaspin==3.1.0
1 change: 1 addition & 0 deletions .ci/scripts/build-qnn-sdk.sh
Original file line number Diff line number Diff line change
Expand Up @@ -38,6 +38,7 @@ set_up_aot() {
-DEXECUTORCH_BUILD_EXTENSION_EXTENSION_LLM=ON \
-DEXECUTORCH_BUILD_EXTENSION_EXTENSION_LLM_RUNNER=ON \
-DEXECUTORCH_BUILD_EXTENSION_FLAT_TENSOR=ON \
-DEXECUTORCH_BUILD_EXTENSION_NAMED_DATA_MAP=ON \
-DEXECUTORCH_BUILD_EXTENSION_TENSOR=ON \
-DEXECUTORCH_ENABLE_EVENT_TRACER=ON \
-DPYTHON_EXECUTABLE=python3
Expand Down
20 changes: 9 additions & 11 deletions .ci/scripts/setup-openvino.sh
Original file line number Diff line number Diff line change
Expand Up @@ -10,19 +10,17 @@ set -ex
# shellcheck source=/dev/null
source "$(dirname "${BASH_SOURCE[0]}")/utils.sh"

git clone https://github.com/openvinotoolkit/openvino.git
cd openvino && git checkout releases/2025/1
git submodule update --init --recursive
sudo ./install_build_dependencies.sh
mkdir build && cd build
cmake .. -DCMAKE_BUILD_TYPE=Release -DENABLE_PYTHON=ON
make -j$(nproc)
# Download and install OpenVINO from release packages
OPENVINO_VERSION="2025.3"
OPENVINO_BUILD="2025.3.0.19807.44526285f24"
OPENVINO_URL="https://storage.openvinotoolkit.org/repositories/openvino/packages/${OPENVINO_VERSION}/linux/openvino_toolkit_ubuntu22_${OPENVINO_BUILD}_x86_64.tgz"

cd ..
cmake --install build --prefix dist
curl -Lo /tmp/openvino_toolkit.tgz --retry 3 --fail ${OPENVINO_URL}
tar -xzf /tmp/openvino_toolkit.tgz
mv openvino_toolkit_ubuntu22_${OPENVINO_BUILD}_x86_64 openvino

source dist/setupvars.sh
cd ../backends/openvino
source openvino/setupvars.sh
cd backends/openvino
pip install -r requirements.txt
cd scripts
./openvino_build.sh --enable_python
8 changes: 4 additions & 4 deletions .ci/scripts/setup-samsung-linux-deps.sh
Original file line number Diff line number Diff line change
Expand Up @@ -11,9 +11,9 @@ set -ex

download_ai_lite_core() {
API_BASE="https://soc-developer.semiconductor.samsung.com/api/v1/resource/ai-litecore/download"
API_KEY="kn10SoSY3hkC-9Qny5TqD2mnqVrlupv3krnjLeBt5cY"
API_KEY=$SAMSUNG_AI_LITECORE_KEY

VERSION="0.5"
VERSION="0.7"
OS_NAME="Ubuntu 22.04"
OUT_FILE="/tmp/exynos-ai-litecore-v${VERSION}.tar.gz"
TARGET_PATH="/tmp/exynos_ai_lite_core"
Expand Down Expand Up @@ -52,7 +52,7 @@ download_ai_lite_core() {
install_enn_backend() {
NDK_INSTALLATION_DIR=/opt/ndk
rm -rf "${NDK_INSTALLATION_DIR}" && sudo mkdir -p "${NDK_INSTALLATION_DIR}"
ANDROID_NDK_VERSION=r27b
ANDROID_NDK_VERSION=r28c

# build Exynos backend
export ANDROID_NDK_ROOT=${ANDROID_NDK_ROOT:-/opt/ndk}
Expand All @@ -62,7 +62,7 @@ install_enn_backend() {
export PYTHONPATH=${PYTHONPATH:-}:${EXECUTORCH_ROOT}/..
}

AI_LITE_CORE_VERSION=0.5.0
AI_LITE_CORE_VERSION=0.7.0

download_ai_lite_core ${AI_LITE_CORE_VERSION}
install_enn_backend
95 changes: 95 additions & 0 deletions .ci/scripts/test-cuda-build.sh
Original file line number Diff line number Diff line change
@@ -0,0 +1,95 @@
#!/bin/bash
# Copyright (c) Meta Platforms, Inc. and affiliates.
# All rights reserved.
#
# This source code is licensed under the BSD-style license found in the
# LICENSE file in the root directory of this source tree.

set -exu

CUDA_VERSION=${1:-"12.6"}

echo "=== Testing ExecuTorch CUDA ${CUDA_VERSION} Build ==="

# Function to build and test ExecuTorch with CUDA support
test_executorch_cuda_build() {
local cuda_version=$1

echo "Building ExecuTorch with CUDA ${cuda_version} support..."
echo "ExecuTorch will automatically detect CUDA and install appropriate PyTorch wheel"

# Check available resources before starting
echo "=== System Information ==="
echo "Available memory: $(free -h | grep Mem | awk '{print $2}')"
echo "Available disk space: $(df -h . | tail -1 | awk '{print $4}')"
echo "CPU cores: $(nproc)"
echo "CUDA version check:"
nvcc --version || echo "nvcc not found"
nvidia-smi || echo "nvidia-smi not found"

# Set CMAKE_ARGS to enable CUDA build - ExecuTorch will handle PyTorch installation automatically
export CMAKE_ARGS="-DEXECUTORCH_BUILD_CUDA=ON"

echo "=== Starting ExecuTorch Installation ==="
# Install ExecuTorch with CUDA support with timeout and error handling
timeout 5400 ./install_executorch.sh || {
local exit_code=$?
echo "ERROR: install_executorch.sh failed with exit code: $exit_code"
if [ $exit_code -eq 124 ]; then
echo "ERROR: Installation timed out after 90 minutes"
fi
exit $exit_code
}

echo "SUCCESS: ExecuTorch CUDA build completed"

# Verify the installation
echo "=== Verifying ExecuTorch CUDA Installation ==="

# Test that ExecuTorch was built successfully
python -c "
import executorch
print('SUCCESS: ExecuTorch imported successfully')
"

# Test CUDA availability and show details
python -c "
try:
import torch
print('INFO: PyTorch version:', torch.__version__)
print('INFO: CUDA available:', torch.cuda.is_available())

if torch.cuda.is_available():
print('SUCCESS: CUDA is available for ExecuTorch')
print('INFO: CUDA version:', torch.version.cuda)
print('INFO: GPU device count:', torch.cuda.device_count())
print('INFO: Current GPU device:', torch.cuda.current_device())
print('INFO: GPU device name:', torch.cuda.get_device_name())

# Test basic CUDA tensor operation
device = torch.device('cuda')
x = torch.randn(10, 10).to(device)
y = torch.randn(10, 10).to(device)
z = torch.mm(x, y)
print('SUCCESS: CUDA tensor operation completed on device:', z.device)
print('INFO: Result tensor shape:', z.shape)

print('SUCCESS: ExecuTorch CUDA integration verified')
else:
print('WARNING: CUDA not detected, but ExecuTorch built successfully')
exit(1)
except Exception as e:
print('ERROR: ExecuTorch CUDA test failed:', e)
exit(1)
"

echo "SUCCESS: ExecuTorch CUDA ${cuda_version} build and verification completed successfully"
}

# Main execution
echo "Current working directory: $(pwd)"
echo "Directory contents:"
ls -la

# Run the CUDA build test
test_executorch_cuda_build "${CUDA_VERSION}"
35 changes: 28 additions & 7 deletions .ci/scripts/test_backend_linux.sh → .ci/scripts/test_backend.sh
Original file line number Diff line number Diff line change
@@ -1,6 +1,7 @@
#!/usr/bin/env bash
# Copyright (c) Meta Platforms, Inc. and affiliates.
# All rights reserved.
# Copyright 2025 Arm Limited and/or its affiliates.
#
# This source code is licensed under the BSD-style license found in the
# LICENSE file in the root directory of this source tree.
Expand All @@ -10,16 +11,26 @@ SUITE=$1
FLOW=$2
ARTIFACT_DIR=$3

REPORT_FILE="$ARTIFACT_DIR/test-report-$FLOW-$SUITE.csv"
REPORT_FILE="$ARTIFACT_DIR/test-report-$FLOW-$SUITE.json"

echo "Running backend test job for suite $SUITE, flow $FLOW."
echo "Saving job artifacts to $ARTIFACT_DIR."

# The generic Linux job chooses to use base env, not the one setup by the image
eval "$(conda shell.bash hook)"
CONDA_ENV=$(conda env list --json | jq -r ".envs | .[-1]")
conda activate "${CONDA_ENV}"

if [[ "$(uname)" == "Darwin" ]]; then
bash .ci/scripts/setup-conda.sh
eval "$(conda shell.bash hook)"
CONDA_RUN_CMD="${CONDA_RUN} --no-capture-output"
${CONDA_RUN_CMD} pip install awscli==1.37.21
IS_MACOS=1
else
CONDA_RUN_CMD=""
IS_MACOS=0
fi

export PYTHON_EXECUTABLE=python

# CMake options to use, in addition to the defaults.
Expand Down Expand Up @@ -48,13 +59,23 @@ fi
if [[ "$FLOW" == *arm* ]]; then
# Setup ARM deps.
.ci/scripts/setup-arm-baremetal-tools.sh
source examples/arm/ethos-u-scratch/setup_path.sh

if [[ "$FLOW" == *ethos_u* ]]; then
# Prepare a test runner binary that can run on the Corstone-3x0 FVPs
backends/arm/scripts/build_executorch.sh
backends/arm/test/setup_testing.sh
fi
fi

# We need the runner to test the built library.
PYTHON_EXECUTABLE=python CMAKE_ARGS="$EXTRA_BUILD_ARGS" .ci/scripts/setup-linux.sh --build-tool cmake --build-mode Release --editable true
if [[ $IS_MACOS -eq 1 ]]; then
SETUP_SCRIPT=.ci/scripts/setup-macos.sh
else
SETUP_SCRIPT=.ci/scripts/setup-linux.sh
fi
CMAKE_ARGS="$EXTRA_BUILD_ARGS" ${CONDA_RUN_CMD} $SETUP_SCRIPT --build-tool cmake --build-mode Release --editable true

EXIT_CODE=0
python -m executorch.backends.test.suite.runner $SUITE --flow $FLOW --report "$REPORT_FILE" || EXIT_CODE=$?

${CONDA_RUN_CMD} pytest -c /dev/nul -n auto backends/test/suite/$SUITE/ -m flow_$FLOW --json-report --json-report-file="$REPORT_FILE" || EXIT_CODE=$?
# Generate markdown summary.
python -m executorch.backends.test.suite.generate_markdown_summary "$REPORT_FILE" > ${GITHUB_STEP_SUMMARY:-"step_summary.md"} --exit-code $EXIT_CODE
${CONDA_RUN_CMD} python -m executorch.backends.test.suite.generate_markdown_summary_json "$REPORT_FILE" > ${GITHUB_STEP_SUMMARY:-"step_summary.md"} --exit-code $EXIT_CODE
30 changes: 0 additions & 30 deletions .ci/scripts/test_backend_macos.sh

This file was deleted.

Loading
Loading