[Release] v0.15.0 Release Candidate Notes

# Introduction

The TVM community has worked since the v0.14.0 release to deliver the following new exciting improvements! The main tags are below (**bold text is with lots of progress**):

- Community, RFCs
- Adreno, ArmComputeLibrary, Metal, cuda & cutlass & tensorrt, micoNPU, Runtime
- **Frontend & Relay**
- Arith, **TOPI**, **TIR**, TVMScript
- Docs, CI, **Misc**, **BugFix**

Please visit the full listing of commits for a complete view: [v0.14.0...v0.15.0](https://github.com/apache/tvm/compare/v0.14.0...v0.15.0).

### Community
 * [#16172](https://github.com/apache/tvm/pull/16172) - Yixin Dong -> Reviewer
 * [#16162](https://github.com/apache/tvm/pull/16162) - Shuai Yuan -> Committer
 * [#16164](https://github.com/apache/tvm/pull/16164) - Qiang Zhang -> Committer
 * [#16166](https://github.com/apache/tvm/pull/16166) - Bohan Hou -> PMC
 * [#16165](https://github.com/apache/tvm/pull/16165) - Ruihang Lai -> PMC

### RFCs

 * [#105](https://github.com/apache/tvm-rfcs/pull/105) - Add a new backend language——SYCL

----

### Adreno
 * [#15991](https://github.com/apache/tvm/pull/15991) - [CI] Enhancements to Adreno specific CI utils
 * [#15786](https://github.com/apache/tvm/pull/15786) - [TOPI] Add conv2d transpose nchw texture schedule

### Arith
 * [#16227](https://github.com/apache/tvm/pull/16227) - Simplify nested if_then_else when constant is appearing in then_expr

### ArmComputeLibrary
 * [#15990](https://github.com/apache/tvm/pull/15990) - [ACL] Update Compute Library to v23.08

### Metal
 * [#16192](https://github.com/apache/tvm/pull/16192) - [Device] Fix metal warp size
 * [#16033](https://github.com/apache/tvm/pull/16033) - [Codegen] Disable cross-function call in Metal codegen

### cuda & cutlass & tensorrt
 * [#16061](https://github.com/apache/tvm/pull/16061) - [CUDA] Add an option for profiling cuda kernels

### micoNPU
 * [#16003](https://github.com/apache/tvm/pull/16003) - [microNPU][ETHOSU] Fix ConcatRewriter args processing
 * [#15929](https://github.com/apache/tvm/pull/15929) - [microNPU][ETHOSU] Fix rounding mode in requantize operation

### Runtime
 * [#15896](https://github.com/apache/tvm/pull/15896) - [CLML] Fix for CLML ops and enable more test case
 * [#16133](https://github.com/apache/tvm/pull/16133) - Parallel-for with threading backend
 * [#16066](https://github.com/apache/tvm/pull/16066) - Support clear global memory allocators
 * [#16030](https://github.com/apache/tvm/pull/16030) - Introduce `TVM_MODULE_VTABLE` Macros

### BugFix
 * [#16269](https://github.com/apache/tvm/pull/16269) - Update pillow usage
 * [#16272](https://github.com/apache/tvm/pull/16272) - Fixed Inappropriate Logical Expression
 * [#16216](https://github.com/apache/tvm/pull/16216) - [TIR] Fix dynamic smem merge leaf alloc
 * [#16190](https://github.com/apache/tvm/pull/16190) - Fix the error of reloading the model library on the ROCm platform: "MIOpen Error: No invoker was registered for convolution forward.”
 * [#16167](https://github.com/apache/tvm/pull/16167) - [Relay][Pytorch] Fix missing `.dtype`
 * [#16091](https://github.com/apache/tvm/pull/16091) - [Fix] Fix `topi.rms_norm` with float32 upscale
 * [#16081](https://github.com/apache/tvm/pull/16081) - [Fix] Broken Windows Build with LLVM
 * [#16051](https://github.com/apache/tvm/pull/16051) - [Fix][TIR] Fix dtype issues for match_buffer and ramp node
 * [#14655](https://github.com/apache/tvm/pull/14655) - [VTA] Fix FSIM compile error on macOS
 * [#16021](https://github.com/apache/tvm/pull/16021) - [FFI] Typo fix of IncRef to DecRef
 * [#16010](https://github.com/apache/tvm/pull/16010) - [Fix][TIR] fix mul dtype mismatch
 * [#16000](https://github.com/apache/tvm/pull/16000) - [Fix][TIR] fix symbolic strides lower
 * [#15970](https://github.com/apache/tvm/pull/15970) - [Hotfix] Mark python-FFI handling with TVM_DLL
 * [#15965](https://github.com/apache/tvm/pull/15965) - [CI] Better to pass the build folder

### CI
 * [#16110](https://github.com/apache/tvm/pull/16110) - Refactor unittest folder
 * [#16055](https://github.com/apache/tvm/pull/16055) - Fix broken links about Jenkins
 * [#16062](https://github.com/apache/tvm/pull/16062) - Use LLVM 17 for tests on `ci_arm`
 * [#16018](https://github.com/apache/tvm/pull/16018) - [Tests] Fix work_dir location used by test_micro_tuning_with_meta_schedule
 * [#16019](https://github.com/apache/tvm/pull/16019) - [Tests] Check int8+int32 testcases in test_estimate_peak_flops_cpu
 * [#16017](https://github.com/apache/tvm/pull/16017) - [Tests] Fix str vs. int comparison in test_num_threads

### Docs
 * [#16282](https://github.com/apache/tvm/pull/16282) - [Doc] Fix minor error in doc (Add an operator to Relay)
 * [#16152](https://github.com/apache/tvm/pull/16152) - [DOC] Add v0.14.0 docs to site
 * [#16127](https://github.com/apache/tvm/pull/16127) - Revert "[#15157][Rust][Doc] Re-enable the Rust documentation build (#15213)"
 * [#16097](https://github.com/apache/tvm/pull/16097) - Add missing backtick to contribute/code_guide.rst
 * [#16089](https://github.com/apache/tvm/pull/16089) - Fix error on linting by adding `--rev` argument
 * [#16024](https://github.com/apache/tvm/pull/16024) - Update release_process.rst about version number modification

### Frontend & Relay
 * [#16243](https://github.com/apache/tvm/pull/16243) - [TFLite] Add support for quantized mirror pad
 * [#15914](https://github.com/apache/tvm/pull/15914) - [TFLite]Support quantized SQUARE
 * [#16159](https://github.com/apache/tvm/pull/16159) - [KERAS] Fix bug concat convert for NCHW
 * [#16319](https://github.com/apache/tvm/pull/16319) - [Torch] add aten:broadcast_to
 * [#16131](https://github.com/apache/tvm/pull/16131) - [Pytorch] Add support for `aten::unflatten`
 * [#16105](https://github.com/apache/tvm/pull/16105) - [Pytorch] Add support for `aten::bitwise_and`
 * [#16079](https://github.com/apache/tvm/pull/16079) - [Pytorch] Add support for aten::swapaxes operator
 * [#15502](https://github.com/apache/tvm/pull/15502) - [Pytorch] aten::copy_ support for pytorch
 * [#16180](https://github.com/apache/tvm/pull/16180) - [Pytorch] Fix bug when converting models with torch.nn.ParameterList
 * [#16143](https://github.com/apache/tvm/pull/16143) - [Pytorch] Add support for `aten::scaled_dot_product_attention`
 * [#16123](https://github.com/apache/tvm/pull/16123) - [Pytorch] Add support for `aten::linalg_vector_norm`
 * [#16171](https://github.com/apache/tvm/pull/16171) - [Frontend] Preserve Pytorch Span Names
 * [#16217](https://github.com/apache/tvm/pull/16217) - [Frontend][QNN] fix access `param_debug_name_map` to node output name in fx-quantized graph node replacement
 * [#16199](https://github.com/apache/tvm/pull/16199) - [Frontend] Add support for aten::concat
 * [#16151](https://github.com/apache/tvm/pull/16151) - conv3d depthwise bug fix
 * [#15928](https://github.com/apache/tvm/pull/15928) - Expose qnn ops directly from relay.qnn module

### TOPI
 * [#16259](https://github.com/apache/tvm/pull/16259) - Add support for group_conv3d_transpose_ncdhw for generic
 * [#16052](https://github.com/apache/tvm/pull/16052) - Enhance `topi.nn.matmul`
 * [#16080](https://github.com/apache/tvm/pull/16080) - Reduce code redundancy in conv2d weights transformation
 * [#16248](https://github.com/apache/tvm/pull/16248) - [TOPI] Add support for group_conv1d_transpose_ncw for generic
 * [#16106](https://github.com/apache/tvm/pull/16106) - [TOPI] Add conv2d NHWC hybrid schedule for `arm_cpu`

### TIR 
 * [#16239](https://github.com/apache/tvm/pull/16239) - [Schedule] TileWithTensorIntrin skip incorrect ComputeInline for input-padding
 * [#16236](https://github.com/apache/tvm/pull/16236) - ConvertSSA process entry func first
 * [#16070](https://github.com/apache/tvm/pull/16070) - [Transform] Introduce new `InjectPermutedLayout` pass
 * [#16083](https://github.com/apache/tvm/pull/16083) - Enhance Python Type Annotations for TIR Expr
 * [#16073](https://github.com/apache/tvm/pull/16073) - Support more mma intrinsics and `get_mma_intrin_group` utility
 * [#16076](https://github.com/apache/tvm/pull/16076) - Enhance Python Type Annotations for TIR stmt
 * [#16074](https://github.com/apache/tvm/pull/16074) - Fix the thread binding iter_var dtype in `Bind` primitive
 * [#16063](https://github.com/apache/tvm/pull/16063) - Fix pass RenewDefs error in gather/take case
 * [#16027](https://github.com/apache/tvm/pull/16027) - Fix software pipeline with dynamic loop extent

### TVMScript
 * [#16271](https://github.com/apache/tvm/pull/16271) - Disable concise scoping when the scope stmt is explicitly annotated
 * [#16041](https://github.com/apache/tvm/pull/16041) - Fix mismatched dtype of IterVar in `T.thread_binding`
 * [#15953](https://github.com/apache/tvm/pull/15953) - [TIR] Pretty print TIR LLVM function name
 * [#15972](https://github.com/apache/tvm/pull/15972) - delete print extra info at parsing

### Misc
 * [#16279](https://github.com/apache/tvm/pull/16279) - replace deprecated np.int with int to avoid crash
 * [#16262](https://github.com/apache/tvm/pull/16262) - Update conv2d.py
 * [#16255](https://github.com/apache/tvm/pull/16255) - [Support] Add Interrupt Handling in Pipe
 * [#16104](https://github.com/apache/tvm/pull/16104) - [LoopPartition] Fix a bug of LoopPartition in single point scenarioes
 * [#16231](https://github.com/apache/tvm/pull/16231) - [Target] Add Jetson AGX Orin tags
 * [#16221](https://github.com/apache/tvm/pull/16221) - remove deprecated np.int in slice converter (pytorch)
 * [#16214](https://github.com/apache/tvm/pull/16214) - [Python] Fix setup.py for inplace build
 * [#16174](https://github.com/apache/tvm/pull/16174) - Bump cryptography from 37.0.2 to 41.0.6 in /docker/python
 * [#16202](https://github.com/apache/tvm/pull/16202) - Fix IRModule initialization with attrs
 * [#16176](https://github.com/apache/tvm/pull/16176) - Enable ccache to accelerate contrib compilation
 * [#15968](https://github.com/apache/tvm/pull/15968) - Add missing backtick
 * [#16034](https://github.com/apache/tvm/pull/16034) - [Packaging] Include BYOC dynamic libraries into wheel
 * [#16087](https://github.com/apache/tvm/pull/16087) - Add _ffi_api.py under script folder
 * [#16039](https://github.com/apache/tvm/pull/16039) - [Target] Support obtain l2 cache size from target
 * [#16065](https://github.com/apache/tvm/pull/16065) - [Pylint] fix pylint issues from test_random to test_tedd
 * [#16031](https://github.com/apache/tvm/pull/16031) - [TRT] fix outdated module building method in tensorrt
 * [#16032](https://github.com/apache/tvm/pull/16032) - [CMake] Use llvm-config to locate Findzstd.cmake
 * [#16023](https://github.com/apache/tvm/pull/16023) - [Pylint] fix pylint issues for thrust&tflite_runtime&util
 * [#15998](https://github.com/apache/tvm/pull/15998) - [Codegen] Add shuffle for cuda and metal
 * [#16015](https://github.com/apache/tvm/pull/16015) - [Pylint] fix pylint issues for cblas
 * [#15955](https://github.com/apache/tvm/pull/15955) - [FFI][Python] Handle error propagation when line number is missing
 * [#15982](https://github.com/apache/tvm/pull/15982) - Bump werkzeug from 2.2.3 to 3.0.1 in /apps/microtvm
 * [#15966](https://github.com/apache/tvm/pull/15966) - [CMake] Fix order of GNUInstallDirs module
 * [#15952](https://github.com/apache/tvm/pull/15952) - Update ci_arm Docker tag
 * [#15940](https://github.com/apache/tvm/pull/15940) - [Minor] Fix compilation warnings for clang
 * [#15947](https://github.com/apache/tvm/pull/15947) - Bump urllib3 from 1.26.9 to 1.26.18 in /docker/python
 * [#15835](https://github.com/apache/tvm/pull/15835) - [CodeGenC][Redo] Handle GlobalVar callee as internal function call
 * [#15945](https://github.com/apache/tvm/pull/15945) - Bump urllib3 from 1.26.15 to 1.26.18 in /apps/microtvm

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Release] v0.15.0 Release Candidate Notes #16391

Introduction

Community

RFCs

Adreno

Arith

ArmComputeLibrary

Metal

cuda & cutlass & tensorrt

micoNPU

Runtime

BugFix

CI

Docs

Frontend & Relay

TOPI

TIR

TVMScript

Misc

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Release] v0.15.0 Release Candidate Notes #16391

Description

Introduction

Community

RFCs

Adreno

Arith

ArmComputeLibrary

Metal

cuda & cutlass & tensorrt

micoNPU

Runtime

BugFix

CI

Docs

Frontend & Relay

TOPI

TIR

TVMScript

Misc

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions