I'm working on automatic differentiation at the level of compute expressions, and I would like to share some progress and hear any comments. Currently the automatic differentiation works well enough for some operations to make it possible to train a simple model; here is a tutorial on how to do this. However, for many operations the performance is still unacceptable; I'm working on it.
My implementation mostly follows this paper. In this notebook I describe how my implementation works internally and give a list of operations that are known to work or not to work. Basically, the AD consists of two parts (sketched in code after the list below):
- The automatic differentiation itself, which differentiates expressions according to the well-known rules and produces correct but inefficient expressions. The code is here.
- A set of transformations that optimize the resulting inefficient expressions. The code is here.
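
To make the two phases concrete, here is a small sketch of the first one (my own illustration written with TVM's te API, not code from the branch). For a matmul `C[i, j] = sum_k A[i, k] * B[k, j]`, rule-based differentiation of `C` with respect to `A` yields an adjoint that sums over the full `(i, j)` domain and relies on a `cond ? val : 0` term to select the single contributing element:

```python
import tvm
from tvm import te

n = 32
A = te.placeholder((n, n), name="A")
B = te.placeholder((n, n), name="B")
H = te.placeholder((n, n), name="H")  # head gradient dL/dC

# Forward computation: C[i, j] = sum_k A[i, k] * B[k, j]
k = te.reduce_axis((0, n), name="k")
C = te.compute((n, n), lambda i, j: te.sum(A[i, k] * B[k, j], axis=k), name="C")

# Illustrative naive adjoint dL/dA, written out by hand the way plain
# rule-based differentiation would produce it (not the branch's actual output):
# dA[m, l] = sum_{i, j} (i == m ? H[i, j] * B[l, j] : 0)
ri = te.reduce_axis((0, n), name="ri")
rj = te.reduce_axis((0, n), name="rj")
dA_naive = te.compute(
    (n, n),
    lambda m, l: te.sum(
        tvm.tir.if_then_else(
            tvm.tir.EQ(ri.var, m),               # Kronecker delta from the chain rule
            H[ri, rj] * B[l, rj],
            tvm.tir.FloatImm("float32", 0.0),    # everything else contributes zero
        ),
        axis=[ri, rj],
    ),
    name="dA_naive",
)
```

Most of the work in this naive adjoint is summing zeros, which is what the second phase is meant to remove.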
All transformations work on the level of compute expressions (before scheduling). Their general goal is to eliminate summation over zeros by moving conditional expressions of the form `cond ? val : 0` upwards and then using them to simplify the iteration domains of reductions. Hopefully, once they are powerful enough, these transformations will be useful for other tasks besides AD. Currently the main problem is that they don't understand modular arithmetic, which is needed for differentiating dilated and strided convolutions and for the flattening operation.
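
As an illustration of what the simplification aims for (again my own sketch, not the actual output of the pass), the condition `ri == m` in the naive matmul adjoint above can be hoisted out of the sum and used to shrink the reduction domain, collapsing the summation over `ri` to the single index `m`:

```python
import tvm
from tvm import te

n = 32
B = te.placeholder((n, n), name="B")
H = te.placeholder((n, n), name="H")  # head gradient dL/dC

# Simplified adjoint after eliminating the summation over zeros:
# dA[m, l] = sum_j H[m, j] * B[l, j]
rj = te.reduce_axis((0, n), name="rj")
dA_simplified = te.compute(
    (n, n),
    lambda m, l: te.sum(H[m, rj] * B[l, rj], axis=rj),
    name="dA_simplified",
)
```

This is just the familiar matmul gradient `dA = H * B^T`, recovered purely by using the hoisted condition to reduce the iteration domain.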
- The git branch
- The squashed commit
- The tutorial on training a simple model
- The notebook describing some internals