A curated list of vision-transformer-related resources. Feel free to open a pull request or an issue to add papers.

| Title | Venue | BibTeX |
|---|---|---|
| A Survey on Visual Transformer | ArXiv | Bib |
| Intriguing Properties of Vision Transformers | ArXiv | Code |
| CVPR 2021 Vision Transformer Papers (43 papers) | GitHub | -- |

| Task | Reg | Det | Seg | Trk | Other |
|---|---|---|---|---|---|
| Explanation | Image Recognition | Object Detection | Image Segmentation | Object Tracking | Other types |

You can add a new tag for any domain that contains several transformer-based works.

(Please list entries in reverse chronological order.)

| Title | Venue | Task | Code | BibTeX |
|---|---|---|---|---|
| End-to-End Video Instance Segmentation with Transformers | ArXiv | Seg | -- | -- |
| Training data-efficient image transformers & distillation through attention | ArXiv | Reg | GitHub | Bib |
| An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale | ICLR | Reg | GitHub | Bib |
| Toward Transformer-Based Object Detection | ArXiv | Det | --- | Bib |
| Rethinking Transformer-based Set Prediction for Object Detection | ArXiv | Det | --- | Bib |
| UP-DETR: Unsupervised Pre-training for Object Detection with Transformers | ArXiv | Det | --- | Bib |
| Deformable DETR: Deformable Transformers for End-to-End Object Detection | ArXiv | Det | GitHub | Bib |
| End-to-End Object Detection with Transformers | ECCV | Det | GitHub | Bib |
| Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers | ArXiv | Seg | GitHub | Bib |
| MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers | ArXiv | Seg | --- | Bib |
| TransTrack: Multiple-Object Tracking with Transformer | ArXiv | Trk | GitHub | Bib |

| Title | Venue | Task | Code | BibTeX |
|---|---|---|---|---|
| Attention Is All You Need | NeurIPS'17 | -- | GitHub | Bib |