Computer Vision Model Implementations in PyTorch

This repo contains PyTorch implementations of multiple computer vision models, each with a full explanation. Every model comes with a tutorial covering each of its operations.

Models by task category:

Image Classification

ResNet
Vision Transformer
Swin Transformer

2D Object Detection

SSD
CenterNet
DETR (todo)
RT-DETR (todo)
Deformable Convolutional Networks (todo)
Deformable-DETR (todo)

2D Semantic Segmentation

DeepLabV3
DeepLabV3+
SegFormer (todo)
Segmenter (todo)
OCRNet (todo)

3D Semantic Segmentation/Classification

PointNet
PointNet++
PointTransformer (todo)

3D Object Detection

VoteNet (todo)
BEVFormer (todo)
PointPillars (todo)
VoxelNet (todo)

GenAI

Diffusion Model

Denoising Diffusion Probabilistic Model (DDPM)
Denoising Diffusion Implicit Model (DDIM)
Classifier-Free Diffusion Guidance (CFDG)
Image Super-Resolution via Iterative Refinement (SR3) (todo)

Autoencoder

Variational Autoencoder (VAE)
Vector Quantized Variational Autoencoder (VQ-VAE)

Pixel CNN

PixelCNN
Gated PixelCNN

GAN

CGAN
DCGAN

tungyen/Deep_learning_CV
