VLR Course Assignments

This repository contains completed assignments for the Visual Learning and Recognition (VLR) course, covering various computer vision and deep learning topics.

Overview

The repository contains three comprehensive assignments:

Assignment 1: Object Detection using FCOS (Fully Convolutional One-Stage Object Detection)
- Implementation of the FCOS detector with FPN (Feature Pyramid Network)
- Tasks include feature extraction, box regression, and centerness prediction
- Code for training and testing the detector on the PASCAL VOC dataset
Assignment 2: Generative Models
- GAN implementation with different loss functions (Original GAN, LSGAN, WGAN-GP)
- VAE implementation with different latent space dimensions and beta values
- Diffusion models with DDPM and DDIM sampling techniques
Assignment 3: Transformers for Vision
- Transformer for image captioning using the COCO dataset
- Vision Transformer (ViT) implementation for image classification

Additional Information

Please refer to the individual README files in each assignment directory for:

Detailed setup instructions and dependencies
Dataset preparation and requirements
Implementation details
Training and evaluation procedures

Name		Name	Last commit message	Last commit date
Latest commit History 70 Commits
hw1		hw1
hw2		hw2
hw3		hw3
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VLR Course Assignments

Overview

Additional Information

About

Uh oh!

Releases

Packages

Languages

akameswa/VisualLearningRecognition

Folders and files

Latest commit

History

Repository files navigation

VLR Course Assignments

Overview

Additional Information

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages