Object-Detection-Algorithms

Pytorch implementation of seven state-of-the-art algorithms for object detection.

Introduction

Object recognition is a general term to describe a collection of related computer vision tasks that involve identifying objects in digital photographs.

Image classification involves predicting the class of one object in an image. Object localization refers to identifying the location of one or more objects in an image and drawing abounding box around their extent. Object detection combines these two tasks and localizes and classifies one or more objects in an image.

As such, we can distinguish between these three computer vision tasks:

Image Classification: Predict the type or class of an object in an image. - Input: An image with a single object, such as a photograph. - Output: A class label (e.g. one or more integers that are mapped to class labels).
Object Localization: Locate the presence of objects in an image and indicate their location with a bounding box. - Input: An image with one or more objects, such as a photograph. - Output: One or more bounding boxes (e.g. defined by a point, width, and height).
Object Detection: Locate the presence of objects with a bounding box and types or classes of the located objects in an image. - Input: An image with one or more objects, such as a photograph. - Output: One or more bounding boxes (e.g. defined by a point, width, and height), and a class label for each bounding box.

Pascal VOC Dataset

For Pascal VOC dataset, make the folder structure like this:

VOC_ROOT
|__ VOC2007
    |_ JPEGImages
    |_ Annotations
    |_ ImageSets
    |_ SegmentationClass
|__ VOC2012
    |_ JPEGImages
    |_ Annotations
    |_ ImageSets
    |_ SegmentationClass
|__ ...

Where VOC_ROOT default is datasets folder in current project.

Project Structure

.
├─ datasets/
│  ├─ VOC2007/               <- VOC2007 dataset folder
│  │  	└─...
│  ├─ VOC2012/               <- VOC2012 dataset folder
│  │  	└─...
│  ├─ VOC2007.sh             <- script to download VOC2007 dataset
│  └─ VOC2012.sh             <- script to download VOC2012 dataset
│
├─ CenterNet/                <- Implementation of CenterNet
│  └─ ...
├─ DETR/                     <- Implementation of DEtection TRansformer (DETR)
│  └─ ...
├─ Faster-RCNN/              <- Implementation of Faster-RCNN
│  └─ ...
├─ SSD/                      <- Implementation of Single Shot Detector (SSD)
│  └─ ...
├─ Yolo-v1/                  <- Implementation of YOLO-v1
│  └─ ...
├─ Yolo-v2/                  <- Implementation of YOLO-v2
│  └─ ...
├─ Yolo-v3/                  <- Implementation of YOLO-v3
│  └─ ...
│
└─ README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Object-Detection-Algorithms

Introduction

Pascal VOC Dataset

Project Structure

References

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
CenterNet		CenterNet
DETR		DETR
Faster-RCNN		Faster-RCNN
SSD		SSD
Yolo-v1		Yolo-v1
Yolo-v2		Yolo-v2
Yolo-v3		Yolo-v3
datasets		datasets
README.md		README.md
example.jpg		example.jpg
object_detection.png		object_detection.png

Ali-Sahili/Object-Detection-Algorithms

Folders and files

Latest commit

History

Repository files navigation

Object-Detection-Algorithms

Introduction

Pascal VOC Dataset

Project Structure

References

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages