Video Search Engine

Authors:

Semantically be able to search through a database of videos (using generated summaries)

System Overview

The system described here is the overview of the overall system archietecture.

Video Summarization Overview

Below is the initial architecture of the video summarization network used to generate video summaries.

Example output

Given a minute long video of traffic in Dhaka Bangladesh.

('a man riding a bike down a street next to a large truck .', 'a man riding a bike down a street next to a traffic light .', 'a green truck with a lot of cars on it', 'a green truck with a lot of cars on the road .', 'a city bus driving down a street next to a traffic light .')

Set Up

To set up the python code create a python3 environment with the following:

# create a virtual environment
$ python3 -m venv env

# activate environment
$ source env/bin/activate

# install all requirements
$ pip install -r requirements.txt

# install data files
$ python dataloader.py

If you add a new package you will have to update the requirements.txt with the following command:

# add new packages
$ pip freeze > requirements.txt

And if you want to deactivate the virtual environment

# decativate the virtual env
$ deactivate

Training Captioning Network

Caption Network Set up

python VideoSearchEngine/ImageCaptioningNoYolo/resize.py --image_dir data/coco/train2014/ 
python VideoSearchEngine/ImageCaptioningNoYolo/resize.py --image_dir data/coco/val2014/ --output_dir data/val_resized2014

Plan

Our project will, broadly defined, be attempting video searching through video summarization. To do this we propose the following objectives and resulting action plan:

Break videos down into semantically different groups of frames
Recognize objects in an image (i.e. a frame)
Convert a frame to text
Merge summaries of all frames of a video into one large overall summary
Build a search engine to query videos via summary.

Goals

For our project, we have come up with a basic goal we plan to reach by the time of the presentation, and a stretch goal we hope to reach if time permits

Basic Goal: We will recognize objects through the YOLO algorithm. Convert each frame to text using the algorithm mentioned in this paper. Come up with basic heuristic for skipping frames so not too much overlap in the summary. Surface all of this through a simple UI to search a video database.

Stretch Goal: Investigate other methods for reducing noise in frames (Generative Adversarial Networks), Investigate grouping together semantically similar frames to one common representation to make better summaries.

Name		Name	Last commit message	Last commit date
Latest commit History 316 Commits
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
.vscode		.vscode
VideoSearchEngine		VideoSearchEngine
conf		conf
data		data
figs		figs
saved/image_yolo		saved/image_yolo
.DS_Store		.DS_Store
.gitignore		.gitignore
Computer Vision Presentation.key		Computer Vision Presentation.key
LICENSE		LICENSE
README.md		README.md
VideoSearchEnginePoster.pdf		VideoSearchEnginePoster.pdf
VideoSearchEnginePoster.ppt		VideoSearchEnginePoster.ppt
downloader.py		downloader.py
requirements.txt		requirements.txt
workerStartup.sh		workerStartup.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Video Search Engine

System Overview

Video Summarization Overview

Example output

Set Up

Training Captioning Network

Caption Network Set up

Plan

Goals

Data Sets to Use

TaCos MulitModal Data Set

Common Object Data Set

Sum Me Data Set

MED Dataset

Citations

Papers

GitHubs

Blogs and Other Websites

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Video Search Engine

System Overview

Video Summarization Overview

Example output

Set Up

Training Captioning Network

Caption Network Set up

Plan

Goals

Data Sets to Use

TaCos MulitModal Data Set

Common Object Data Set

Sum Me Data Set

MED Dataset

Citations

Papers

GitHubs

Blogs and Other Websites

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages