This repository contains the implementation of our IJCAI 2019 paper Learning to Interpret Satellite Images Using Wikipedia. This work performs large-scale pre-training on satellite images using weak supervision from geolocated Wikipedia articles. The proposed approach reduces sample complexity and uncertainty in downstream tasks involving satellite images while also improving accuracy.
arxiv https://arxiv.org/pdf/1905.02506.pdf
IJCAI2019 https://www.ijcai.org/Proceedings/2019/0502.pdf
Prerequisites: Please install the dependencies in the environment.yml file as:
conda env create -f environment.yml
You can find how to process geolocated articles using Doc2Vec in this repository: https://github.com/ermongroup/WikipediaPovertyMapping
For each Wikipedia article, we learn a 300-D textual embedding and save it in a NumPy array file. This file can be treated as a rich summary of the corresponding satellite image. We then train a CNN to predict these summaries, learning robust, satellite-specific features that are highly useful for transfer learning involving satellite images.
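As a minimal sketch of the storage layout described above (the filename is illustrative, not the repository's actual naming scheme): each article's 300-D embedding is saved as its own .npy file, which the training CSV later references under "Embedding Location".

```python
import os
import tempfile

import numpy as np

# Hypothetical example: one 300-D Doc2Vec embedding per geolocated article,
# saved as an individual .npy file for the training CSV to point to.
emb = np.random.default_rng(0).standard_normal(300).astype(np.float32)
path = os.path.join(tempfile.mkdtemp(), "article_0001.npy")
np.save(path, emb)
loaded = np.load(path)  # the CNN is trained to regress this 300-D target
```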
After processing the articles with Doc2Vec, we form a CSV file for both the training and validation steps. The CSV files are formatted as follows:
Embedding Location, Image Location, Size of the Image
directory, directory, integer
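A minimal sketch of parsing rows in this format with the standard library (the paths and values are illustrative, not the repository's actual data loader):

```python
import csv
import io

# Two hypothetical rows in the format above:
# embedding location, image location, size of the image in pixels.
csv_text = "emb/0001.npy,img/0001.jpg,1000\nemb/0002.npy,img/0002.jpg,500\n"
records = [
    (emb_path, img_path, int(size))
    for emb_path, img_path, size in csv.reader(io.StringIO(csv_text))
]
```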
Our dataset consists of 800k images of 1000x1000 pixels. Additionally, we perform data augmentation by cropping the central 250x250, 500x500, 750x750, and 1000x1000 pixel areas, resulting in 3200k images. Finally, we save the CSV files into the data directory.
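The central-crop augmentation can be sketched as follows (a NumPy stand-in assuming HxWxC arrays; the repository's actual pipeline may differ):

```python
import numpy as np

def center_crop(img, size):
    """Return the central size x size window of an (H, W, C) array."""
    h, w = img.shape[:2]
    top, left = (h - size) // 2, (w - size) // 2
    return img[top:top + size, left:left + size]

# One 1000x1000 source image yields four centered views.
image = np.zeros((1000, 1000, 3), dtype=np.uint8)
crops = [center_crop(image, s) for s in (250, 500, 750, 1000)]
```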
To train the CNN, use the following command:
python im2text_matching.py --lr 1e-4 --cv_dir {path} --batch_size 128 --train_csv {path} --val_csv {path}
If we initialize the weights by pre-training on ImageNet, the training step takes only 2 epochs; initializing the weights randomly increases this to 15 epochs. Our current code uses the model pre-trained on ImageNet.
After training for 2 epochs, the cosine loss drops from ~1 to ~0.35. We save the checkpoints and perform transfer learning experiments.
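The cosine loss reported above can be sketched numerically as follows (a NumPy stand-in, not a reproduction of the exact implementation in im2text_matching.py):

```python
import numpy as np

def cosine_loss(pred, target, eps=1e-8):
    """1 - cosine similarity between predicted and target embeddings."""
    pred = pred / (np.linalg.norm(pred, axis=-1, keepdims=True) + eps)
    target = target / (np.linalg.norm(target, axis=-1, keepdims=True) + eps)
    return 1.0 - np.sum(pred * target, axis=-1)
```

Identical vectors give a loss near 0, orthogonal vectors give 1, so the drop from ~1 to ~0.35 indicates the CNN's outputs are increasingly aligned with the Doc2Vec embeddings.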
KERAS Model Pre-trained on WikiSatNet https://drive.google.com/open?id=1Q69nGbhXFYoeJlgge-UPTZRVP0oL7Uy8
You can use this model for downstream tasks that involve analyzing satellite images. It should provide a significant boost, especially for small datasets in the target task.
fMoW Dataset https://github.com/fMoW/dataset
As a downstream task, we use the fMoW dataset, which consists of high-resolution satellite images similar to our pre-training dataset. It includes about 350k training images, together with 50k validation and 50k test images. We pre-process the images using the guidelines provided in the dataset's repository. As in the pre-training step, we form a CSV file in the following format:
Class Label, Image Location
Once we have created the CSV files for the training and validation steps, we save them into the data directory.
Using a DenseNet161 model pre-trained on ImageNet, we achieve 68.7% classification accuracy on the temporal views. In contrast, we achieve 73.1% classification accuracy using the model pre-trained on WikiSatNet with image-to-text matching. More importantly, when the number of training samples in the target task is reduced to 10k labels, the model pre-trained on WikiSatNet outperforms ImageNet pre-training by 10%.
To perform transfer learning on the fMoW dataset, you can use the following commands:
python transfer_learning.py --lr 1e-4 --cv_dir {path} --batch_size 128 --load {path_to_checkpoints} --train_csv {path} --val_csv {path}
To cite our paper:
@article{uzkent2019learning,
title={Learning to Interpret Satellite Images in Global Scale Using Wikipedia},
author={Uzkent, Burak and Sheehan, Evan and Meng, Chenlin and Tang, Zhongyi and Burke, Marshall and Lobell, David and Ermon, Stefano},
journal={arXiv preprint arXiv:1905.02506},
year={2019}
}
