Skip to content

prasiyer/ML_Logistics

Repository files navigation

Installation

In order to run the notebooks in this repository, the following libraries have to be installed:

  1. Pandas 0.24.2
  2. Numpy 1.17.4
  3. fastai 1.0.59
  4. autogluon 0.0.3
  5. seaborn 0.9.0
  6. scikit-learn 0.21.2

Project Motivation

The purpose of this project is to show:

  1. Basic data preparation techniques prior to data exploration
  2. Commonly used but powerful chart types for gaining insights
  3. Results from exploring prominent features pertaining to logistics transit time
  4. Basic code for training and predicting transit time for shipments

File Descriptions

  1. TT_EDA.ipnyb: This notebook contains the data exploration steps and results
  2. TransitTime_Fastai.ipnyb: This notebook contains the steps for training and predicting using the fastai library. One of the key highlights is the ability of fastai library to create embeddings for categorical features
  3. TransitTime_AG.ipnyb: This notebook demonstrates the use of Autogluon library to speed up experimentation of training and prediction with popular machine learning algorithms
  4. ML_Logistics_Simple_Guide.pdf: This document summarizes the approach and results shown in the Python notebooks. This document can be viewed as a standalone starter guide for data exploration and machine learning application in logistics domain
  5. TransitTime_EDA.pdf: This document summarizes the main insights generated from the exploratory analysis of the dataset

Acknowledgements

Thanks to Python open source community for creating valuable libraries used in this project.
This project uses normalized dataset of truckload shipments

License

Apache license

About

EDA and application of ML algorithms in Logistics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors