Aaronsapa/data-crunching-python-class

data-crunching-python-class

Project to exemplify simple ETL techniques with Python 3 and pandas.

Requirements

  • Homebrew
  • PostgreSQL server
    • brew install postgres
  • Python 3
    • brew install python3
  • Jupyter Notebook
  • Python packages (install from a notebook cell)
    • !pip3 install psycopg2-binary
    • !pip3 install nltk
    • !pip3 install wordcloud
  • NLTK data (run in Python after importing nltk)
    • nltk.download('stopwords')
    • nltk.download('wordnet')
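
Before the class, it can help to confirm that the Python packages above are importable from the interpreter Jupyter uses. The following is a minimal sketch, not part of the original project: the helper name check_requirements is hypothetical, and pandas is included only because the project description mentions it. It relies on the standard library's importlib.util.find_spec, which returns None when a top-level module cannot be found.

```python
from importlib.util import find_spec

# Import names for the packages in the requirements list.
# Assumption: pandas is added here because the project description mentions it;
# psycopg2-binary is imported under the name "psycopg2".
REQUIRED_MODULES = ["pandas", "psycopg2", "nltk", "wordcloud"]


def check_requirements(modules=REQUIRED_MODULES):
    """Return a dict mapping each module name to True if it can be located."""
    return {name: find_spec(name) is not None for name in modules}


if __name__ == "__main__":
    # Print one status line per required module.
    for name, available in check_requirements().items():
        print(f"{name}: {'ok' if available else 'MISSING'}")
```

Any module reported as MISSING can be installed with the matching pip3 command from the list above. The check only covers Python packages; the NLTK data sets still need to be downloaded with nltk.download.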

Contents

  1. Personal Intro
  2. Introduction 😴
    1. Data Crunching
    2. Understanding Raw Data
    3. Types of Data
    4. DBMS
    5. Hadoop Ecosystem 🐘
    6. Google Services 💰
    7. AWS Services
  3. Hands-On 🐍
    1. News NLP
    2. Twitter Stream
  4. Open-form questions

Structure

10:00am - 1:00pm CT (3 hours total 🕔).

  • 10:00am - 10:45am: Content
  • 10:45am - 11:00am: Break
  • 11:00am - 11:45am: Content
  • 11:45am - 12:00pm: Break
  • 12:00pm - 12:45pm: Content
