ruidbras/PySpark-ML-Notebooks

PySpark-ML-Notebooks

Welcome to this repo!

Here, you will find the exercises from the Udemy course Spark and Python for Big Data with PySpark.

Linear Regression on Cruise Ships Dataset

Jupyter Notebook: Application of Linear Regression to predict the number of crew members a ship will need

Logistic Regression Classifier on Customer Churn Dataset

Jupyter Notebook: Application of a Binary Classifier to predict customer churn for a marketing agency

Random Forest Classifier on Dog Food Dataset

Jupyter Notebook: Application of a Random Forest Classifier to identify, through feature importance, the chemical with the most impact on dog food spoilage

K-Means Clustering on Hacking Attacks Dataset

Jupyter Notebook: Application of K-Means clustering to discover how many hackers carried out a set of attacks on a large technology firm
