Welcome to this repo!
Here, you will find the exercises from udemy's course Spark and Python for Big Data with PySpark.
Jupyter Notebook: Application of a Linear Regression Classifier to predict the number of crew members will be needed in ships
Jupyter Notebook: Application of a Binary Classifier to predict customer churn for a marketing agency
Jupyter Notebook: Application of a Random Forest Classifier to identify the chemical with most impact on spoiled dog food through feature importance
Jupyter Notebook: Application of K-Means clustering to discover how many hackers performed a set of attacks to a large technology firm