fhildeb/ba-big-data

ba-big-data

BA course on Big Data at the University of Applied Sciences Mittweida (HSMW), held in 2019.

NOTE: Some filenames may be in German.

Course Contents

The course provided a foundation in data analysis and predictive modeling, covered the core methodologies and best practices of data mining, and taught statistical techniques for uncovering patterns in large datasets. It also covered a range of machine learning algorithms, classification techniques, and predictive models, along with methods for building data-driven recommendations. Dimensionality reduction techniques and their use in optimizing data processing were part of the course as well. The core topics are listed below.

  • Introduction & Motivation
  • CRISP-DM Process Model
  • ROC Analyses
  • Bayesian Classifiers
  • K-Nearest-Neighbor Classification
  • Decision Trees
  • Support Vector Machine (SVM)
  • Neural Networks
  • Recommendation Engines
  • Cluster and Association Analysis
  • Principal Component/Factor Analysis
  • Eigenvector Decomposition
  • Singular Value Decomposition (SVD)
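
As an illustration of one topic from the list above, ROC analysis, the following is a minimal Python sketch that builds an ROC curve by sweeping the decision threshold and computes the area under it with the trapezoidal rule. It is not taken from the course materials, which used RapidMiner. The function names `roc_points` and `auc` and the toy labels and scores are assumptions made up for this example.

```python
def roc_points(labels, scores):
    """Return (FPR, TPR) points obtained by sweeping the decision threshold.

    labels: 1 for the positive class, 0 for the negative class.
    scores: classifier confidence for the positive class.
    """
    positives = sum(labels)
    negatives = len(labels) - positives
    # Rank examples from highest to lowest score.
    ranked = sorted(zip(scores, labels), key=lambda pair: -pair[0])
    points = [(0.0, 0.0)]
    tp = fp = 0
    for i, (score, label) in enumerate(ranked):
        if label == 1:
            tp += 1
        else:
            fp += 1
        # Emit a point only after the last example of a tied score group.
        if i + 1 == len(ranked) or ranked[i + 1][0] != score:
            points.append((fp / negatives, tp / positives))
    return points


def auc(points):
    """Area under the curve via the trapezoidal rule."""
    return sum(
        (x1 - x0) * (y0 + y1) / 2
        for (x0, y0), (x1, y1) in zip(points, points[1:])
    )


# Hypothetical toy data, for illustration only.
labels = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]

curve = roc_points(labels, scores)
print(curve)
print(auc(curve))
```

An AUC of 1.0 corresponds to a perfect ranking of positives above negatives, while 0.5 corresponds to random ordering.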

Repository Contents

The course included ten practical units, five of which involved hands-on analytics:

  1. Linear Regression
    • MSE, RMSE, Visualization
    • Selection, Weighting, Modeling
  2. K-Nearest Neighbor
    • Modeling, Selection
    • Optimization, Data Splitting
  3. Data Preprocessing
  4. Naive Bayes Calculation
  5. ROC Performance Measurement
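
The error measures named in the first unit, MSE and RMSE, can be sketched in a few lines of Python. This is a hedged illustration only: the course exercises were done in RapidMiner, and the helper names `fit_line`, `mse`, and `rmse` as well as the toy data are assumptions introduced here, not part of the repository.

```python
import math


def fit_line(xs, ys):
    """Ordinary least squares fit for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def mse(y_true, y_pred):
    """Mean squared error: average of the squared residuals."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean squared error: same unit as the target variable."""
    return math.sqrt(mse(y_true, y_pred))


# Hypothetical toy data, for illustration only.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 4.0, 5.0, 8.0]

slope, intercept = fit_line(xs, ys)
predictions = [slope * x + intercept for x in xs]
print(slope, intercept)
print(mse(ys, predictions), rmse(ys, predictions))
```

RMSE is often easier to interpret than MSE because it is expressed in the same unit as the predicted value.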

Tools

RapidMiner
