GitHub - Toton642/DataExplorer

TrainingData.csv The provided (given) training dataset containing bitstreams and corresponding class labels.
main.py The main Python program where I did data preprocessing, trained the machine learning model RandomForest(Best Performing model), performed cross-validation, and saved the trained model for later use purposes. This also included a function to predict on an independent test dataset.
Documentation.pdf The documentation file that included the comparative analysis of the different models (GradientBoosting, XGBoost, and RandomForest) based on their cross-validation and validation performance and choosing the best performing model.
best_trained_model.joblib The trained RandomForest classifier model, which I identified as the best-performing model based on validation accuracy. This model is capable to be loaded and used for making predictions on new, unseen data.

Provide feedback

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Documentation.pdf		Documentation.pdf
README.md		README.md
TrainingData.csv		TrainingData.csv
best_trained_model.joblib		best_trained_model.joblib
main.py		main.py