raw9k/README.md



Who Am I

name: Rounak Kumar
degree: Integrated MSc. Quantitative Economics & Data Science
university: BIT Mesra
role: Joint President — Data Science Society
focus: Production-grade ML systems, statistical modeling & MLOps
deploys_with: [Docker, GitHub Actions, Azure, Render, Vercel]
currently_learning:
  - AI Agents & Agentic Workflows
  - LLMOps & AIOps
motto: "Ship models, not just notebooks."


Tech Arsenal

💻 Languages

Python R SQL C++

🤖 ML / Data Science

Scikit-learn TensorFlow PyTorch Keras LightGBM CatBoost XGBoost Pandas Seaborn OpenCV

🔧 Backend & Databases

Flask FastAPI Streamlit MongoDB MySQL

☁️ MLOps & Cloud

Docker GitHub Actions Azure AWS MLflow Comet ML Kubernetes Jenkins Git Linux

📊 Data Analytics & BI

Power BI Snowflake


Featured Projects

🎌 Hybrid Anime Recommendation System

 

Scalable hybrid recommendation system processing 70M+ user–anime ratings, combining collaborative & content-based filtering with Keras embeddings and Azure Blob Storage ingestion.
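As a rough illustration of what "hybrid" can mean here, the sketch below blends a collaborative signal (similarity between learned user and item embedding vectors) with a content signal (genre overlap). It is a minimal sketch under stated assumptions, not the project's actual code: the function names, the Jaccard genre overlap, and the `alpha` weight are all hypothetical choices made for the example.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def hybrid_score(user_vec, item_vec, liked_genres, item_genres, alpha=0.6):
    """Weighted blend of a collaborative score and a content-based score.

    user_vec / item_vec: embedding vectors (assumed to come from a trained model).
    liked_genres / item_genres: sets of genre labels (hypothetical content features).
    alpha: hypothetical weight on the collaborative part, between 0 and 1.
    """
    collab = cosine(user_vec, item_vec)
    union = liked_genres | item_genres
    # Jaccard overlap stands in for whatever content similarity the project uses.
    content = len(liked_genres & item_genres) / len(union) if union else 0.0
    return alpha * collab + (1 - alpha) * content
```

Setting `alpha=1.0` reduces this to pure collaborative filtering and `alpha=0.0` to pure content-based ranking, which makes the weight easy to tune on a validation set.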

🛡️ Network Security System

 

Production-grade anomaly & phishing detection with MongoDB Atlas ingestion, schema validation, MLflow experiment tracking & real-time prediction via FastAPI.

🧠 Student Performance Predictor

 

End-to-end pipeline forecasting student scores (R² ≈ 0.87) with automated ingestion, quality checks, feature engineering & CI/CD to Azure Web App with drift-ready hooks.
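For readers unfamiliar with the metric quoted above, R² is the share of variance in the target explained by the model: one minus the ratio of the residual sum of squares to the total sum of squares. The helper below is a minimal sketch of that definition; the name `r_squared` is hypothetical and the project itself may rely on a library implementation instead.

```python
def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and the same length")
    mean_y = sum(y_true) / len(y_true)
    # Squared error of the model's predictions.
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    # Squared error of always predicting the mean (the baseline).
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    if ss_tot == 0:
        raise ValueError("R^2 is undefined when y_true is constant")
    return 1 - ss_res / ss_tot
```

A value near 0.87 therefore means the model removes roughly 87% of the squared error that a predict-the-mean baseline would make on the same data.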

🪪 UIDAI Hackathon 2026

End-to-end ETL pipeline consolidating ~44 lakh (~4.4 million) Aadhaar Enrolment & Update records (NIC split CSVs) into an analysis-ready source of truth, with geography standardization, deduplication, KPI engineering, and an interactive Power BI dashboard for district-level decision-making.
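The geography standardization and deduplication steps can be pictured with a small sketch like the one below. It is illustrative only: the field names (`state`, `district`, `date`), the alias table, and both helper functions are assumptions made for the example, not the repository's actual schema or code.

```python
def standardize_district(name, aliases):
    """Normalize whitespace and case, then map known spelling variants."""
    key = " ".join(name.strip().lower().split())
    # Fall back to title case when no alias is registered for this spelling.
    return aliases.get(key, key.title())


def deduplicate(records, key_fields):
    """Keep the first record seen for each combination of key_fields."""
    seen = set()
    unique = []
    for rec in records:
        key = tuple(rec[field] for field in key_fields)
        if key not in seen:
            seen.add(key)
            unique.append(rec)
    return unique


# Hypothetical rows as they might appear across split CSV files.
aliases = {"bangalore": "Bengaluru"}
rows = [
    {"state": "Karnataka", "district": " bangalore ", "date": "2025-01-01"},
    {"state": "Karnataka", "district": "Bengaluru", "date": "2025-01-01"},
]
for row in rows:
    row["district"] = standardize_district(row["district"], aliases)
clean = deduplicate(rows, ["state", "district", "date"])
```

Standardizing names before deduplicating matters: the two rows above only collapse into one once both district spellings resolve to the same value.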





Pinned

  1. UIDAI-hackathon-2026 Public

    Forked from apooorv19/UIDAI-hackathon-2026

    Official repository for Team UIDAI 4732. A comprehensive data analysis solution for the UIDAI Data Hackathon 2026.

    Jupyter Notebook 1

  2. hybrid-anime-recommender Public

    Built an end-to-end hybrid anime recommendation system that ingests and preprocesses millions of user-anime ratings from Azure Blob Storage, trains an embedding-based neural network with collaborat…

    Jupyter Notebook 1

  3. network-security-system Public

    The Network Security System is an end-to-end phishing detection pipeline built with FastAPI and modular ML components. It allows users to either: Upload a CSV file for batch phishing detection (res…

    Python 1

  4. students-performance-ml Public

    Interactive ML web app that predicts student math scores from academic and demographic data. Features a modular end-to-end pipeline, multiple regression models with hyperparameter tuning, and a Fla…

    Jupyter Notebook 1

  5. Algerian-Forest-Fire-Linear-Regression-Project Public

    This project uses machine learning to predict the Fire Weather Index (FWI) in Algerian forests based on meteorological and environmental features. Built with Flask, it provides a user-friendly web …

    Jupyter Notebook 1

  6. DSA-Practice Public

    A curated collection of Data Structures & Algorithms problems I’ve solved, with clear explanations and optimized solutions in Python. This repo serves as a personal archive for practice, revision, …

    Jupyter Notebook 1