Skip to content
View ibranova's full-sized avatar
🎯
Focusing
🎯
Focusing
  • The Marcy Lab School
  • Brooklyn, New York, NY 11232
  • 03:26 (UTC -04:00)
  • LinkedIn in/ibranova

Block or report ibranova

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ibranova/README.md

Hey there, I'm Ibrahima Diallo

Screenshot 2026-01-09 at 11 11 04β€―PM

LinkedIn GitHub Email


I’m a Data Analyst driven by curiosity, resilience, and a strong commitment to continuous learning. I grew up in Guinea, West Africa, where access to technology was very limited for me growing up, but I was so facined about Inovations. After moving to the U.S., I taught myself English, adapted to a new culture quickly, and intentionally built the skills and mindset needed to work in tech.

I am currently completing an intensive Data Analytics training program at The Marcy Lab School, where I developed hands-on experience with data analysis, problem-solving, and critical thinking using tools like Python and SQL. What defines me most is my discipline, growth mindset, curiosity, and desire to use data to find solutions that have real-world impact.

This GitHub is where I document my learning journey, share projects, and continue growing one step at a time.

About Me

I'm a Data Analyst based in New York City with a passion for transforming complex datasets into actionable business insights. Currently completing 1,800+ hours of immersive training in data analysis, statistics, machine learning, and visualization at The Marcy Lab School.

πŸ† 2nd Place Winner β€” MTA Datathon 2025 (out of 180+ teams)

Technologies and Tools

Languages & Data

pandas logo scipy logo matplotlib logo plotly logo plotly logo

Python Libraries & Frameworks

plotly logo plotly logo scikit-learn logo

Visualization & BI Tools

Version Control & Cloud

plotly logo plotly logo plotly logo plotly logo

Productivity/Project Management & Tools

plotly logo plotly logo plotly logo plotly logo seaborn logo plotly logo


πŸš€ Featured Projects

πŸ“Š Business Intelligence & Analytics

Built an interactive Power BI dashboard to identify churn drivers for a telecom company.

Key Insights:

  • 27% overall churn rate identified
  • 45% of churn attributed to competition
  • California flagged with 60%+ churn rate
  • 72% of customers without plans churned

Power BI DAX Power Query

Analyzed guest satisfaction and revenue patterns using star-schema database design and RFM analysis.

Key Insights:

  • CA/NY markets show 3x higher CLV
  • Repeat guests spend 40% more
  • Recommended 25% Monday staffing increase

SQLite Python Matplotlib SQL CTEs

πŸ€– Machine Learning & Predictive Analytics

Built a classification model to predict Killed or Seriously Injured (KSI) outcomes from 342K+ motor vehicle collisions.

Results:

  • 58% Recall (vs. 0% baseline)
  • 0.60 ROC-AUC score
  • Deployed Streamlit web app for analysts

Python Scikit-learn XGBoost Streamlit

Applied statistical testing to uncover ridership patterns for product, operations, and marketing teams.

Statistical Tests:

  • Welch's t-test: p=0.00004 (working vs. non-working days)
  • ANOVA Fβ‰ˆ50, p<0.001 (seasonal differences)
  • Simulated A/B test design

Python Pandas Statistical Testing Matplotlib

πŸ—ΊοΈ Geospatial & Public Policy Analytics

Analyzed 3M+ bus violation records to uncover spatial and socio-economic patterns in NYC transit enforcement.

Impact:

  • πŸ₯ˆ 2nd place out of 180+ teams
  • Spatial analysis with MODZCTA boundaries
  • ANOVA revealed poverty-violation correlation
  • 37.6% repeat offender rate identified

Python SQL Tableau Geospatial Analysis

Analyzed Wi-Fi kiosk usage across 4,000+ kiosks to measure digital equity post-5G rollout.

Key Metrics:

  • 100% network uptime post-5G
  • 78% Week-1 retention rate
  • Equity-focused outer borough expansion
  • RFM segmentation for ad targeting

Python Plotly Folium Cohort Analysis


🎯 What I'm Currently Working On

  • πŸ”¬ Expanding my machine learning toolkit with deep learning fundamentals
  • πŸ“Š Building more interactive dashboards with Power BI and Tableau
  • 🌐 Contributing to open-source data projects
  • πŸ“š Preparing for junior data analyst roles
  • πŸŽ“ Preparing to get certified in:
    • AWS Cloud Practitioner, expected - March 2026
    • PL-300: Microsoft Power BI, expected - April 2026
    • Databricks Data Engineer Associate, expected - May 2026

πŸ“« Let's Connect!

I'm always open to discussing data projects, collaboration opportunities, or just chatting about the latest in analytics!

LinkedIn GitHub Email


Profile Views

Pinned Loading

  1. Customer_Churn-analysis-Power-BI Customer_Churn-analysis-Power-BI Public

  2. Theme-Park-Analytics Theme-Park-Analytics Public

    Jupyter Notebook

  3. Mod4_Project-Riding-the-Demand-Insights-for-a-Bike-Share-PM Mod4_Project-Riding-the-Demand-Insights-for-a-Bike-Share-PM Public

    Jupyter Notebook

  4. Mod5-Project-LinkNYC-Engagement-Analysis Mod5-Project-LinkNYC-Engagement-Analysis Public

    Analyzing LinkNYC Wi-Fi kiosk usage data to uncover engagement, retention, and equity trends that support smarter ad placements and digital inclusion across New York City.

    Jupyter Notebook 1

  5. MTA-Datathon-2025 MTA-Datathon-2025 Public

    Forked from MHC-Datathon/Nova-Alpha

    For Datathon 2025: An analysis of MTA's Bus Automated Camera Enforcement violations and its effect on NYC communities.

    Jupyter Notebook

  6. kabbosultan/NYC-Traffic-Collision-Modeling kabbosultan/NYC-Traffic-Collision-Modeling Public

    Jupyter Notebook