Skip to content

abhi788741/olympics-data-analysis-Python

Repository files navigation

🏅 Summer Olympics Medal Analysis Project (1976–2008) -

🔍 Project Overview

This project performs a complete analysis of Summer Olympic Games medal data spanning 28 Olympic Games from Montreal 1976 to Beijing 2008. The analysis answers critical questions such as:

  • Which countries dominate medal tallies?
  • How does gender distribution look across events?
  • Which sports generate the most medals?
  • Who are the top-performing athletes?
  • How have medal counts evolved over the years?

📁 Project Structure

  • 📦 Summer-Olympics-Analysis/
  • ├── 📓 data_cleaning.ipynb
  • ├── 📓 exploratory_data_analysis.ipynb
  • ├── 📁 data
  • ├── 📄 Summer-Olympic-medals-1976-to-2008.csv
  • ├── 📊 cleaned_olympics_data.csv
  • └── 📘 README.md

🛠 Technologies Used

  • Python 3.13+
  • Pandas
  • NumPy
  • Matplotlib
  • Seaborn
  • Statistical analysis
  • Jupyter Notebook
  • EDA

💡 Key Insights

  • The USA's dominance in Aquatics (578 records) and Athletics (299) makes up a massive portion of their total medal count. Australia is a surprising #2 in Aquatics — consistent with their strong swimming culture.
  • ~61% of athletes participating in the Olympics are male, with female participation at ~39%.
  • The United States leads in the number of unique athletes sent to the Olympics across all editions.
  • Host cities with larger event counts naturally accumulated more total medals, with later Games (Sydney, Athens, Beijing) showing growth in overall medal opportunities.
  • Medal counts have generally increased over the years, reflecting the expansion of Olympic disciplines and events.
  • Medal records grew from 1,305 in 1976 to 2,042 in 2008 — a 56.5% increase over 32 years, driven by the continuous addition of new sports and disciplines to the Olympic programme.
  • Male athletes won nearly 60% more total medals than female athletes — mirroring the athlete participation ratio (61:39). Female athletes show a notably even spread across medal types, suggesting strong competitive depth.

🚀 Future Scope

  • 🤖 ML Medal Prediction Model — predict which country wins the most medals given year/sport

About

An end-to-end data analysis of 32 years of Summer Olympic Games (1976–2008), exploring 15,315 medal records across 127 countries and 11,337 athletes through rigorous data cleaning and exploratory analysis. Uncovers key insights on gender participation trends, country dominance, sport-wise medal distribution.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors