This project performs a complete analysis of Summer Olympic Games medal data spanning 28 Olympic Games from Montreal 1976 to Beijing 2008. The analysis answers critical questions such as:
- Which countries dominate medal tallies?
- How does gender distribution look across events?
- Which sports generate the most medals?
- Who are the top-performing athletes?
- How have medal counts evolved over the years?
- 📦 Summer-Olympics-Analysis/
- ├── 📓 data_cleaning.ipynb
- ├── 📓 exploratory_data_analysis.ipynb
- ├── 📁 data
- ├── 📄 Summer-Olympic-medals-1976-to-2008.csv
- ├── 📊 cleaned_olympics_data.csv
- └── 📘 README.md
- Python 3.13+
- Pandas
- NumPy
- Matplotlib
- Seaborn
- Statistical analysis
- Jupyter Notebook
- EDA
- The USA's dominance in Aquatics (578 records) and Athletics (299) makes up a massive portion of their total medal count. Australia is a surprising #2 in Aquatics — consistent with their strong swimming culture.
- ~61% of athletes participating in the Olympics are male, with female participation at ~39%.
- The United States leads in the number of unique athletes sent to the Olympics across all editions.
- Host cities with larger event counts naturally accumulated more total medals, with later Games (Sydney, Athens, Beijing) showing growth in overall medal opportunities.
- Medal counts have generally increased over the years, reflecting the expansion of Olympic disciplines and events.
- Medal records grew from 1,305 in 1976 to 2,042 in 2008 — a 56.5% increase over 32 years, driven by the continuous addition of new sports and disciplines to the Olympic programme.
- Male athletes won nearly 60% more total medals than female athletes — mirroring the athlete participation ratio (61:39). Female athletes show a notably even spread across medal types, suggesting strong competitive depth.
- 🤖 ML Medal Prediction Model — predict which country wins the most medals given year/sport