Lauren McCarthy, Nicole Pierre, Harrison Kmiec
Topic: Olympic medalists for winter and summer games comparison with country population and GDP
- Olympic Games medal Dataset (from 1896 to 2018).csv (https://www.kaggle.com/rushikeshlavate/olympic-games-medal-datasetfrom-1896-to-2018)
- Data.World: Olympic Medal History dictionary.csv (https://data.world/sportsvizsunday/sports-viz-sundays-2018/workspace/file?filename=Olympic+Medal+History+dictionary.csv)
- Dropped null values
- Stripped additional formatting from CSV for Team(IOC Code)
- Renamed Team(IOC Code) column and changed case for column titles / primary key
- Separating country and country codes from Olympic Medal DataFrame
- Combined separated columns to Olympic Medal DataFrame and dropped duplicate column
- Re-ordered column names
- Connected to pgAdmin within Jupyter Notebook
- Loaded in both DataFrames
- Checked to ensure tables were loaded
- Joined tables on country in pgAdmin
As a team we were interested in sports and decided to look into databases surrounding the Olympic Games. From what was available we selected two databases that considered Olympic Game winners across the globe while also looking at country population and GDP. For further analysis it would be interesting to see if there is a correlation between population/GDP and countries with the most wins (gold, silver, bronze).