Simple analytics of video games using pandas + matplotlib
This dataset contains a list of video games with sales greater than 100,000 copies. It was generated by a scrape of vgchartz.com.
The script used to scrape the data is available at https://github.com/GregorUT/vgchartzScrape. It is written in Python and uses BeautifulSoup. There are 16,598 records. Two records were dropped due to incomplete information.
Environment setup
python -m venv videogames
videogames\Scripts\activate.bat
Required packages
🏷️ Pandas - https://pandas.pydata.org
🏷️ Numpy - https://numpy.org
🏷️ matplotlib - https://matplotlib.org
Dataset
🏷️ Kaggle - https://www.kaggle.com/gregorut/videogamesales
🧹Rank - Ranking of overall sales
🧹Name - The game's name
🧹Platform - Platform of the game's release (e.g. PC, PS4, etc.)
🧹Year - Year of the game's release
🧹Genre - Genre of the game
🧹Publisher - Publisher of the game
🧹NA_Sales - Sales in North America (in millions)
🧹EU_Sales - Sales in Europe (in millions)
🧹JP_Sales - Sales in Japan (in millions)
🧹Other_Sales - Sales in the rest of the world (in millions)
🧹Global_Sales - Total worldwide sales (in millions)
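
As a rough illustration of how the columns above might be used, here is a minimal pandas sketch that loads the data and totals sales per genre. The helper names `load_sales` and `sales_by_genre` are hypothetical, and the three sample rows are made up for the example (they are not real sales figures). It also assumes that some rows may have an empty Year, which it drops before converting the column to integers.

```python
import io

import pandas as pd

REGION_COLUMNS = ["NA_Sales", "EU_Sales", "JP_Sales", "Other_Sales"]

# Made-up rows with the same columns as the dataset; not real sales figures.
SAMPLE_CSV = """Rank,Name,Platform,Year,Genre,Publisher,NA_Sales,EU_Sales,JP_Sales,Other_Sales,Global_Sales
1,Game A,PC,2010,Action,Publisher X,1.0,0.5,0.25,0.25,2.0
2,Game B,PS4,2015,Sports,Publisher Y,0.5,0.5,0.0,0.0,1.0
3,Game C,PC,,Action,Publisher X,0.25,0.25,0.0,0.0,0.5
"""


def load_sales(source):
    """Read the CSV (path or file-like object) and drop rows with no release year."""
    df = pd.read_csv(source)
    # Assumption: a missing Year makes the row unusable for per-year analysis.
    return df.dropna(subset=["Year"]).astype({"Year": int})


def sales_by_genre(df):
    """Sum regional and global sales per genre, largest global total first."""
    columns = REGION_COLUMNS + ["Global_Sales"]
    return df.groupby("Genre")[columns].sum().sort_values("Global_Sales", ascending=False)


# With the real data, pass the path of the CSV downloaded from Kaggle instead.
games = load_sales(io.StringIO(SAMPLE_CSV))
totals = sales_by_genre(games)
print(totals)
```

With matplotlib installed, `totals["Global_Sales"].plot(kind="bar")` would draw the per-genre totals as a bar chart.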