KirosG / spotify_top200_dataset

Analysis of the Spotify Top 200 Daily Including Clustering Listening Behavior by Country

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Spotify Analysis - Top 200 Songs Played in 2017

Data Source

This data set came from Kaggle: Data Set Link.

Explanation of Files and Directories

  • resources - images for the Jupyter notebooks
  • app.py - code for a Plotly online data dashboard
  • peak_dates_world_songs.csv - data for the Plotly dashboard. This data is outputted by spotifyanalysis.ipynb
  • regions.json - data for mapping country to language - used in the data dashboard
  • roughdraft.ipynb - exploratory data analysis
  • spotifyanalysis.ipynb - cleaned and formatted exploratory data analysis
  • spotifys-worldwide-daily-song-ranking.zip - original data set
  • total_plays_by_date.csv - data for the Plotly dashboard. This data is outputted by spotifyanalysis.ipynb
  • world-countries.json - world map file

Key Findings

See spotifynalaysis.ipynb for more key findings and code.

Ed Sheeran Dominated the Year

Ed Sheeran on Top

Songs Tend to Peak When They First Come Out and there is often a Spike near the End of the Year

Peak Plays

Spanish Speaking Country Listening Habits Have a Clear Cluster Separate from the Rest of the World

Spanish Cluster

About

Analysis of the Spotify Top 200 Daily Including Clustering Listening Behavior by Country


Languages

Language:Jupyter Notebook 99.7%Language:Python 0.3%