Data for use on Machine Learning Model to predict those who will listen The Beatles based on other artists. There are two data sets of importance:
- file_out_2495.csv a list of users who listened to at least 1 of the most 300 played artists. The columns are the play counts for each artists mentioned. The target is "Likes the Beatles"
- file_out_2495_tags.csv Same a above but with also a count of the genre distribution.
Instructions to re-generate w/ tags:
- Get a GCP Account and open the Jupyter notebook in Platform AI or DataLab
- Get a lastfm API account and edit the lastfm.conf
- Enable the free https://console.cloud.google.com/marketplace/details/metabrainz/listenbrainz database in BigQuery
- Run the code to get the data from BigQuery Data from BiqQuery.ipynb
- Run the code in Enrich_top_300.ipynb
- Run the code in listen_top_300.ipynb
The output file will be called file_out_2495_tags.csv
Questions: Brianhray@gmail.com