mjkaufer / spo-tfidf-y

:cd::arrow_upper_right: Converting songs to vectors, and using said vectors to analyze artist discography

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Spo-tfidf-y

What is it?

I am trying to convert Spotify songs to vectors, so I can cluster like songs and analyze artist discography

What's with the title?

tf-idf stands for term frequency inverse document frequency, and is the way I generate the vectors

The example I'm running my code on is Kanye's discography – I input his albums, and then I query spotify until I get playlists with multiple of his songs in them. This playlist is added to a matrix that I eventually use to define the vectors of songs.

This project will compare the clusters formed from the vectorized versions of Kanye's songs to the implied clusters defined by his albums.

About

:cd::arrow_upper_right: Converting songs to vectors, and using said vectors to analyze artist discography


Languages

Language:Jupyter Notebook 91.9%Language:HTML 4.6%Language:Python 3.5%