Spotify ETL Project
In this project, I built a simple data pipeline to extract my tracks record using the spotify api and save everyday listened songs into a local sqlite db.
I achieved this project by following Karolina's data engineering for beginners' course. I highly recommend checking her Youtube channel.
Motivation
I wanted to create a simple ETL which would help develop my Data engineering skills. Spotify API was a good place to explore this.
Architecture
- Extract data using Spotify API
- Load data into Sqlite
Dashboard
Final output from Google Data Studio. Link here.
Google officially rebranded its Data Studio product to be named Looker Studio in October 2022
2 days listening history from
records.sqlite
Setting up the Project
- Clone the repository using
git clone https://github.com/RdEl00Spotify-API-ETL-Pipeline.git
- Open the terminal/cmd and navigate to the project folder.
cd Spotify-API-ETL-Pipeline
- Install the requirments.txt using
pip install -r requirments.txt
- Open the script.py file, add your spotify username & the token.
USER_ID = ""
TOKEN = ""
Usage
- Now run the
script.py
using
python script.py
Congrats! your 24h tracks dataset should now be created :)