RdEl00 / Spotify-API-ETL-Pipeline

Spotify ETL Project

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Spotify ETL Project

In this project, I built a simple data pipeline to extract my tracks record using the spotify api and save everyday listened songs into a local sqlite db.

I achieved this project by following Karolina's data engineering for beginners' course. I highly recommend checking her Youtube channel.

Motivation

I wanted to create a simple ETL which would help develop my Data engineering skills. Spotify API was a good place to explore this.

Architecture


Dashboard

Final output from Google Data Studio. Link here.

Google officially rebranded its Data Studio product to be named Looker Studio in October 2022

2 days listening history from records.sqlite

Setting up the Project

  • Clone the repository using
git clone https://github.com/RdEl00Spotify-API-ETL-Pipeline.git
  • Open the terminal/cmd and navigate to the project folder.
cd Spotify-API-ETL-Pipeline
  • Install the requirments.txt using
pip install -r requirments.txt
  • Open the script.py file, add your spotify username & the token.
USER_ID = ""
TOKEN = ""

Usage

  • Now run the script.py using
python script.py

Congrats! your 24h tracks dataset should now be created :)

About

Spotify ETL Project


Languages

Language:Python 100.0%