ThereOHM / playlist_scraper

Python online webradio playlist scraper

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Playlist Scraper

Small Python Tool, to webscrape webradio playlists.

Idea: There are plenty of good webradio stations. But I want their songs in a spotify, deezer, etc. playlist without commercials. Unfortunately the playlist of the most stations is resettet every couple of hours. The Python tool scrapes the songs, makes a proper data formatting, removes double values and saves an unique playlist per station. This playlist is then ready to import to your favourite music-streaming-service.

How To Use:

  • add the URLs of your favourite webradio stations to url_list.txt
  • I recommend using Onlineradiobox.com to get the playlists, because they have the total list of each day. Example: https://onlineradiobox.com/de/starfmhell/playlist/1?cs=de.starfmhell
  • run the main.py
  • the program will
    • scrape the playlist data of all stations
    • adds all the new songs to a playlist file, named by the station
    • remove duplicates and sort the songs alphabetical
    • save the playlist files
    • write some statistics to the stat.txt (for instance, how many new songs were added each day)

Comment: Keep an eye on the stats ;-)

It's really astonishing, that after about 5 Days the playlist of the most radio stations will be freezed to a few hundred songs. So after this 5 days, they just keep on playing the same stuff every day over and over again. ** Happy Groundhog Day :D**

About

Python online webradio playlist scraper

License:MIT License


Languages

Language:Python 100.0%