Kunstenpunt / buitenlandse_concerten_grabber

Grabbing gigs from several concert aggregators

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Re/framing the international: Have Love, Will Travel

Kunstenpunt (internationally known as Flanders Arts Institute) investigates how Belgian live music acts operate internationally. To gather the data, we developed a process to aggregate concert information from online gig finder platforms, i.e. Songkick, setlist.fm, Bands in Town en Facebook (events). The process incorporates human-assisted cleaning algorithms and duplicate recognition. In addition, we keep a channel open to report concert info manually, and there has been an effort to do "one off" imports of podiuminfo.nl, festivalinfo.nl, and the now defunct Arts Flanders agenda (not yet implemented).

Aggregation flow

Automatic flow

Information about artists, including their identity on Songkick, Bands in Town, Setlist.fm and Facebook, is collected via the open database musicbrainz.org. Based on the identify information for the gig finder platforms in Musicbrainz, we can use the APIs of the gig finder platforms to gather the concert data.

Data cleaning

The gathered data needs to be cleaned:

  • within platform

    • we have to consider artists with the same name whose concerts are reported as coming from the same artist
    • we have to handle artists that perform under several names
    • infer cancellations of concerts
  • across platforms

    • we have to deduplicate concerts that are announced on several platforms
    • we have to harmonize venue names, place names, city names and country names

We are powerless against faulty reported data, but we do a best effort to clean up errors automatically. Nonetheless, there is the possibility to inform Flanders Arts Institute about errors. We have built in mechanisms to be able to allow for manually correcting the data.

Further information

April 2017

Juni 2017

Juli 2017

Augustus 2017

September 2017

Oktober 2017

November 2017

December 2017

About

Grabbing gigs from several concert aggregators

License:Apache License 2.0


Languages

Language:Python 100.0%