VP-Spark This is the data processing pipeline used to create the topics and data at http://www.moodbox.ch. Feel free to comment :-)