The project was motivated by the analysis of traffic accidents in New York. A second goal was to learn about Apache Kafka, Apache Spark, Apache Hadoop, Docker ...
Dataset: https://catalog.data.gov/dataset/motor-vehicle-collisions-vehicles (600MB)
Weather API: https://www.worldweatheronline.com/developer/