Get CitiBike real-time data using Spark Streaming
To run the project: sbt run
myReceiver.scala: define custom receiver class for Spark Streaming
Parsing.scala: define functions parsing JSON formats
bikeJob.scala: main funciton to run Spark Streaming job every 20s
To loop machine learning model: watch -n 20 ./run_model.sh
files in pyscript folder are for etl code and build machine learning models
just open index.html and it's our web application.