bgulati / pipeline

Complete Pipeline Training at Big Data Scala By the Bay

pipeline

Join the chat at https://gitter.im/bythebay/pipeline

Pipeline Description

Dating ratings data => Akka app => Kafka => Spark Streaming => Cassandra => Dashboard

In addition, Spark MLlib and DataFrames will be demonstrated in a notebook interface, using a combination of the real-time Cassandra data and static Parquet data.
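
To make the stage order of the streaming path concrete, here is a minimal sketch in plain Scala with no external dependencies. It is not the training's actual code: each real component (Akka, Kafka, Spark Streaming, Cassandra, the dashboard) is replaced by an in-memory stand-in, and the `Rating` record shape, the `from,to,score` line format, and every name in `PipelineSketch` are assumptions made for illustration only.

```scala
import scala.collection.mutable
import scala.util.Try

// Hypothetical record shape; the README does not specify the ratings schema.
final case class Rating(fromUser: Int, toUser: Int, score: Int)

object PipelineSketch {
  // Stage 1 (stand-in for the Akka app): parse a raw "from,to,score" line.
  // Malformed lines are dropped by returning None.
  def parse(line: String): Option[Rating] =
    line.split(",").map(_.trim) match {
      case Array(f, t, s) => Try(Rating(f.toInt, t.toInt, s.toInt)).toOption
      case _              => None
    }

  // Stage 2 (stand-in for a Kafka topic): an in-memory FIFO queue.
  val topic: mutable.Queue[Rating] = mutable.Queue.empty

  // Stage 4 (stand-in for a Cassandra table): toUser -> (sum of scores, count).
  val table: mutable.Map[Int, (Int, Int)] = mutable.Map.empty

  // Stage 3 (stand-in for a Spark Streaming micro-batch): drain the queue
  // and fold each rating into the running per-user totals.
  def processBatch(): Unit = {
    val batch = topic.dequeueAll(_ => true)
    batch.foreach { r =>
      val (sum, n) = table.getOrElse(r.toUser, (0, 0))
      table.update(r.toUser, (sum + r.score, n + 1))
    }
  }

  // Stage 5 (stand-in for a dashboard query): average score per rated user.
  def averageFor(user: Int): Option[Double] =
    table.get(user).map { case (sum, n) => sum.toDouble / n }
}
```

A caller would parse incoming lines, append the results to `PipelineSketch.topic` with `+=`, call `processBatch()`, and then read aggregates through `averageFor`. In the real pipeline each stand-in is a separate networked service, so the stages run concurrently rather than in a single process.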

Follow the Wiki to continue exploring.


Languages

Shell 86.7%, Scala 13.3%