agaca / nyTaxiEventAggregatorStreaming

Spark Streaming app that collects New York City taxi trips from a Kafka queue, saves the raw data to HDFS as Parquet, and generates OLAP cubes in Cassandra. It also includes a benchmark that compares queries against the raw data in HDFS with queries against the OLAP cubes.
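
The cube-generation step can be illustrated with a minimal sketch in plain Scala collections. This is not the repository's code: the `TaxiTrip` fields, the `CubeCell` measures, the choice of two dimensions (zone and hour), and the `*` roll-up marker are all assumptions made for illustration. In a streaming job, the same aggregation would presumably run on each micro-batch read from Kafka, with the resulting cells written to Cassandra.

```scala
// Hypothetical trip record; field names are assumptions, not the repository's schema.
final case class TaxiTrip(pickupZone: String, pickupHour: Int, fare: Double)

// One cube cell: measures aggregated for one combination of dimension values.
final case class CubeCell(trips: Long, totalFare: Double) {
  def +(other: CubeCell): CubeCell =
    CubeCell(trips + other.trips, totalFare + other.totalFare)
}

object TaxiCubeSketch {
  // "*" marks a dimension that has been rolled up (aggregated over all its values).
  val All = "*"

  // Every trip contributes to four cells: (zone, hour), (zone, *), (*, hour), (*, *).
  def cellKeys(t: TaxiTrip): Seq[(String, String)] = {
    val hour = t.pickupHour.toString
    Seq((t.pickupZone, hour), (t.pickupZone, All), (All, hour), (All, All))
  }

  // Aggregate one batch of trips into cube cells keyed by (zone, hour).
  def buildCube(batch: Seq[TaxiTrip]): Map[(String, String), CubeCell] =
    batch
      .flatMap(t => cellKeys(t).map(k => k -> CubeCell(1L, t.fare)))
      .groupBy(_._1)
      .map { case (key, cells) => key -> cells.map(_._2).reduce(_ + _) }

  def main(args: Array[String]): Unit = {
    val batch = Seq(
      TaxiTrip("Midtown", 8, 12.5),
      TaxiTrip("Midtown", 8, 7.5),
      TaxiTrip("Harlem", 9, 10.0)
    )
    val cube = buildCube(batch)
    println(cube(("Midtown", "8"))) // cell for one zone and one hour
    println(cube((All, All)))       // grand total over both dimensions
  }
}
```

Precomputing the rolled-up cells is what makes cube lookups cheap compared with scanning raw Parquet files: a query such as "all trips in hour 8" becomes a single key lookup on `("*", "8")` rather than a full aggregation.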

This repository is not active

Languages

Scala 100.0%