A. Galán's repositories
nyTaxiEventAggregatorStreaming
Spark Streaming app that collects NY City taxi trips from Kafka queue, save raw data into HDFS/Parquet and generate OLAP Cubes within Cassandra. On the other hand, there is a benchmark to compare queries in HDFS vs OLAP Cubes
Cassandra_zeppelin_NY_Taxi_Trips
Cassandra modelling and CQL statements to storage denormalized NY City taxi trips events
nytaxieventaggregator_structured
Structured Streaming Spark app that collects data from Kafka queue, save raw data into HDFS/parquet and denormalized data ordered by dimension within Cassandra
DataSciencePythonCoursera
Course assignments Introduction to Data Science in Python
EventAggregationSparkRDD
Event Aggregator Spark RDD
EventSimulator
Event simulator. It takes data from csv file and injects row per row in a Kafka queue every 50ms
hadoop-mr-java-analisisLogs
Native Java MapReduce project. Linux Systems log analytics
HuffmanCode
Huffman Codification Scala