A. Galán's repositories

nyTaxiEventAggregatorStreaming

Spark Streaming app that collects NY City taxi trips from Kafka queue, save raw data into HDFS/Parquet and generate OLAP Cubes within Cassandra. On the other hand, there is a benchmark to compare queries in HDFS vs OLAP Cubes

Language:ScalaStargazers:3Issues:1Issues:0

Cassandra_zeppelin_NY_Taxi_Trips

Cassandra modelling and CQL statements to storage denormalized NY City taxi trips events

nytaxieventaggregator_structured

Structured Streaming Spark app that collects data from Kafka queue, save raw data into HDFS/parquet and denormalized data ordered by dimension within Cassandra

Language:ScalaStargazers:1Issues:1Issues:0

DataSciencePythonCoursera

Course assignments Introduction to Data Science in Python

Language:Jupyter NotebookStargazers:0Issues:1Issues:0
Language:JavaScriptStargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:2Issues:0

EventAggregationSparkRDD

Event Aggregator Spark RDD

Language:ScalaStargazers:0Issues:1Issues:0

EventSimulator

Event simulator. It takes data from csv file and injects row per row in a Kafka queue every 50ms

Language:PythonStargazers:0Issues:0Issues:0

hadoop-mr-java-analisisLogs

Native Java MapReduce project. Linux Systems log analytics

Language:JavaStargazers:0Issues:1Issues:0

HuffmanCode

Huffman Codification Scala

Language:ScalaStargazers:0Issues:1Issues:0
Language:Jupyter NotebookStargazers:0Issues:1Issues:0