Alessandro Solimando's repositories
logmap-conservativity
LogMap extension for conservativity principle
calcite-examples
Examples and experimentation around Apache Calcite
kafkasparkdruid
Example of reading from a Kafka topic via Spark Streaming and writing into Druid via Tranquility library
xqueryprojector
XQuery query processing optimization based on XML projection
async-profiler
Sampling CPU and HEAP profiler for Java featuring AsyncGetCallTrace + perf_events
calcite-avatica
Mirror of Apache Calcite - Avatica
hive
Apache Hive
trap2017spark
Analysis of TRAP2017 dataset using Spark
beam
Apache Beam is a unified programming model for Batch and Streaming
dremio-oss
Dremio - the missing link in modern data
hive-benchmark
Automated TPC-DS and TPC-H benchmark for Apache Hive LLAP
KLLdeserializer
Utility to deserialize and print KLL data sketches from their binary representation
ksql
The database purpose-built for stream processing applications.
mlflow
Open source platform for the machine learning lifecycle
particlesimulator
A simple particle simulator
random-datagen
A generator of Random Data to HDFS, HBase, Hive, Kafka, Kudu, Ozone, SolR in CDP (Cloudera Data Platform)
streaming-examples
Streaming Frameworks Examples
tez
Apache Tez
tpcds-kit
TPC-DS benchmark kit with some modifications/fixes
trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)