Leonhard Spiegelberg's repositories
tuplex-public
Tuplex is a parallel big data processing framework that runs data science pipelines written in Python at the speed of compiled code. Tuplex has similar Python APIs to Apache Spark or Dask, but rather than invoking the Python interpreter, Tuplex generates optimized LLVM bytecode for the given pipeline and input data set.
CRIUForJava
Prototype showing how to use CRIU with Java
dissertation
Matteo Riondato's PhD dissertation
grizzly-prototype
Grizzly: Efficient Stream Processing Through Adaptive Query Compilation
JitFromScratch
Example project from my talks at the LLVM Social Berlin and C++ User Group
tpch-spark
TPC-H queries in Apache Spark SQL using the native DataFrames API