Robert Schmidtke's repositories
collectl-slurm
Scripts to run collectl on Slurm
flink-slurm
Scripts to run Flink Standalone on Slurm
aws-data-wrangler
Pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatch Logs, DynamoDB, EMR, Secrets Manager, PostgreSQL, MySQL, SQL Server and S3 (Parquet, CSV, JSON and Excel).
flatbuffers
Memory Efficient Serialization Library
moto
A library that allows you to easily mock out tests based on AWS infrastructure.
pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
cashews
Cache with async power
flink
Mirror of Apache Flink
flink-benchmarks
Variety of benchmarks, mostly comparing Flink on HDFS vs. XtreemFS
flink-kafka-skeleton
Blank project that correctly bundles Kafka dependencies in a Flink fat jar.
hadoop
Mirror of Apache Hadoop
hdfs-statistics-adapter
Wrapper around HDFS collecting statistics.
multichain
Source code for multichaind, multichain-cli and multichain-util.
peel
Peel is a framework that helps you to define, execute, analyze, and share experiments for distributed systems and algorithms.
protobuf
Protocol Buffers - Google's data interchange format
python-holidays
Generate and work with holidays in Python
spark
Mirror of Apache Spark
streaming-benchmarks
Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ...
xtreemfs
Distributed Fault-Tolerant File System