Jelmer Kuperus's repositories
GildedRose-Refactoring-Kata
Starting code for the GildedRose Refactoring Kata in many programming languages.
spark
Mirror of Apache Spark
spark-nlp
State of the Art Natural Language Processing
polynote
A better notebook for Scala (and more)
badge.py
badge generator
guide-to-spark-partitioning-notebooks
Working code for Guide to Spark Partitioning (https://www.amazon.com/dp/B08KJCT3XN/)
KaHIP
The graph partitioning framework KaHIP -- Karlsruhe High Quality Partitioning.
kafka
Mirror of Apache Kafka
luminary-api
Hacking Konrad's luminary trophy
HNSW.Net
C# library for approximate nearest-neighbor search using Hierarchical Navigable Small World graphs
luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization, and more. It also comes with built-in Hadoop support.
sbt-docker-compose
The sbt-docker-compose plugin provides a solution for running integration tests against Docker containers, with health-check support.
flink
Mirror of Apache Flink
hadoop
Mirror of Apache Hadoop
catranking
practice project
TarsosLSH
A Java library implementing a practical nearest-neighbour search algorithm for multidimensional vectors that operates in sublinear time. It implements locality-sensitive hashing (LSH) and multi-index hashing for Hamming space.