Gleb Kanterov's starred repositories
system-design-primer
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
interactive-coding-challenges
120+ interactive Python coding interview challenges (algorithms and data structures). Includes Anki flashcards.
error-prone
Catch common Java mistakes as compile-time errors
DataFixerUpper
A set of utilities designed for incremental building, merging and optimization of data transformations.
queryparser
Parsing and analysis of Vertica, Hive, and Presto SQL.
pyspark-ai
English SDK for Apache Spark
ananas-desktop
A hackable data integration & analysis tool to enable non technical users to edit data processing jobs and visualise data on demand.
freelancing-in-sweden
The ultimate resource for becoming a freelancer in Sweden 🇸🇪 👨💻
nix-example
a way to develop software with Nix
zetasketch
A collection of libraries for single-pass, distributed, sublinear-space approximate aggregation and sketching algorithms. Currently: HyperLogLog++; more to come.
missinglink
Build time tool for detecting link problems in java projects
bigquery-data-lineage
Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.
icicle-ambiata
A streaming query language.
rust-shardio
Out-of-memory sorting of large datasets map / reduce style processing
avro-fastserde
Fast Apache Avro serialization/deserialization library
flytekit-java
Java/Scala library for easily authoring Flyte tasks and workflows