Gleb Kanterov (kanterov)

kanterov

Geek Repo

Location:Stockholm, Sweden

Github PK Tool:Github PK Tool


Organizations
apache

Gleb Kanterov's starred repositories

system-design-primer

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Language:PythonLicense:NOASSERTIONStargazers:263395Issues:6620Issues:290

interactive-coding-challenges

120+ interactive Python coding interview challenges (algorithms and data structures). Includes Anki flashcards.

Language:PythonLicense:NOASSERTIONStargazers:29107Issues:962Issues:67

beam

Apache Beam is a unified programming model for Batch and Streaming data processing.

Language:JavaLicense:Apache-2.0Stargazers:7680Issues:261Issues:6829

error-prone

Catch common Java mistakes as compile-time errors

Language:JavaLicense:Apache-2.0Stargazers:6771Issues:162Issues:1644

flyte

Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

Language:GoLicense:Apache-2.0Stargazers:5186Issues:258Issues:3049

weld

High-performance runtime for data analytics applications

Language:RustLicense:BSD-3-ClauseStargazers:2990Issues:111Issues:168

kcp

Kubernetes-like control planes for form-factors and use-cases beyond Kubernetes and container workloads.

Language:GoLicense:Apache-2.0Stargazers:2279Issues:36Issues:910

zetasql

ZetaSQL - Analyzer Framework for SQL

Language:C++License:Apache-2.0Stargazers:2161Issues:61Issues:134

Monocle

Optics library for Scala

Language:ScalaLicense:MITStargazers:1637Issues:53Issues:381

DataFixerUpper

A set of utilities designed for incremental building, merging and optimization of data transformations.

Language:JavaLicense:MITStargazers:1161Issues:93Issues:24

queryparser

Parsing and analysis of Vertica, Hive, and Presto SQL.

Language:HaskellLicense:MITStargazers:1070Issues:58Issues:36

pixiedust

Python Helper library for Jupyter Notebooks

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1036Issues:42Issues:425

pyspark-ai

English SDK for Apache Spark

Language:PythonLicense:Apache-2.0Stargazers:828Issues:19Issues:46

magnolia

Easy, fast, transparent generic derivation of typeclass instances

Language:ScalaLicense:Apache-2.0Stargazers:754Issues:45Issues:202

crucible

Crucible is a library for symbolic simulation of imperative programs

ananas-desktop

A hackable data integration & analysis tool to enable non technical users to edit data processing jobs and visualise data on demand.

Language:JavaLicense:Apache-2.0Stargazers:576Issues:16Issues:70

iceberg

Iceberg is a table format for large, slow-moving tabular data

Language:JavaLicense:Apache-2.0Stargazers:468Issues:347Issues:68

freelancing-in-sweden

The ultimate resource for becoming a freelancer in Sweden 🇸🇪 👨‍💻

nix-example

a way to develop software with Nix

build

Build Systems à la Carte

Language:TeXLicense:MITStargazers:239Issues:11Issues:6

ground

An open-source, vendor-neutral data context service.

Language:JavaLicense:Apache-2.0Stargazers:159Issues:39Issues:46

zetasketch

A collection of libraries for single-pass, distributed, sublinear-space approximate aggregation and sketching algorithms. Currently: HyperLogLog++; more to come.

Language:JavaLicense:Apache-2.0Stargazers:146Issues:11Issues:9

missinglink

Build time tool for detecting link problems in java projects

Language:JavaLicense:Apache-2.0Stargazers:143Issues:96Issues:30

bigquery-data-lineage

Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.

Language:JavaLicense:Apache-2.0Stargazers:141Issues:19Issues:17

xenomorph

Scala library for free applicative schemas capable of parsing/rendering sums-of-products data structures.

Language:ScalaLicense:LGPL-3.0Stargazers:108Issues:12Issues:4

icicle-ambiata

A streaming query language.

Language:HaskellLicense:BSD-3-ClauseStargazers:57Issues:28Issues:154

qcert

Compilation and Verification of Data-Centric Languages

Language:CoqLicense:Apache-2.0Stargazers:55Issues:6Issues:106

rust-shardio

Out-of-memory sorting of large datasets map / reduce style processing

Language:RustLicense:MITStargazers:48Issues:15Issues:5

avro-fastserde

Fast Apache Avro serialization/deserialization library

Language:JavaLicense:Apache-2.0Stargazers:43Issues:14Issues:8

flytekit-java

Java/Scala library for easily authoring Flyte tasks and workflows

Language:JavaLicense:Apache-2.0Stargazers:42Issues:24Issues:0