Tommy Tran's starred repositories
iceberg-rust
Rust implementation of Apache Iceberg with integration for Datafusion
OpenLogReplicator
Open Source Oracle database CDC
rust-oracle
Oracle driver for Rust
karpenter-provider-aws
Karpenter is a Kubernetes Node Autoscaler built for flexibility, performance, and simplicity.
materialize
The Cloud Operational Data Store: use SQL to transform, deliver, and act on fast-changing data.
tidypolars
Tidy interface to polars
retentioneering-tools
Retentioneering: product analytics, data-driven CJM optimization, marketing analytics, web analytics, transaction analytics, graph visualization, process mining, and behavioral segmentation in Python. Predictive analytics over clickstream, AB tests, machine learning, and Markov Chain simulations.
go-fromjsonschema
[2017-2018, maintained, stable] generates Go type definitions (ready to `json.Unmarshal` into) from a JSON Schema definition (proper JSD, not just sample .json) file
dataframe-go
DataFrames for Go: For statistics, machine-learning, and data manipulation/exploration
pyarrow_ops
Convenient pyarrow operations following the Pandas API
datafusion-python
A Python library to run analytics workloads with the performance of Rust, the flexibility of Python and O(1) cost in moving data between the two. Uses Apache Arrow in-memory format and respective query engine DataFusion.
goodreads_etl_pipeline
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.