zeliu's repositories
kylin-storm-plugin
build kylin cube realtime with storm
arroyo
Distributed stream processing engine in Rust
bitsail
BitSail is a distributed high-performance data integration engine which supports batch, streaming and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of data every day.
calcite
Apache Calcite
cnosdb
An Open Source Distributed Time Series Database with high performance, high compression ratio and high usability.
flink
Apache Flink
flink-cdc-connectors
CDC Connectors for Apache Flink®
hudi
Upserts, Deletes And Incremental Processing on Big Data.
DroidDrops
梳理下自己之前写过的文章
druid
Column oriented distributed data store ideal for powering interactive applications
incubator-paimon
Apache Paimon(incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.
incubator-seatunnel
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
iotdb
Apache IoTDB
pyppeteer
Headless chrome/chromium automation library (unofficial port of puppeteer)
ratis
Open source Java implementation for Raft consensus protocol.
risingwave
🚀SQL stream processing with Postgres-like experience. 🪄More than a modern alternative to Apache Flink.
scrapy
Linkedin Scraper
tidb
TiDB is a distributed NewSQL database compatible with MySQL protocol
trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)