JasonLee's repositories
bahir-flink
Mirror of Apache Bahir Flink
bitsail
BitSail is a distributed high-performance data integration engine which supports batch, streaming and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of data every day.
calcite
Apache Calcite
dlink
Dlink & Apache Flink
dolphinscheduler
Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available out of box.
doris
Apache Doris is an easy-to-use, high performance and unified analytics database.
doris-flink-connector
Flink Connector for Apache Doris
feathub
FeatHub - A stream-batch unified feature store for real-time machine learning
flink
Apache Flink
flink-benchmarks
Benchmarks for Apache Flink
flink-cdc-connectors
Change Data Capture (CDC) Connectors for Apache Flink
flink-connector-jdbc
Apache flink
flink-remote-shuffle
Remote Shuffle Service for Flink
flinkx
基于flink的分布式数据同步工具
hadoop
Apache Hadoop
hudi
Upserts, Deletes And Incremental Processing on Big Data.
iotdb
Apache IoTDB
kafka-eagle
A easy and high-performance monitoring system, for comprehensive monitoring and management of kafka cluster.
kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
LakeSoul
A Table Structure Storage on Data Lakes to Unify Batch and Streaming Data Processing
nexmark
Benchmarks for queries over continuous data streams.
pulsar
Apache Pulsar - distributed pub-sub messaging system
spark
Apache Spark - A unified analytics engine for large-scale data processing
streampark
Make stream processing easier! easy-to-use stream processing application development framework and one-stop stream processing operation platform
zeppelin
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.