yuanoOo's repositories
BigData-Notes
大数据入门指南 :star:
kylin
Apache Kylin
dlink
Dinky is an out of the box one-stop real-time computing platform dedicated to the construction and practice of Unified Batch & Streaming and Unified Data Lake & Data Warehouse. Based on Apache Flink, Dinky provides the ability to connect many big data frameworks including OLAP and Data Lake.
dolphinscheduler
Apache DolphinScheduler is the modern data workflow orchestration platform with powerful user interface, dedicated to solving complex task dependencies in the data pipeline and providing various types of jobs available `out of the box`
druid
Apache Druid: a high performance real-time analytics database.
flink
Apache Flink
flink-cdc
Flink CDC is a streaming data integration tool
flink-connector-oceanbase
Apache Flink Connector for OceanBase.
flink-sql-lineage
FlinkSQL字段血缘解决方案及源码。FlinkSQL field lineage solution and source code, The core idea is to parse SQL through Calcite to generate a RelNode tree of relational expressions. Then get the optimized logical paln through optimization stage, and finally call Calcite RelMetadataQuery to get the lineage relationship at the field level.
flink-table-store-101
Playground for Flink Table Store with use cases and performance features
freenom
Freenom 域名自动续期。Freenom domain name renews automatically.
hudi
Upserts, Deletes And Incremental Processing on Big Data.
hudi-resources
汇总Apache Hudi相关资料
incubator-kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
incubator-paimon
Apache Paimon(incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.
lu-raft-kv
raft-kv-storage 欢迎 star,凑够500
spark
Apache Spark - A unified analytics engine for large-scale data processing
spark-scala-examples
This project provides Apache Spark SQL, RDD, DataFrame and Dataset examples in Scala language