zhaomin1423

Min Zhao's repositories

incubator-iceberg

Apache Iceberg (Incubating)

Language:JavaApache-2.0100

airbyte

Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.

Language:JavaNOASSERTION000

arctic

Arctic is a streaming lake warehouse service open sourced by NetEase

Language:JavaApache-2.0000

flink-cdc-connectors

Change Data Capture (CDC) Connectors for Apache Flink

Language:JavaApache-2.0010

BitSail is a distributed, high-performance data integration engine and provides global data integration solutions in batch, streaming, and incremental scenarios. At present, BitSail has been widely used and synchronizes hundreds of trillions data every day.

Apache-2.0000

blaze

Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.

Apache-2.0000

DataSphereStudio

DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.

Apache-2.0000

debezium

Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.

Language:JavaApache-2.0000

delta

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs for Scala, Java, Rust, Ruby, and Python.

Language:ScalaApache-2.0010

dolphinscheduler

Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available out of box.

Language:JavaApache-2.0010

elasticsearch-hadoop

:elephant: Elasticsearch real-time search and analytics natively integrated with Hadoop

Language:JavaApache-2.0000

example-custom-event-handler

Language:Scala000

gravitino

A high-performance, geo-distributed and federated metadata lake

Apache-2.0000

incubator-doris

Apache Doris (Incubating)

Language:C++Apache-2.0000

incubator-kyuubi-website

Apache Kyuubi Site

Language:HTMLApache-2.0000

incubator-linkis

Linkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.

Apache-2.0000

incubator-livy

Mirror of Apache livy (Incubating)

Language:ScalaApache-2.0010

incubator-paimon

Apache Paimon(incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.

Language:JavaApache-2.0000

incubator-seatunnel

SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).

Language:JavaApache-2.0010