monster's repositories
paimon-quickstart
Beginner learn Paimon with paion on flink from official website.
arrow-datafusion
Apache Arrow DataFusion SQL Query Engine
bistoury
Bistoury是去哪儿网的java应用生产问题诊断工具,提供了一站式的问题诊断方案
coder-kung-fu
开发内功修炼
dolphinscheduler
Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available out of box.
dolphinscheduler-ambari
Apache DolphinScheduler's Ambari plugin, deploy DolphinScheduler easier within Apache Ambari
dolphinscheduler-datawarehouse
Apache DolphinScheduler data warehouse.
flink-connector-clickhouse
Flink SQL connector for ClickHouse. Support ClickHouseCatalog and read/write primary data, maps, arrays to clickhouse.
flink-sql-lineage
FlinkSQL字段血缘解决方案及源码
flink-sql-security
FlinkSQL的行级权限解决方案及源码,支持面向用户级别的行级数据访问控制,即特定用户只能访问授权过的行,隐藏未授权的行数据。此方案是实时领域Flink的解决方案,类似离线数仓Hive中Ranger Row-level Filter方案。
gitlab4j-api
GitLab4J API (gitlab4j-api) provides a full featured Java client library for working with GitLab repositories via the GitLab REST API
golphinscheduler
Developed based on go ecology, cloud native scheduling system
iceberg
Apache Iceberg
incubator-celeborn
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
incubator-devlake
Apache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and community growth.
incubator-hugegraph
A graph database that supports more than 100+ billion data, high performance and scalability (Include OLTP Engine & REST-API & Backends)
incubator-kvrocks
Kvrocks is a distributed key value NoSQL database that uses RocksDB as storage engine and is compatible with Redis protocol.
incubator-opendal
Apache OpenDAL: Access data freely, painlessly, and efficiently.
incubator-paimon
Apache Paimon(incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.
incubator-seatunnel
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
incubator-uniffle
Uniffle is a high performance, general purpose Remote Shuffle Service.
inlong
Apache InLong - a one-stop integration framework for massive data
java
Official Java client library for kubernetes
juicefs
JuiceFS is a distributed POSIX file system built on top of Redis and S3.
langchain
⚡ Building applications with LLMs through composability ⚡
LSM-Tree
Log-Structured Merge Tree Java implementation
risingwave
The distributed streaming database: SQL stream processing with Postgres-like experience 🪄. 10X faster and more cost-efficient than Apache Flink 🚀.
trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
ververica-platform-playground
Instructions for getting started with Ververica Platform on minikube.
yunikorn-core
Apache YuniKorn Core