Wenjun Ruan's repositories
dolphinscheduler
Dolphin Scheduler is a distributed and easy-to-extend visual workflow scheduling platform, dedicated to solving the complex dependencies in data processing, making the scheduling system out of the box for data processing.(分布式易扩展的可视化工作流任务调度)
mybatis-3
MyBatis SQL mapper framework for Java
seatunnel
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
shardingsphere-elasticjob
Distributed scheduled job framework
bookkeeper
Apache Bookkeeper
COLA
🥤 COLA: Clean Object-oriented & Layered Architecture
commons-graph
Apache Commons Graph (Sandbox)
DataX
DataX是阿里云DataWorks数据集成的开源版本。
dolphinscheduler-website
Apache DolphinScheduler website
doris
Apache Doris is an easy-to-use, high performance and unified analytics database.
flink
Apache Flink
gravitino
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
iceberg
Apache Iceberg
incubator-inlong
Apache InLong
incubator-streampark
Make stream processing easier! Easy-to-use streaming application development framework and operation platform.
JSqlParser
JSqlParser parses an SQL statement and translate it into a hierarchy of Java classes. The generated hierarchy can be navigated using the Visitor Pattern
kestra
Kestra is an infinitely scalable orchestration and scheduling platform, creating, running, scheduling, and monitoring millions of complex pipelines.
kubernetes
Production-Grade Container Scheduling and Management
kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
logback
The reliable, generic, fast and flexible logging framework for Java.
presto
The official home of the Presto distributed SQL query engine for big data
pulsar
Apache Pulsar - distributed pub-sub messaging system
ruanwenjun.github.io
My blog
seatunnel-web
SeaTunnel is a distributed, high-performance data integration platform for the synchronization and transformation of massive data (offline & real-time).
slf4j
Simple Logging Facade for Java
spark
Apache Spark
starrocks
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.
volcano
A Cloud Native Batch System (Project under CNCF)
zeebe
Distributed Workflow Engine for Microservices Orchestration
zookeeper
Apache ZooKeeper