GumKey's repositories
seatunnel
SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.
doris
Apache Doris is an easy-to-use, high-performance, and unified analytics database.
starrocks
StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries. Winner of InfoWorld’s 2023 BOSSIE Award for best open source software.
presto
The official home of the Presto distributed SQL query engine for big data
coral
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
flink
Apache Flink
hudi
Upserts, Deletes And Incremental Processing on Big Data.
kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
iceberg
Apache Iceberg
amoro
Amoro is a Lakehouse management system built on open data lake formats.
alluxio
Alluxio, data orchestration for analytics and machine learning in the cloud
hadoop
Apache Hadoop
styleguide
Style guides for Google-originated open-source projects
bitsail
BitSail is a distributed, high-performance data integration engine that supports batch, streaming, and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of data records every day.
atlas
Apache Atlas
spline
Data Lineage Tracking And Visualization Solution
spline-spark-agent
Spline agent for Apache Spark
spark-atlas-connector
A Spark Atlas connector to track data lineage in Apache Atlas
streampark
Make stream processing easier! An easy-to-use stream processing application development framework and one-stop stream processing operation platform.
calcite
Apache Calcite
scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
antlr4
ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.
mysql-server
MySQL Server, the world's most popular open source database, and MySQL Cluster, a real-time, open source transactional database.
JavaGuide
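"Java Learning + Interview Guide": a guide covering the core knowledge that most Java programmers need to master. Preparing for a Java interview? Start with JavaGuide!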
scrapyd
A service daemon to run Scrapy spiders
flink-cdc-connectors
CDC Connectors for Apache Flink®
backtrader
Python Backtesting library for trading strategies
datahub
A Generalized Metadata Search & Discovery Tool