Thomas's repositories
datasketches-memory
High performance native memory access for Java.
incubator-gluten
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
datafusion
Apache DataFusion SQL Query Engine
duckdb
DuckDB is an analytical in-process SQL database management system
datafusion-comet
Apache DataFusion Comet Spark Accelerator
gravitino
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
oap-velox
A new C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.
mns
mns
arroyo
Distributed stream processing engine in Rust
openhouse
Open Control Plane for Tables in Data Lakehouse
incubator-celeborn
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
northstar
国内最优秀的基于JAVA的AI开源量化交易平台,秒替文华、MC、金字塔。具备历史回放、策略研发、模拟交易、实盘交易等功能。兼顾全自动与半自动的使用场景。
incubator-fury
A blazing fast multi-language serialization framework powered by JIT and zero-copy.
spark
Mirror of Apache Spark
kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
compass
Compass is a task diagnosis platform for bigdata
sparklens
Qubole Sparklens tool for performance tuning Apache Spark
Qbot
[🔥updating ...] 自动量化交易机器人 Qbot is an AI-oriented quantitative investment platform, which aims to realize the potential, empower AI technologies in quantitative investment. https://ufund-me.github.io/Qbot :news: qbot-mini: https://github.com/Charmve/iQuant
summarize-github-pull-requests-paimon
Perform automatic code reviews for GitHub pull requests (PR). The review is triggered when a PR is created and is triggered again for every subsequent commit in the PR. The code review is conducted for commit in the PR.
incubator-paimon
Apache Paimon(incubating) is a streaming data lake platform that supports high-speed data ingestion, change data tracking and efficient real-time analytics.
summarize-github-pull-requests
Perform automatic code reviews for GitHub pull requests (PR). The review is triggered when a PR is created and is triggered again for every subsequent commit in the PR. The code review is conducted for commit in the PR.
doris
Apache Doris is an MPP-based interactive SQL data warehousing for reporting and analysis.
trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
chitu-sdp
赤兔实时计算平台是基于 Apache Flink 构建的企业级、一站式、高性能、低门槛大数据实时计算平台,广泛适用于流式数据应用开发场景。
dataService
dataService platform is a low-code platform, which only needs to write SQL to realize the development of API services, solve the unification of data services, facilitate the governance of data services, and unify the caliber of indicators. It can improve the development efficiency of business and face business changes faster
VictoriaMetrics
VictoriaMetrics: fast, cost-effective monitoring solution and time series database
dolphinscheduler
Apache DolphinScheduler is the modern data workflow orchestration platform with powerful user interface, dedicated to solving complex task dependencies in the data pipeline and providing various types of jobs available `out of the box`
kubevela
The Modern Application Platform.