jiaoqingbo's repositories
ambari
Apache Ambari simplifies provisioning, managing, and monitoring of Apache Hadoop clusters.
ambari-Kylin
Ambari集成Apache Kylin服务(离线部署、可支持HDP2.6+及HDP3.0+)
apisix-java-plugin-runner
APISIX Plugin Runner in Java
AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
flink
Apache Flink
flink-cdc-connectors
Change Data Capture (CDC) Connectors for Apache Flink
flink-playgrounds
Apache Flink Playgrounds
flink-remote-shuffle
Remote Shuffle Service for Flink
flink-table-store
An Apache Flink subproject to provide storage for dynamic tables.
flink-training
Apache Flink Training Excercises
hadoop
Apache Hadoop
hive
Apache Hive
hudi
Upserts, Deletes And Incremental Processing on Big Data.
gravitino
A high-performance, geo-distributed and federated metadata lake
incubator-celeborn
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
incubator-kyuubi
Apache Kyuubi is a distributed multi-tenant JDBC server for large-scale data processing and analytics, built on top of Apache Spark
incubator-kyuubi-website
Apache Kyuubi Site
incubator-uniffle
Uniffle is a high performance, general purpose Remote Shuffle Service.
kyuubi-client
Apache kyuubi
ldapsdk
UnboundID LDAP SDK for Java
nexmark
Benchmarks for queries over continuous data streams.
orc
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
ranger
Mirror of Apache Ranger
recipes
The Immerok Apache Flink Cookbook is a collection of examples of Apache Flink applications in the format of "recipes". Each recipe explains how you can solve a specific problem by leveraging one or more of the APIs of Apache Flink. The recipes can be extended or provide a basis for solving your requirements with Apache Flink.
spark
Apache Spark - A unified analytics engine for large-scale data processing
submarine
Submarine is Cloud Native Machine Learning Platform.
tez
Apache Tez
tpch
Port of TPC-H dbgen to Java
trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
trino-the-definitive-guide
Resource for the book Trino: The Definitive Guide (and formerly Presto: The Definitive Guide)