Mingliang Zhu's repositories
arrow-datafusion-comet
Apache Arrow DataFusion Comet Spark Accelerator
delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive, and APIs for Scala, Java, Rust, Ruby, and Python.
God-Of-BigData
Focused on big data learning and interview preparation; the road to big data mastery starts here. Flink/Spark/Hadoop/Hbase/Hive...
incubator-celeborn
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
incubator-gluten
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
kyuubi
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
mlsql
The Programming Language Designed For Big Data and AI
rubix
Cache File System optimized for columnar formats and object stores
sentry
Access Server
spark
Apache Spark - A unified analytics engine for large-scale data processing
spark-excel
A Spark plugin for reading Excel files via Apache POI
velox
A C++ vectorized database acceleration library aimed at optimizing query engines and data processing systems.