Yang Jiang's repositories
arrow-datafusion
Apache Arrow DataFusion and Ballista query engines
arrow
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
arrow-ballista
Apache Arrow DataFusion and Ballista query engines
arrow-datafusion-comet
Apache Arrow DataFusion Comet Spark Accelerator
arrow-rs
Official Rust implementation of Apache Arrow
arrow-site
Mirror of Apache Arrow site
arrow2
Unofficial transmute-free Rust library to work with the Arrow format
blaze
Blazing-fast query execution engine that speaks the Apache Spark language and has Arrow-DataFusion at its core.
ByConity
ByConity is an open source cloud-native data warehouse
datafuse
An elastic and scalable cloud warehouse that offers blazing-fast queries and combines the elasticity, simplicity, and low cost of the cloud, built to make the Data Cloud easy
datafusion-objectstore-hdfs
HDFS, via its Java implementation, as a remote ObjectStore for DataFusion
duckdb
DuckDB is an in-process SQL OLAP Database Management System
hadoop
Apache Hadoop
hyperloglog.rs
HyperLogLog implementations.
kylin
Apache Kylin
log
Logging implementation for Rust
parquet-mr
Apache Parquet
parquet-testing
Auxiliary files for compatibility and integration tests for Apache Parquet
parquet2
Fastest and safest Rust implementation of parquet. `unsafe` free. Integration-tested against pyarrow
risinglight
An OLAP database system for educational purposes
sled
the champagne of beta embedded databases
spark
Apache Spark
tantivy
Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust
the-algorithm
Source code for Twitter's Recommendation Algorithm
tinyvec
Just, really the littlest Vec you could need. So smol.
trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
wickdb
Pure Rust LSM-tree based embedded storage engine