Ferdinand Xu's repositories
arrow
Mirror of Apache Arrow
awesome-machine-learning
A curated list of awesome Machine Learning frameworks, libraries and software.
BDTK
A modular acceleration toolkit for big data analytic engines
commons-crypto
Mirror of Apache Commons Crypto
cudf
cuDF - GPU DataFrame Library
ElasticDataFusion
Provide some utilities
flink
Mirror of Apache Flink
gluten
Gluten: Plugin to Double SparkSQL's Performance
Gluten-Trino
Gluten: Plugin to Double Trino's Performance
hive
Mirror of Apache Hive
hive-testbench
Testbench for experimenting with Apache Hive at any data scale.
incubator-parquet-mr
Mirror of Apache Parquet
kudu
Apache Kudu. Mirrored from https://github.com/apache/kudu
mars
Mars is a tensor-based unified framework for large-scale data computation which scales NumPy, pandas and scikit-learn.
OAP
Optimized Analytics Package for Spark Platform
oap-project.github.io
The OAP project web site
omniscidb
OmniSciDB (formerly MapD Core)
orc
Mirror of Apache ORC
parquet-cpp
Mirror of Apache Parquet
parquet-format
Mirror of Apache Parquet
presto
Distributed SQL query engine for big data
SimpleCache
A simple version of Guava cache
spark-rapids
Spark RAPIDS plugin - accelerate Apache Spark with GPUs
sql-ds-cache
Spark plug-in for accelerating Spark SQL performance by using cache and index at the SQL data source layer.
substrait
A cross-platform way to express data transformation, relational algebra, standardized record expression and plans.
velox-1
A new C++ vectorized database acceleration library aimed at optimizing query engines and data processing systems.