Ferdinand Xu's repositories
spark-rapids
Spark RAPIDS plugin - accelerate Apache Spark with GPUs
cudf
cuDF - GPU DataFrame Library
gluten
Gluten: Plugin to Double SparkSQL's Performance
oap-project.github.io
The OAP project web site
Gluten-Trino
Gluten: Plugin to Double Trino's Performance
BDTK
A modular acceleration toolkit for big data analytic engines
velox-1
A new C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.
omniscidb
OmniSciDB (formerly MapD Core)
substrait
A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
presto
Distributed SQL query engine for big data
sql-ds-cache
Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.
awesome-machine-learning
A curated list of awesome Machine Learning frameworks, libraries and software.
mars
Mars is a tensor-based unified framework for large-scale data computation which scales Numpy, Pandas and Scikit-learn.
OAP
Optimized Analytics Package for Spark Platform
ElasticDataFusion
Provide some utilities
SimpleCache
A simple version of Guava cache
kudu
Apache Kudu. Mirrored from https://github.com/apache/kudu
orc
Mirror of Apache Orc
hive
Mirror of Apache Hive
incubator-parquet-mr
Mirror of Apache Parquet
parquet-cpp
Mirror of Apache Parquet
parquet-format
Mirror of Apache Parquet
flink
Mirror of Apache Flink
arrow
Mirror of Apache Arrow
commons-crypto
Mirror of Apache Commons Crypto
hive-testbench
Testbench for experimenting with Apache Hive at any data scale.