Jiajia Li's repositories
alibabacloud-jindo-sdk
alibabacloud-jindo-sdk
aliyun-emapreduce-datasources
Extended datasource support for Spark/Hadoop on Aliyun E-MapReduce.
alluxio
Alluxio, data orchestration for analytics and machine learning in the cloud
arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.
directory-kerby
Mirror of Apache Directory Kerby
hadoop
Mirror of Apache Hadoop
spark
Mirror of Apache Spark
docker-hadoop
Apache Hadoop docker image
dremio-flight-connector
Dremio Flight connector. Access Dremio using Arrow flight
dremio-oss
Dremio - the missing link in modern data
grpc
The C based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)
grpc-java
The Java gRPC implementation. HTTP/2 based RPC
Impala
Real-time Query for Hadoop; mirror of Apache Impala
kubernetes-HDFS
Repository holding configuration files for running an HDFS cluster in Kubernetes
kudu
Mirror of Apache Kudu
libhdfs3
libhdfs3 a native c/c++ hdfs client
OAP
Optimized Analytics Package for Spark Platform
parquet-cpp
Apache Parquet
parquet-format
Apache Parquet
parquet-mr
Apache Parquet
pegasus
An accelerated data and cache service for big data based on Apache Arrow Flight
ranger
Mirror of Apache Ranger
ray
A fast and simple framework for building and running distributed applications.
rubix
Cache File System optimized for columnar formats and object stores
serving
A flexible, high-performance serving system for machine learning models
spark-on-k8s-operator
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
spark-terasort
Spark Terasort