plusplusjiajia

Alibaba

Shanghai

Organizations

apache

Jiajia Li's repositories

alibabacloud-jindo-sdk

License: Apache-2.0 | 0 0 0

aliyun-emapreduce-datasources

Extended datasource support for Spark/Hadoop on Aliyun E-MapReduce.

Language: Scala | License: Artistic-2.0 | 0 0 0

alluxio

Alluxio, data orchestration for analytics and machine learning in the cloud

Language: Java | License: Apache-2.0 | 0 0 0

arrow

Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.

Language: C++ | License: Apache-2.0 | 0 0 0

directory-kerby

Mirror of Apache Directory Kerby

Language: Java | License: Apache-2.0 | 0 0 0

hadoop

Mirror of Apache Hadoop

Language: Java | License: Apache-2.0 | 0 0 0

spark

Mirror of Apache Spark

Language: Scala | License: Apache-2.0 | 0 0 0

spark-adaptive

Language: Scala | License: Apache-2.0 | 0 0 0

docker-hadoop

Apache Hadoop docker image

Language: Shell | 0 0 0

dremio-flight-connector

Dremio Flight connector. Access Dremio using Arrow flight

Language: Java | License: Apache-2.0 | 0 1 0

dremio-oss

Dremio - the missing link in modern data

License: Apache-2.0 | 0 0 0

flight-spark-source

Language: Java | License: Apache-2.0 | 0 0 0

grpc

The C based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)

License: Apache-2.0 | 0 0 0

grpc-java

The Java gRPC implementation. HTTP/2 based RPC

License: Apache-2.0 | 0 0 0

Impala

Real-time Query for Hadoop; mirror of Apache Impala

License: Apache-2.0 | 0 0 0

kubernetes-HDFS

Repository holding configuration files for running an HDFS cluster in Kubernetes

Language: Shell | License: Apache-2.0 | 0 0 0

kudu

Mirror of Apache Kudu

Language: C++ | License: Apache-2.0 | 0 0 0

libhdfs3

libhdfs3, a native C/C++ HDFS client

License: Apache-2.0 | 0 0 0

OAP

Optimized Analytics Package for Spark Platform

0 0 0

parquet-cpp

Apache Parquet

License: Apache-2.0 | 0 0 0

parquet-format

Apache Parquet

License: Apache-2.0 | 0 0 0

parquet-mr

Apache Parquet

License: Apache-2.0 | 0 0 0

pegasus

An accelerated data and cache service for big data based on Apache Arrow Flight

0 0 0

ranger

Mirror of Apache Ranger

License: Apache-2.0 | 0 0 0

ray

A fast and simple framework for building and running distributed applications.

Language: Python | License: Apache-2.0 | 0 0 0

rubix

Cache File System optimized for columnar formats and object stores

License: Apache-2.0 | 0 0 0

serving

A flexible, high-performance serving system for machine learning models

License: Apache-2.0 | 0 0 0

spark-on-k8s-operator

Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.

License: Apache-2.0 | 0 0 0

spark-sql-perf

License: Apache-2.0 | 0 0 0

spark-terasort

Spark Terasort

Language: Java | License: Apache-2.0 | 0 0 0