Nan Zhu (CodingCat)

CodingCat

Geek Repo

Company:OpenAI

Location:Seattle

Home Page:http://codingcat.me/

Github PK Tool:Github PK Tool


Organizations
dmlc

Nan Zhu's repositories

xgboost4j-spark-scalability

a benchmark to test scalability of xgboost4j-spark and relevant projects

Language:HTMLStargazers:1Issues:1Issues:0

spark

Mirror of Apache Spark

Language:ScalaLicense:Apache-2.0Stargazers:1Issues:2Issues:0

analytics-zoo

Distributed Tensorflow, Keras and BigDL on Apache Spark

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:1Issues:0

arrow-datafusion

Apache Arrow DataFusion and Ballista query engines

Language:RustLicense:Apache-2.0Stargazers:0Issues:1Issues:0

BigDL

BigDL: Distributed Deep Learning Library for Apache Spark

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:1Issues:0

celeborn-website

Apache Celeborn Site

License:Apache-2.0Stargazers:0Issues:0Issues:0

cockroachdb-todo-apps

CockroachDB To-Do Apps

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

cockroachdb_playground

some programs to play around cockroachdb

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

delta

An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:1Issues:0

dmlc-core

A common bricks library for building scalable and portable distributed machine learning.

Language:C++License:NOASSERTIONStargazers:0Issues:1Issues:0

ec2-selector-cli

the cli tool to select ec2 instances based on filters

Language:RustLicense:Apache-2.0Stargazers:0Issues:1Issues:0

frameless

Expressive types for Spark.

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

gazelle_plugin

Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:1Issues:0

github-markdown-toc

Easy TOC creation for GitHub README.md

Language:ShellLicense:MITStargazers:0Issues:1Issues:0
Language:ScalaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

how-query-engines-work

This is the companion repository for the book How Query Engines Work.

Language:KotlinLicense:Apache-2.0Stargazers:0Issues:1Issues:0

iceberg

Apache Iceberg

Language:JavaLicense:Apache-2.0Stargazers:0Issues:1Issues:0

incubator-celeborn

Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

incubator-sedona

A cluster computing framework for processing large-scale geospatial data

Language:JavaLicense:Apache-2.0Stargazers:0Issues:1Issues:0

incubator-uniffle

Uniffle is a high performance, general purpose Remote Shuffle Service.

Language:JavaLicense:Apache-2.0Stargazers:0Issues:0Issues:0

morpheus

Morpheus brings the leading graph query language, Cypher, onto the leading distributed processing platform, Spark.

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:1Issues:0

noisepage

Self-Driving Database Management System from Carnegie Mellon University

Language:C++License:MITStargazers:0Issues:1Issues:0

rabit

Reliable Allreduce and Broadcast Interface for distributed machine learning

Language:C++License:BSD-3-ClauseStargazers:0Issues:1Issues:0

spark-lineage

Spark SQL listener to record lineage information

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:1Issues:0

spark-sql-macros

Spark SQL Macros provides a mechanism similar to Spark User-Defined function registration; with the key enhancement being that custom code gets compiled to equivalent Catalyst Expressions at macro define time.

Language:ScalaLicense:Apache-2.0Stargazers:0Issues:1Issues:0
Language:RustLicense:Apache-2.0Stargazers:0Issues:1Issues:0

terraform-aws-eks-node-group

Terraform module to provision a fully managed AWS EKS Node Group

Language:HCLLicense:Apache-2.0Stargazers:0Issues:1Issues:0

velox-intel

A new C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

xgboost

Large-scale and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, on single node, hadoop yarn and more.

Language:C++License:Apache-2.0Stargazers:0Issues:2Issues:0