Rui Wang's repositories
CalciteParser
Illustrates how to use the Calcite Babel parser
incubator-ratis
Mirror of Apache Ratis (Incubating)
beam
Mirror of Apache Beam
calcite
Mirror of Apache Calcite
hadoop-ozone
Scalable, redundant, and distributed object store for Apache Hadoop
batch-processing-gateway
The gateway component to make Spark on K8s much easier for Spark users.
crawler4j
Open Source Web Crawler for Java
cs324_p2
Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022)
DAMO-ConvAI
DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI.
duckdb
DuckDB is an in-process SQL OLAP Database Management System
FlagEmbedding
Open-source Embeddings
hydroflow
Hydro's low-level dataflow runtime
incubator-doris
Apache Doris (Incubating)
incubator-uniffle
Uniffle is a high performance, general purpose Remote Shuffle Service.
libgrape-lite
🍇 A C++ library for parallel graph processing (GRAPE) 🍇
llvm-project
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github pull requests at this moment. Please submit your patches at http://reviews.llvm.org.
llvm-tutor
A collection of out-of-tree LLVM passes for teaching and learning
lotusdb
Fast k/v storage compatible with LSM tree and B+ tree, inspired by SLM-DB in USENIX FAST ’19.
paper-reading
Paragraph-by-paragraph close reading of classic and new deep learning papers
parallel-hashmap
A family of header-only, very fast and memory-friendly hashmap and btree containers.
pumpkin-book
Detailed explanations of the formulas in "Machine Learning" (the "Watermelon Book")
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
spark
Apache Spark - A unified analytics engine for large-scale data processing
SynapseML
Simple and Distributed Machine Learning
toplingdb
ToplingDB is a cloud-native LSM key-value store with a searchable compression algorithm and distributed compaction
velox
A new C++ vectorized database acceleration library aimed at optimizing query engines and data processing systems.
yugabyte-db
The high-performance distributed SQL database for global, internet-scale apps.