Chen Shen (scv119)

Company: Anyscale

Location: United States

Chen Shen's repositories

ray

An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

Language: Python, License: Apache-2.0, Stargazers: 2, Issues: 1, Issues: 0

awesome-tensor-compilers

A list of awesome compiler projects and papers for tensor computation and deep learning.

Stargazers: 1, Issues: 0, Issues: 0

openmlsys-zh

《Machine Learning Systems: Design and Implementation》- Chinese Version

Stargazers: 1, Issues: 0, Issues: 0

FasterTransformer

Transformer related optimization, including BERT, GPT

License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

flash-attention

Fast and memory-efficient exact attention

License: BSD-3-Clause, Stargazers: 0, Issues: 0, Issues: 0

flashinfer

FlashInfer: Kernel Library for LLM Serving

License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

grouped_gemm

PyTorch bindings for CUTLASS grouped GEMM.

License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

how-to-optim-algorithm-in-cuda

How to optimize some algorithms in CUDA.

Stargazers: 0, Issues: 0, Issues: 0

lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

Lightrails

Yet another distributed training/inferencing framework.

License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language: C++, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

Megatron-LM

Ongoing research training transformer models at scale

License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

mini-redis

Incomplete Redis client and server implementation using Tokio - for learning purposes only

Language: Rust, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

License: MIT, Stargazers: 0, Issues: 0, Issues: 0

og-equity-compensation

Stock options, RSUs, taxes — read the latest edition: www.holloway.com/ec

Stargazers: 0, Issues: 0, Issues: 0

orbit

A Python package for Bayesian forecasting with object-oriented design and probabilistic models under the hood.

Language: Python, License: NOASSERTION, Stargazers: 0, Issues: 1, Issues: 0

r4cppp

Rust for C++ programmers

License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

ScaleLLM

A high-performance inference system for large language models, designed for production environments.

License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

The-Art-of-Linear-Algebra

Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"

License: CC0-1.0, Stargazers: 0, Issues: 0, Issues: 0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0