Beast code in Giters

Cheng Luo's repositories

starlink-trace-tracker

Language:Python11 1 1

recurrent_maskable

Language:Python700

rtp

RTP: Rethinking Tensor Parallelism with Memory Deduplication

Language:PythonApache-2.06 10

MAPPO

Language:Python500

Pensieve-PPO

The simplest implementation of Pensieve (SIGCOMM' 17) via state-of-the-art RL algorithms, including PPO, DQN, and SAC

Language:PythonBSD-2-Clause300

pensieve

Language:JavaScriptMIT1 10

coconet

Language:HTMLMIT000

colab

Language:Python000

CUDAKernelEnergyPredictor

Language:Jupyter Notebook000

efficient_cross_entropy

MIT000

FlexFlow

FlexFlow Serve: Low-Latency, High-Performance LLM Serving

Apache-2.0000

LASP

Linear Attention Sequence Parallelism (LASP)

MIT000

lc

Language:Python000

leo_compact

Language:Jupyter NotebookMIT020

nccl_test

Language:CudaBSD-3-Clause000

neuraloperator

Learning in infinite dimension with neural operators.

Language:PythonMIT000

Open-Sora-old

Building your own video generation model like OpenAI's Sora

Language:PythonApache-2.0000

OpenDiT

OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference

Apache-2.0000

PyTAG

Language:PythonMIT000

regicide

Language:Python000

sevenvice

Language:PHPNOASSERTION000

SIMPLE

Selfplay In MultiPlayer Environments

GPL-3.0000

Speculative-Sampling

Implementation of Speculative Sampling as described in "Accelerating Large Language Model Decoding with Speculative Sampling" by Deepmind

Language:PythonMIT000

streaming-llm

Efficient Streaming Language Models with Attention Sinks

Language:PythonMIT000

tensorly

TensorLy: Tensor Learning in Python.

NOASSERTION000

tltorch

TensorLy-Torch: Deep Tensor Learning with TensorLy and PyTorch

Language:PythonBSD-3-Clause000

triton

Development repository for the Triton language and compiler

Language:C++MIT000

VisionLLaMA

000

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Apache-2.0000

wdlctc.github.io

Language:HTML010