Cheng Luo (wdlctc)

wdlctc

Geek Repo

Company:Fudan university

Github PK Tool:Github PK Tool

Cheng Luo's repositories

Language:PythonStargazers:7Issues:0Issues:0

rtp

RTP: Rethinking Tensor Parallelism with Memory Deduplication

Language:PythonLicense:Apache-2.0Stargazers:6Issues:1Issues:0
Language:PythonStargazers:5Issues:0Issues:0

Pensieve-PPO

The simplest implementation of Pensieve (SIGCOMM' 17) via state-of-the-art RL algorithms, including PPO, DQN, and SAC

Language:PythonLicense:BSD-2-ClauseStargazers:3Issues:0Issues:0
Language:JavaScriptLicense:MITStargazers:1Issues:1Issues:0
Language:HTMLLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

FlexFlow

FlexFlow Serve: Low-Latency, High-Performance LLM Serving

License:Apache-2.0Stargazers:0Issues:0Issues:0

LASP

Linear Attention Sequence Parallelism (LASP)

License:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:0Issues:2Issues:0
Language:CudaLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

neuraloperator

Learning in infinite dimension with neural operators.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Open-Sora-old

Building your own video generation model like OpenAI's Sora

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

OpenDiT

OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:PHPLicense:NOASSERTIONStargazers:0Issues:0Issues:0

SIMPLE

Selfplay In MultiPlayer Environments

License:GPL-3.0Stargazers:0Issues:0Issues:0

Speculative-Sampling

Implementation of Speculative Sampling as described in "Accelerating Large Language Model Decoding with Speculative Sampling" by Deepmind

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

streaming-llm

Efficient Streaming Language Models with Attention Sinks

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

tensorly

TensorLy: Tensor Learning in Python.

License:NOASSERTIONStargazers:0Issues:0Issues:0

tltorch

TensorLy-Torch: Deep Tensor Learning with TensorLy and PyTorch

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:0Issues:0

triton

Development repository for the Triton language and compiler

Language:C++License:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:1Issues:0