Chen Shen's repositories
speculative-decoding
Explorations into some recent techniques surrounding speculative decoding
flash-attention
Fast and memory-efficient exact attention
Language:PythonBSD-3-Clause000
FlexFlow
A distributed deep learning framework.
Language:C++Apache-2.0000
LaTeX-TeXWiki
给大家普及本已普及了的 LaTeX 知识 - LaTeX 工作室 ( wenda.latexstudio.net )
000
lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Language:PythonApache-2.0000
Language:Jupyter Notebook000
REST
REST: Retrieval-Based Speculative Decoding, NAACL 2024
Language:CApache-2.0000
thefuck
Magnificent app which corrects your previous console command.
Language:PythonMIT000
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:PythonApache-2.0000