KYHuang's repositories
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Apache-2.0000
Mercury-Frontend
Mercury is very toxic!
Ph.D. candidate, Tongji University
Shanghai, China
A high-throughput and memory-efficient inference and serving engine for LLMs
Mercury is very toxic!