zhuzilin / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Home Page:https://docs.vllm.ai

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

zhuzilin/vllm Stargazers

No one’s star this repository yet.