tencent-ailab / inferflow

Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

tencent-ailab/inferflow Watchers