zhihu / ZhiLight

A highly optimized LLM inference acceleration engine for Llama and its variants.

Repository from GitHub: https://github.com/zhihu/ZhiLight
