intel / neural-speed

An innovative library for efficient LLM inference via low-bit quantization

Home Page: https://github.com/intel/neural-speed

intel/neural-speed Issues