Accelerate LLM with low-bit (FP4 / INT4 / FP8 / INT8) optimizations using ipex-llm
Home Page:https://github.com/intel-analytics/bigdl
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool