aahouzi / llama2-chatbot-cpu

A LLaMA2-7b chatbot with memory running on CPU, and optimized using smooth quantization, 4-bit quantization or Intel® Extension For PyTorch with bfloat16.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

aahouzi/llama2-chatbot-cpu Issues

No issues in this repository yet.