shivpalSW / Optimized-CPU-Implementation-of-Llama2

Optimized CPU Implementation of Llama2-LLM

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Optimized-CPU-Implementation-of-Llama2

Optimized CPU Implementation of Llama2

Implimented :-

"TheBloke/Llama-2-7B-Chat-GGML" 4-bit Model from Huggingface Hub Model Link

Simple UI on local

alt text

About

Optimized CPU Implementation of Llama2-LLM

License:MIT License


Languages

Language:Python 70.7%Language:HTML 29.3%