KevinZhangt / llama-int8

Quantized inference code for LLaMA models

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

This repository is not active

About

Quantized inference code for LLaMA models

License:GNU General Public License v3.0


Languages

Language:Python 93.2%Language:Shell 6.8%