GGgary666 / LLM-Quantization-Practice

Model quantization and inference.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

LLM-Quantization-Practice

Model quantization and inference.

About

Model quantization and inference.


Languages

Language:C++ 67.1%Language:Jupyter Notebook 29.8%Language:Python 1.2%Language:CMake 1.1%Language:Makefile 0.3%Language:Metal 0.2%Language:Cuda 0.1%Language:Shell 0.0%Language:Jinja 0.0%Language:Starlark 0.0%Language:C 0.0%Language:HTML 0.0%Language:Meson 0.0%Language:CSS 0.0%