4-bit quantization of LLaMA using GPTQ
⚠️ Deprecated: Helm charts for applications you run at home
A simple one-file way to run various GGML models with KoboldAI's UI
⚡ Building applications with LLMs through composability ⚡
Inference code for LLaMA models
A Gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, OPT, and GALACTICA.