There are 0 repository under exllama topic.
LLM telegram bot
Run gguf LLM models in Latest Version TextGen-webui
A lightweight, fast, parallel inference server for Llama
A constrained generation filter for local LLMs that makes them quote properly from a source document
This is a playground to explore the ExLlama project in a Windows environment.
A Python script designed to streamline the process of quantizing models to exllamav2 format