Beast code in Giters

jllllll's repositories

bitsandbytes-windows-webui

Windows compile of bitsandbytes for use in text-generation-webui.

Language:HTMLMIT336 8 29

llama-cpp-python-cuBLAS-wheels

Wheels for llama-cpp-python compiled with cuBLAS support

Language:HTMLUnlicense88 1 25

exllama

A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

Language:PythonMIT66 2 13

one-click-installers

Simplified installers for oobabooga/text-generation-webui.

Language:Batchfile55 7 8

flash-attention

Fast and memory-efficient exact attention - Windows wheels

Language:PythonBSD-3-Clause2201

bitsandbytes

8-bit CUDA functions for PyTorch

Language:PythonMIT2103

GPTQ-for-LLaMa-CUDA

A combination of Oobabooga's fork and the main cuda branch of GPTQ-for-LLaMa in a package format.

Language:PythonApache-2.02000

GPTQ-for-LLaMa-Wheels

Precompiled Wheels for GPTQ-for-LLaMa

Unlicense18 3 2

ctransformers-cuBLAS-wheels

ctransformers wheels with pre-built CUDA binaries for additional CUDA and AVX versions.

Language:HTMLMIT12 10

windows-venv-installers

Standalone, dependency-less scripts for automatically setting up a virtual environment for easy project installation on Windows.

Language:PowerShellUnlicense7 10

AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Language:PythonMIT500

exllamav2

A fast inference library for running LLMs locally on modern consumer-class GPUs

Language:PythonMIT3 10

text-generation-webui

A gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, and GALACTICA.

Language:PythonAGPL-3.0200

h2ogpt

Private Q&A and summarization of documents+images or chat with local GPT, 100% private, no data leaks, Apache 2.0. Demo: https://gpt.h2o.ai/

Language:PythonApache-2.0100

GPTQ-for-LLaMa

4 bits quantization of LLMs using GPTQ

Language:PythonApache-2.0000

ctransformers

Python bindings for the Transformer models implemented in C/C++ using GGML library.

Language:CMIT000

safetensors

Simple, safe way to store and distribute tensors

Language:PythonApache-2.0000

scikit-build-core

A next generation Python CMake adaptor and Python API for plugins

Language:PythonApache-2.0000

SillyTavern

LLM Frontend for Power Users.

Language:JavaScriptAGPL-3.0000