jllllll

jllllll

Geek Repo

0

following

0

stars

Github PK Tool:Github PK Tool

jllllll's repositories

bitsandbytes-windows-webui

Windows compile of bitsandbytes for use in text-generation-webui.

Language:HTMLLicense:MITStargazers:336Issues:8Issues:29

llama-cpp-python-cuBLAS-wheels

Wheels for llama-cpp-python compiled with cuBLAS support

Language:HTMLLicense:UnlicenseStargazers:86Issues:1Issues:25

exllama

A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

Language:PythonLicense:MITStargazers:66Issues:2Issues:13

one-click-installers

Simplified installers for oobabooga/text-generation-webui.

bitsandbytes

8-bit CUDA functions for PyTorch

Language:PythonLicense:MITStargazers:21Issues:0Issues:3

GPTQ-for-LLaMa-Wheels

Precompiled Wheels for GPTQ-for-LLaMa

flash-attention

Fast and memory-efficient exact attention - Windows wheels

Language:PythonLicense:BSD-3-ClauseStargazers:17Issues:0Issues:1

GPTQ-for-LLaMa-CUDA

A combination of Oobabooga's fork and the main cuda branch of GPTQ-for-LLaMa in a package format.

Language:PythonLicense:Apache-2.0Stargazers:17Issues:0Issues:0

ctransformers-cuBLAS-wheels

ctransformers wheels with pre-built CUDA binaries for additional CUDA and AVX versions.

Language:HTMLLicense:MITStargazers:12Issues:1Issues:0

windows-venv-installers

Standalone, dependency-less scripts for automatically setting up a virtual environment for easy project installation on Windows.

Language:PowerShellLicense:UnlicenseStargazers:7Issues:1Issues:0

AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Language:PythonLicense:MITStargazers:4Issues:0Issues:0

exllamav2

A fast inference library for running LLMs locally on modern consumer-class GPUs

Language:PythonLicense:MITStargazers:3Issues:1Issues:0

text-generation-webui

A gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, and GALACTICA.

Language:PythonLicense:AGPL-3.0Stargazers:2Issues:0Issues:0

h2ogpt

Private Q&A and summarization of documents+images or chat with local GPT, 100% private, no data leaks, Apache 2.0. Demo: https://gpt.h2o.ai/

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

GPTQ-for-LLaMa

4 bits quantization of LLMs using GPTQ

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

ctransformers

Python bindings for the Transformer models implemented in C/C++ using GGML library.

Language:CLicense:MITStargazers:0Issues:0Issues:0

safetensors

Simple, safe way to store and distribute tensors

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

scikit-build-core

A next generation Python CMake adaptor and Python API for plugins

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

SillyTavern

LLM Frontend for Power Users.

Language:JavaScriptLicense:AGPL-3.0Stargazers:0Issues:0Issues:0