There are 1 repository under gptq topic.
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
Large Language Models for All, 🦙 Cult and More, Stay in touch !
🪶 Lightweight OpenAI drop-in replacement for Kubernetes
Advanced Quantization Algorithm for LLMs. This is official implementation of "Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs"
A guide about how to use GPTQ models with langchain
ChatSakura:Open-source multilingual conversational model.(开源多语言对话大模型)
Private self-improvement coaching with open-source LLMs
This repository is for profiling, extracting, visualizing and reusing generative AI weights to hopefully build more accurate AI models and audit/scan weights at rest to identify knowledge domains for risk(s).
Run gguf LLM models in Latest Version TextGen-webui
This project will develop a NEPSE chatbot using an open-source LLM, incorporating sentence transformers, vector database and reranking.
Hands on some LLMs
Personal GitHub repository for stashing resources on Large Language Models (LLM), including Jupyter Notebooks on open source LLMs, use-cases with Langchain and R&D paper review.
Quantizing LLMs using GPTQ