shamio's starred repositories
text-generation-webui
A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
lollms-webui
Lord of Large Language Models Web User Interface
intel-extension-for-transformers
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
languagemodels
Explore large language models in 512MB of RAM
Long-Context
This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and benchmark tasks that evaluate a model’s information retrieval capabilities with context expansion. We also include key experimental results and instructions for reproducing and building on them.
neural-speed
An innovative library for efficient LLM inference via low-bit quantization
Entropy-ABF
Official implementation for 'Extending LLMs’ Context Window with 100 Samples'