Beast code in Giters

Lee Gao's starred repositories

llama.cpp

LLM inference in C/C++

Language: C++ | License: MIT | 65776 549 3816

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python | License: Apache-2.0 | 27800 228 4681

mlc-llm

Universal LLM Deployment Engine with ML Compilation

Language: Python | License: Apache-2.0 | 18799 171 1361

generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

License: Apache-2.0 | 16328 133 126

txtai

💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows

Language: Python | License: Apache-2.0 | 8818 87 754

chat-ui

Open source codebase powering the HuggingChat app

Language: TypeScript | License: Apache-2.0 | 7324 79 567

llmware

Unified framework for building enterprise RAG pipelines with small, specialized models

Language: Python | License: Apache-2.0 | 4615 43 133

mergekit

Tools for merging pretrained large language models.

Language: Python | License: LGPL-3.0 | 4583 50 302

rift

Rift: an AI-native language server for your personal AI software engineer

Language: Python | License: Apache-2.0 | 3081 29 87

EasyEdit

[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.

Language: Jupyter Notebook | License: MIT | 1814 22 308

notebooks

Collection of notebook guides created by the Brev.dev team!

Language: Jupyter Notebook | License: MIT | 1634 25 17

yarn

YaRN: Efficient Context Window Extension of Large Language Models

Language: Python | License: MIT | 1316 14 56

FastEdit

🩹Editing large language models within 10 seconds⚡

Language: Python | License: Apache-2.0 | 1273 15 27

augmentoolkit

Convert Compute And Books Into Instruct-Tuning Datasets! Makes: QA, RP, Classifiers.

Language: Python | License: MIT | 926 20 38

KnowledgeEditingPapers

Must-read Papers on Knowledge Editing for Large Language Models.

License: MIT | 852 27 8

attention_sinks

Extend existing LLMs way beyond the original training length with constant memory usage, without retraining

Language: Python | License: Apache-2.0 | 663 12 30

ringattention

Transformers with Arbitrarily Large Context

Language: Python | License: Apache-2.0 | 571 5 15

honest_llama

Inference-Time Intervention: Eliciting Truthful Answers from a Language Model

Language: Python | License: MIT | 447 9 37

langfun

OO for LLMs

Language: Python | License: Apache-2.0 | 442 6 5

laser

The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction

Language: Python | License: MIT | 361 22 22

ai-clone-whatsapp

Create an AI clone of yourself from your WhatsApp chats (using Llama 3)

Language: Python | License: NOASSERTION | 339 9 10

LLaMa2lang

Convenience scripts to finetune (chat-)LLaMa3 and other models for any language

Language: Python | License: Apache-2.0 | 262 12 38

laserRMT

This is our own implementation of 'Layer Selective Rank Reduction'

Language: Python | License: Apache-2.0 | 229 10 7

LLM-Alchemy-Chamber

a friendly neighborhood repository with diverse experiments and adventures in the world of LLMs

Language: Jupyter Notebook | License: MIT | 136 1 1

gbnfgen

TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces

Language: TypeScript | License: MIT | 128 2 7

selfextend

an implementation of Self-Extend, to expand the context window via grouped attention

Language: Python | License: Apache-2.0 | 117 4 5

Entropy-ABF

Official implementation for 'Extending LLMs’ Context Window with 100 Samples'

Language: Python | 73 3 2

SelfExtend

Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta

Language: Python | License: MIT | 13 30

laserRMT-encoder

Fork of Fernando's implementation of 'Layer Selective Rank Reduction'

Language: Jupyter Notebook | License: Apache-2.0 | 700

Anima

Moved to here: https://github.com/lyogavin/airllm

License: Apache-2.0 | 5 20