Lee Gao (leegao)

Company: Google

Location: Jersey City, NJ

Home Page: http://phailed.me/

Lee Gao's starred repositories

llama.cpp

LLM inference in C/C++

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python, License: Apache-2.0, Stargazers: 27800, Issues: 228, Issues: 4681

mlc-llm

Universal LLM Deployment Engine with ML Compilation

Language: Python, License: Apache-2.0, Stargazers: 18799, Issues: 171, Issues: 1361

generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

txtai

💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows

Language: Python, License: Apache-2.0, Stargazers: 8818, Issues: 87, Issues: 754

chat-ui

Open source codebase powering the HuggingChat app

Language: TypeScript, License: Apache-2.0, Stargazers: 7324, Issues: 79, Issues: 567

llmware

Unified framework for building enterprise RAG pipelines with small, specialized models

Language: Python, License: Apache-2.0, Stargazers: 4615, Issues: 43, Issues: 133

mergekit

Tools for merging pretrained large language models.

Language: Python, License: LGPL-3.0, Stargazers: 4583, Issues: 50, Issues: 302

rift

Rift: an AI-native language server for your personal AI software engineer

Language: Python, License: Apache-2.0, Stargazers: 3081, Issues: 29, Issues: 87

EasyEdit

[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.

Language: Jupyter Notebook, License: MIT, Stargazers: 1814, Issues: 22, Issues: 308

notebooks

Collection of notebook guides created by the Brev.dev team!

Language: Jupyter Notebook, License: MIT, Stargazers: 1634, Issues: 25, Issues: 17

yarn

YaRN: Efficient Context Window Extension of Large Language Models

Language: Python, License: MIT, Stargazers: 1316, Issues: 14, Issues: 56

FastEdit

🩹Editing large language models within 10 seconds⚡

Language: Python, License: Apache-2.0, Stargazers: 1273, Issues: 15, Issues: 27

augmentoolkit

Convert Compute And Books Into Instruct-Tuning Datasets! Makes: QA, RP, Classifiers.

Language: Python, License: MIT, Stargazers: 926, Issues: 20, Issues: 38

KnowledgeEditingPapers

Must-read Papers on Knowledge Editing for Large Language Models.

attention_sinks

Extend existing LLMs way beyond the original training length with constant memory usage, without retraining

Language: Python, License: Apache-2.0, Stargazers: 663, Issues: 12, Issues: 30

ringattention

Transformers with Arbitrarily Large Context

Language: Python, License: Apache-2.0, Stargazers: 571, Issues: 5, Issues: 15

honest_llama

Inference-Time Intervention: Eliciting Truthful Answers from a Language Model

Language: Python, License: MIT, Stargazers: 447, Issues: 9, Issues: 37

langfun

OO for LLMs

Language: Python, License: Apache-2.0, Stargazers: 442, Issues: 6, Issues: 5

laser

The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction

Language: Python, License: MIT, Stargazers: 361, Issues: 22, Issues: 22

ai-clone-whatsapp

Create an AI clone of yourself from your WhatsApp chats (using Llama 3)

Language: Python, License: NOASSERTION, Stargazers: 339, Issues: 9, Issues: 10

LLaMa2lang

Convenience scripts to finetune (chat-)LLaMa3 and other models for any language

Language: Python, License: Apache-2.0, Stargazers: 262, Issues: 12, Issues: 38

laserRMT

This is our own implementation of 'Layer Selective Rank Reduction'

Language: Python, License: Apache-2.0, Stargazers: 229, Issues: 10, Issues: 7

LLM-Alchemy-Chamber

a friendly neighborhood repository with diverse experiments and adventures in the world of LLMs

Language: Jupyter Notebook, License: MIT, Stargazers: 136, Issues: 1, Issues: 1

gbnfgen

TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces

Language: TypeScript, License: MIT, Stargazers: 128, Issues: 2, Issues: 7

selfextend

an implementation of Self-Extend, to expand the context window via grouped attention

Language: Python, License: Apache-2.0, Stargazers: 117, Issues: 4, Issues: 5

Entropy-ABF

Official implementation for 'Extending LLMs’ Context Window with 100 Samples'

SelfExtend

Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" in PyTorch and Zeta

Language: Python, License: MIT, Stargazers: 13, Issues: 3, Issues: 0

laserRMT-encoder

Fork of Fernando's implementation of 'Layer Selective Rank Reduction'

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 7, Issues: 0, Issues: 0

Anima

Moved to here: https://github.com/lyogavin/airllm

License: Apache-2.0, Stargazers: 5, Issues: 2, Issues: 0