Lee Gao (leegao)

leegao

Geek Repo

Company:Google

Location:Jersey City, NJ

Home Page:http://phailed.me/

Github PK Tool:Github PK Tool

Lee Gao's starred repositories

llama.cpp

LLM inference in C/C++

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:19922Issues:192Issues:2817

mlc-llm

Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

Language:PythonLicense:Apache-2.0Stargazers:17276Issues:164Issues:1086

generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

txtai

💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows

Language:PythonLicense:Apache-2.0Stargazers:7148Issues:79Issues:686

chat-ui

Open source codebase powering the HuggingChat app

Language:TypeScriptLicense:Apache-2.0Stargazers:6425Issues:77Issues:453

llmware

Unified framework for building enterprise RAG pipelines with small, specialized models

Language:PythonLicense:Apache-2.0Stargazers:3894Issues:38Issues:112

Anima

33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3326Issues:97Issues:128

rift

Rift: an AI-native language server for your personal AI software engineer

Language:PythonLicense:Apache-2.0Stargazers:3017Issues:30Issues:87

mergekit

Tools for merging pretrained large language models.

Language:PythonLicense:LGPL-3.0Stargazers:1964Issues:24Issues:109

EasyEdit

An Easy-to-use Knowledge Editing Framework for LLMs.

Language:Jupyter NotebookLicense:MITStargazers:1449Issues:18Issues:209

FastEdit

🩹Editing large language models within 10 seconds⚡

Language:PythonLicense:Apache-2.0Stargazers:1202Issues:14Issues:25

yarn

YaRN: Efficient Context Window Extension of Large Language Models

Language:PythonLicense:MITStargazers:1184Issues:14Issues:52

attention_sinks

Extend existing LLMs way beyond the original training length with constant memory usage, without retraining

Language:PythonLicense:Apache-2.0Stargazers:643Issues:12Issues:29

KnowledgeEditingPapers

Must-read Papers on Knowledge Editing for Large Language Models.

augmentoolkit

Convert Compute And Books Into Instruct-Tuning Datasets

Language:PythonLicense:MITStargazers:460Issues:12Issues:12

laser

The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction

Language:PythonLicense:MITStargazers:324Issues:22Issues:19

ai-clone-whatsapp

Create an AI clone of yourself from your WhatsApp chats (using Llama 3)

Language:PythonLicense:NOASSERTIONStargazers:290Issues:7Issues:7

laserRMT

This is our own implementation of 'Layer Selective Rank Reduction'

Language:PythonLicense:Apache-2.0Stargazers:214Issues:10Issues:7

LLaMa2lang

Convenience scripts to finetune (chat-)LLaMa3 and other models for any language

Language:PythonLicense:Apache-2.0Stargazers:205Issues:10Issues:37

large-sequence-modeling

Transformers with Arbitrarily Large Context, No Approximations

Language:PythonLicense:Apache-2.0Stargazers:187Issues:4Issues:8

LLM-Alchemy-Chamber

a friendly neighborhood repository with diverse experiments and adventures in the world of LLMs

Language:Jupyter NotebookLicense:MITStargazers:115Issues:1Issues:1

selfextend

an implementation of Self-Extend, to expand the context window via grouped attention

Language:PythonLicense:Apache-2.0Stargazers:114Issues:4Issues:5

gbnfgen

TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces

Language:TypeScriptLicense:MITStargazers:113Issues:2Issues:3

langfun

Empower LLMs with Symbols.

Language:PythonLicense:Apache-2.0Stargazers:79Issues:5Issues:1

Entropy-ABF

Official implementation for 'Extending LLMs’ Context Window with 100 Samples'

BrainHackingChip

Brain-Hacking Chip: inject negative prompts directly into your LLM's thoughts with this oobabooga extension!

Language:PythonLicense:AGPL-3.0Stargazers:48Issues:5Issues:5

SelfExtend

Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta

Language:PythonLicense:MITStargazers:11Issues:2Issues:0

laserRMT-encoder

Fork of Fernandos implementation of 'Layer Selective Rank Reduction'

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6Issues:0Issues:0