Hacky Huang's starred repositories

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language:PythonLicense:BSD-3-ClauseStargazers:5269Issues:0Issues:0

SpeculativeDecodingPapers

📰 Must-read papers and blogs on Speculative Decoding ⚡️

License:Apache-2.0Stargazers:201Issues:0Issues:0

smart_router

A smart router to switch between GPT-3.5 and GPT-4 based on the hardness of the context. Aim to reduce cost while keeping the performance ≈ GPT-3¾.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:8Issues:0Issues:0

Organized-LLM-Agents

Embodied and organized multi-LLM-agent teams supporting communication for >3 agents. Source codes for the paper "Embodied LLM Agents Learn to Cooperate in Organized Teams".

Language:PythonStargazers:12Issues:0Issues:0

c4-dataset-script

Inspired by google c4, here is a series of colossal clean data cleaning scripts focused on CommonCrawl data processing. Including Chinese data processing and cleaning methods in MassiveText.

Language:PythonLicense:MITStargazers:108Issues:0Issues:0

gemma

Open weights LLM from Google DeepMind.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2154Issues:0Issues:0

grok-1

Grok open release

Language:PythonLicense:Apache-2.0Stargazers:49004Issues:0Issues:0

awesome-pydantic

A curated list of awesome things related to Pydantic! 🌪️

License:MITStargazers:425Issues:0Issues:0

gpt_academic

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。

Language:PythonLicense:GPL-3.0Stargazers:60148Issues:0Issues:0

instructor

structured outputs for llms

Language:PythonLicense:MITStargazers:6121Issues:0Issues:0

REST

REST: Retrieval-Based Speculative Decoding, NAACL 2024

Language:CLicense:Apache-2.0Stargazers:143Issues:0Issues:0

langfun

Empower LLMs with Symbols.

Language:PythonLicense:Apache-2.0Stargazers:83Issues:0Issues:0

LoRA-EXTRACTOR-Colab

A small script to extract LoRA models from custom checkpoints, in Google Colab.

Language:PythonStargazers:10Issues:0Issues:0

ToRA

ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].

Language:PythonLicense:MITStargazers:848Issues:0Issues:0

LLM-Shearing

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Language:PythonLicense:MITStargazers:479Issues:0Issues:0

Co-LLM-Agents

Source codes for the paper "Building Cooperative Embodied Agents Modularly with Large Language Models"

Language:PythonStargazers:182Issues:0Issues:0

grace

[EMNLP 2023, Findings] GRACE: Discriminator-Guided Chain-of-Thought Reasoning

Language:PythonStargazers:36Issues:0Issues:0

LLMSpeculativeSampling

Fast inference from large lauguage models via speculative decoding

Language:PythonStargazers:404Issues:0Issues:0

gorilla

Gorilla: An API store for LLMs

Language:PythonLicense:Apache-2.0Stargazers:10449Issues:0Issues:0
Language:Jupyter NotebookStargazers:126Issues:0Issues:0

TheoremQA

The dataset and code for paper: TheoremQA: A Theorem-driven Question Answering dataset

Language:PythonLicense:MITStargazers:152Issues:0Issues:0

arxiv-gpt

An extension of chaotic_neural to visualize papers clustered using GPT-based embeddings

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

llm-awq

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Language:PythonLicense:MITStargazers:2000Issues:0Issues:0
Language:PythonLicense:MITStargazers:205Issues:0Issues:0

Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1947Issues:0Issues:0

FlexFlow

FlexFlow Serve: Low-Latency, High-Performance LLM Serving

Language:C++License:Apache-2.0Stargazers:1564Issues:0Issues:0

UltraFeedback

A large-scale, fine-grained, diverse preference dataset (and models).

Language:PythonLicense:MITStargazers:270Issues:0Issues:0

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language:PythonLicense:Apache-2.0Stargazers:2115Issues:0Issues:0

trl

Train transformer language models with reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:8479Issues:0Issues:0

alpaca_farm

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

Language:PythonLicense:Apache-2.0Stargazers:727Issues:0Issues:0