Hacky Huang's starred repositories
SpeculativeDecodingPapers
📰 Must-read papers and blogs on Speculative Decoding ⚡️
smart_router
A smart router to switch between GPT-3.5 and GPT-4 based on the hardness of the context. Aim to reduce cost while keeping the performance ≈ GPT-3¾.
Organized-LLM-Agents
Embodied and organized multi-LLM-agent teams supporting communication for >3 agents. Source codes for the paper "Embodied LLM Agents Learn to Cooperate in Organized Teams".
c4-dataset-script
Inspired by google c4, here is a series of colossal clean data cleaning scripts focused on CommonCrawl data processing. Including Chinese data processing and cleaning methods in MassiveText.
awesome-pydantic
A curated list of awesome things related to Pydantic! 🌪️
gpt_academic
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。
instructor
structured outputs for llms
LoRA-EXTRACTOR-Colab
A small script to extract LoRA models from custom checkpoints, in Google Colab.
LLM-Shearing
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
Co-LLM-Agents
Source codes for the paper "Building Cooperative Embodied Agents Modularly with Large Language Models"
LLMSpeculativeSampling
Fast inference from large lauguage models via speculative decoding
UltraFeedback
A large-scale, fine-grained, diverse preference dataset (and models).
alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.