Kaffaljidhmah2

Hacky Huang's starred repositories

gpt-fast

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Language: Python · BSD-3-Clause · 526900

SpeculativeDecodingPapers

📰 Must-read papers and blogs on Speculative Decoding ⚡️

Apache-2.0 · 20100

smart_router

A smart router that switches between GPT-3.5 and GPT-4 based on the hardness of the context. Aims to reduce cost while keeping the performance ≈ GPT-3¾.

Language: Jupyter Notebook · Apache-2.0 · 800

Organized-LLM-Agents

Embodied and organized multi-LLM-agent teams supporting communication for >3 agents. Source code for the paper "Embodied LLM Agents Learn to Cooperate in Organized Teams".

Language: Python · 1200

c4-dataset-script

Inspired by Google C4, this is a series of Colossal Clean data cleaning scripts focused on CommonCrawl data processing. It includes Chinese data processing and the cleaning methods from MassiveText.

Language: Python · MIT · 10800

gemma

Open-weights LLM from Google DeepMind.

Language: Jupyter Notebook · Apache-2.0 · 215400

grok-1

Grok open release

Language: Python · Apache-2.0 · 4900400

awesome-pydantic

A curated list of awesome things related to Pydantic! 🌪️

MIT · 42500

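Provides a practical interactive interface for GPT, GLM, and other large language models (LLMs), specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other codebases; translation and summarization of PDF/LaTeX papers; parallel querying of multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, moss, and others.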

Language: Python · GPL-3.0 · 6014800

instructor

Structured outputs for LLMs.

Language: Python · MIT · 612100

REST

REST: Retrieval-Based Speculative Decoding, NAACL 2024

Language: C · Apache-2.0 · 14300

langfun

Empower LLMs with Symbols.

Language: Python · Apache-2.0 · 8300

LoRA-EXTRACTOR-Colab

A small script to extract LoRA models from custom checkpoints, in Google Colab.

Language: Python · 1000

ToRA

ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].

Language: Python · MIT · 84800