gantuo

gantuo's starred repositories

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supports a number of candidate inference solutions, such as HF TGI and vLLM, for local or cloud deployment. Includes demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language: Jupyter Notebook | 1069600

support.996.ICU

Microsoft and GitHub Workers Support 996.ICU

License: NOASSERTION | 1010300

agentUniverse

agentUniverse is an LLM multi-agent framework that allows developers to easily build multi-agent applications.

Language: Python | License: Apache-2.0 | 60400

FollowBench

Code for "FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models (ACL 2024)"

Language: Python | License: Apache-2.0 | 6200

prm800k

800,000 step-level correctness labels on LLM solutions to MATH problems

Language: Python | License: MIT | 135800

InsTag

InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python | License: Apache-2.0 | 2087100

math

The MATH Dataset (NeurIPS 2021)

Language: Python | License: MIT | 78600

chat-dataset-baseline

A manually curated Chinese dialogue dataset, plus a snippet of ChatGLM fine-tuning code

Language: Jupyter Notebook | 112100

dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

Language: TypeScript | License: NOASSERTION | 3874500

Eurus

Language: Python | License: Apache-2.0 | 25000

MoE-LLaVA

Mixture-of-Experts for Large Vision-Language Models

Language: Python | License: Apache-2.0 | 185600

langchain

🦜🔗 Build context-aware reasoning applications

Language: Jupyter Notebook | License: MIT | 8967000

tree-of-thought-llm

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Language: Python | License: MIT | 442400

ComposeOverscroll

Overscroll any scrollable items!

Language: Kotlin | License: GPL-3.0 | 16600

LLMDataHub

A quick guide (especially) for trending instruction finetuning datasets

License: MIT | 227800

evol-teacher

Open Source WizardCoder Dataset

Language: Python | License: Apache-2.0 | 14400

JARVIS

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Language: Python | License: MIT | 2340200

KwaiAgents

A generalized information-seeking agent system with Large Language Models (LLMs).

Language: Python | License: NOASSERTION | 104800

textbook_quality

Generate textbook-quality synthetic LLM pretraining data

Language: Python | License: MIT | 46700

ToolBench

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language models for tool learning.

Language: Python | License: Apache-2.0 | 463300

LLM-Agent-Paper-List

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Language: Jupyter Notebook | License: Apache-2.0 | 134100

GPT-4-LLM

Instruction Tuning with GPT-4

Language: HTML | License: Apache-2.0 | 410900

UltraFeedback

A large-scale, fine-grained, diverse preference dataset (and models).

Language: Python | License: MIT | 28600

UltraChat

Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)

Language: Python | License: MIT | 218300

Topical-Chat

A dataset containing human-human knowledge-grounded open-domain conversations.

Language: Python | 60600

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language: Python | License: Apache-2.0 | 3587300

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python | License: Apache-2.0 | 2356500

Baichuan2

A series of large language models developed by Baichuan Intelligent Technology

Language: Python | License: Apache-2.0 | 403800