gantuo's starred repositories

llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language:Jupyter NotebookStargazers:10696Issues:0Issues:0

support.996.ICU

Microsoft and GitHub Workers Support 996.ICU

License:NOASSERTIONStargazers:10103Issues:0Issues:0

agentUniverse

agentUniverse is a LLM multi-agent framework that allows developers to easily build multi-agent applications.

Language:PythonLicense:Apache-2.0Stargazers:604Issues:0Issues:0

FollowBench

Code for "FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models (ACL 2024)"

Language:PythonLicense:Apache-2.0Stargazers:62Issues:0Issues:0

prm800k

800,000 step-level correctness labels on LLM solutions to MATH problems

Language:PythonLicense:MITStargazers:1358Issues:0Issues:0

InsTag

InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning

Stargazers:157Issues:0Issues:0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:20871Issues:0Issues:0

math

The MATH Dataset (NeurIPS 2021)

Language:PythonLicense:MITStargazers:786Issues:0Issues:0

chat-dataset-baseline

人工精调的中文对话数据集和一段chatglm的微调代码

Language:Jupyter NotebookStargazers:1121Issues:0Issues:0

dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

Language:TypeScriptLicense:NOASSERTIONStargazers:38745Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:250Issues:0Issues:0

MoE-LLaVA

Mixture-of-Experts for Large Vision-Language Models

Language:PythonLicense:Apache-2.0Stargazers:1856Issues:0Issues:0

langchain

🦜🔗 Build context-aware reasoning applications

Language:Jupyter NotebookLicense:MITStargazers:89670Issues:0Issues:0

tree-of-thought-llm

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Language:PythonLicense:MITStargazers:4424Issues:0Issues:0

ComposeOverscroll

Overscroll any scrollable items!

Language:KotlinLicense:GPL-3.0Stargazers:166Issues:0Issues:0

LLMDataHub

A quick guide (especially) for trending instruction finetuning datasets

License:MITStargazers:2278Issues:0Issues:0

evol-teacher

Open Source WizardCoder Dataset

Language:PythonLicense:Apache-2.0Stargazers:144Issues:0Issues:0

JARVIS

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Language:PythonLicense:MITStargazers:23402Issues:0Issues:0

KwaiAgents

A generalized information-seeking agent system with Large Language Models (LLMs).

Language:PythonLicense:NOASSERTIONStargazers:1048Issues:0Issues:0

textbook_quality

Generate textbook-quality synthetic LLM pretraining data

Language:PythonLicense:MITStargazers:467Issues:0Issues:0

ToolBench

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.

Language:PythonLicense:Apache-2.0Stargazers:4633Issues:0Issues:0

LLM-Agent-Paper-List

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

Stargazers:5851Issues:0Issues:0

alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1341Issues:0Issues:0

GPT-4-LLM

Instruction Tuning with GPT-4

Language:HTMLLicense:Apache-2.0Stargazers:4109Issues:0Issues:0

UltraFeedback

A large-scale, fine-grained, diverse preference dataset (and models).

Language:PythonLicense:MITStargazers:286Issues:0Issues:0

UltraChat

Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)

Language:PythonLicense:MITStargazers:2183Issues:0Issues:0

Topical-Chat

A dataset containing human-human knowledge-grounded open-domain conversations.

Language:PythonStargazers:606Issues:0Issues:0

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:35873Issues:0Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:23565Issues:0Issues:0

Baichuan2

A series of large language models developed by Baichuan Intelligent Technology

Language:PythonLicense:Apache-2.0Stargazers:4038Issues:0Issues:0