Shuai Yuan's starred repositories

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Language:PythonLicense:MITStargazers:167055Issues:1556Issues:2681

open-interpreter

A natural language interface for computers

Language:PythonLicense:AGPL-3.0Stargazers:52380Issues:397Issues:935

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:31616Issues:200Issues:4899

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonLicense:Apache-2.0Stargazers:29382Issues:339Issues:268

ChatPaper

Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复

Language:PythonLicense:NOASSERTIONStargazers:18287Issues:94Issues:217

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonLicense:Apache-2.0Stargazers:15976Issues:107Issues:1037

sentence-transformers

State-of-the-Art Text Embeddings

Language:PythonLicense:Apache-2.0Stargazers:14943Issues:140Issues:2139

HumanSystemOptimization

健康学习到150岁 - 人体系统调优不完全指南

trl

Train transformer language models with reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:9541Issues:74Issues:1124

tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Language:RustLicense:Apache-2.0Stargazers:8915Issues:119Issues:982

BELLE

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

Language:HTMLLicense:Apache-2.0Stargazers:7837Issues:107Issues:440

GPT2-Chinese

Chinese version of GPT2 training code, using BERT tokenizer.

Language:PythonLicense:MITStargazers:7449Issues:160Issues:251

stopwords

中文常用停用词表(哈工大停用词表、百度停用词表等)

OpenPrompt

An Open-Source Framework for Prompt-Learning.

Language:PythonLicense:Apache-2.0Stargazers:4312Issues:43Issues:256

GPT2-chitchat

GPT2 for Chinese chitchat/用于中文闲聊的GPT2模型(实现了DialoGPT的MMI**)

Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台,我们欢迎开源爱好者发起任何有意义的pr!

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2586Issues:36Issues:100

adapters

A Unified Library for Parameter-Efficient and Modular Transfer Learning

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2539Issues:31Issues:381

P-tuning-v2

An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks

Language:PythonLicense:Apache-2.0Stargazers:1968Issues:29Issues:75

CDial-GPT

A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models

Language:PythonLicense:MITStargazers:1765Issues:28Issues:108

t-few

Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"

Language:PythonLicense:MITStargazers:426Issues:8Issues:32

NCISurvey

Neural Code Intelligence Survey 2024; Reading lists and resources

Language:PythonLicense:MITStargazers:103Issues:11Issues:2
Language:PythonLicense:Apache-2.0Stargazers:90Issues:4Issues:14

bert_seq2seq_DDP

bert_seq2seq的DDP版本,支持bert、roberta、nezha、t5、gpt2等模型,支持seq2seq、ner、关系抽取等任务,无需添加额外代码,轻松启动DDP多卡训练。

Language:PythonLicense:Apache-2.0Stargazers:45Issues:2Issues:2
Language:PythonLicense:Apache-2.0Stargazers:28Issues:2Issues:7

Corex

Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration

Language:PythonStargazers:15Issues:1Issues:0

Alpaca-Light

[Project] Tune LLaMA with Prefix/LoRA on English/Chinese instruction datasets

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:10Issues:1Issues:1

reinforced-dialog-system-for-learning

Code for NAACL 2022 paper "Learning as Conversation: Dialogue Systems Reinforced for Information Acquisition". Using self-play and reinforcement learning to train a dialogue agent which aims at conveying knowledge to end user.