Beast code in Giters

Shuai Yuan's starred repositories

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Language: Python | MIT | 167055 1556 2681

open-interpreter

A natural language interface for computers

Language: Python | AGPL-3.0 | 52380 397 935

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language: Python | Apache-2.0 | 31616 200 4899

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language: Python | Apache-2.0 | 29382 339 268

ChatPaper

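Use ChatGPT to summarize arXiv papers. Speeds up the whole research workflow: use ChatGPT for full-paper summarization, professional translation, polishing, peer review, and review responses.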

Language: Python | NOASSERTION | 18287 94 217

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language: Python | Apache-2.0 | 15976 107 1037

sentence-transformers

State-of-the-Art Text Embeddings

Language: Python | Apache-2.0 | 14943 140 2139

trl

Train transformer language models with reinforcement learning.

Language: Python | Apache-2.0 | 9541 74 1124

tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Language: Rust | Apache-2.0 | 8915 119 982

BELLE

BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational large language model)

Language: HTML | Apache-2.0 | 7837 107 440

GPT2-Chinese

Chinese version of GPT2 training code, using BERT tokenizer.

Language: Python | MIT | 7449 160 251

OpenPrompt

An Open-Source Framework for Prompt-Learning.

Language: Python | Apache-2.0 | 4312 43 256

GPT2-chitchat

GPT2 for Chinese chitchat (implements DialoGPT's MMI**)

Language: Python | 2980 41 118

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs, and parameter-efficient methods (e.g., LoRA, P-tuning) for easy use. We welcome open-source enthusiasts to open any meaningful PR on this repo and to integrate as many LLM-related technologies as possible. We have built a fine-tuning platform that makes it easy for researchers to get started with and use large models.

Language: Jupyter Notebook | Apache-2.0 | 2586 36 100

adapters

A Unified Library for Parameter-Efficient and Modular Transfer Learning

Language: Jupyter Notebook | Apache-2.0 | 2539 31 381

P-tuning-v2

An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks

Language: Python | Apache-2.0 | 1968 29 75

CDial-GPT

A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models

Language: Python | MIT | 1765 28 108

t-few

Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"

Language: Python | MIT | 426 8 32

NCISurvey

Neural Code Intelligence Survey 2024; Reading lists and resources

MIT | 199 5 2

ALaCarte

Language: Python | MIT | 103 11 2

ControlPrefixes

Language: Python | Apache-2.0 | 90 4 14

bert_seq2seq_DDP

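The DDP version of bert_seq2seq. Supports models such as bert, roberta, nezha, t5, and gpt2, and tasks such as seq2seq, NER, and relation extraction. DDP multi-GPU training starts easily, with no extra code required.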

Language: Python | Apache-2.0 | 45 2 2

QAlign

Language: Ruby | 32 1 1

user-simulator

Language: Python | Apache-2.0 | 28 2 7

Corex

Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration

Language: Python | 15 10

Alpaca-Light

[Project] Tune LLaMA with Prefix/LoRA on English/Chinese instruction datasets

Language: Jupyter Notebook | Apache-2.0 | 10 1 1

reinforced-dialog-system-for-learning

Code for the NAACL 2022 paper "Learning as Conversation: Dialogue Systems Reinforced for Information Acquisition". Uses self-play and reinforcement learning to train a dialogue agent that aims to convey knowledge to the end user.

Language: Python | 7 5 2

Luciferder