chenggr (123penny123)

Company: Southeast University

chenggr's starred repositories

grok-1

Grok open release

Language: Python | License: Apache-2.0 | Stargazers: 49198 | Issues: 560 | Issues: 202

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python | License: Apache-2.0 | Stargazers: 33979 | Issues: 341 | Issues: 2655
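
As a quick orientation to the training-engine API, here is a hedged sketch of wrapping a toy PyTorch model with deepspeed.initialize under an assumed ZeRO stage-2 / fp16 config; SimpleNet, the config values, and the random data are illustrative placeholders, not anything from this listing.

```python
# Hedged sketch: wrapping a toy PyTorch model with DeepSpeed (ZeRO stage 2, fp16).
# SimpleNet, the config values, and the random data are illustrative placeholders.
import torch
import deepspeed

class SimpleNet(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = torch.nn.Linear(128, 2)

    def forward(self, x):
        return self.linear(x)

ds_config = {
    "train_batch_size": 32,
    "fp16": {"enabled": True},
    "zero_optimization": {"stage": 2},   # shard optimizer state and gradients
    "optimizer": {"type": "Adam", "params": {"lr": 1e-4}},
}

model = SimpleNet()
# deepspeed.initialize returns (engine, optimizer, dataloader, lr_scheduler).
model_engine, optimizer, _, _ = deepspeed.initialize(
    model=model, model_parameters=model.parameters(), config=ds_config
)

for _ in range(10):
    x = torch.randn(32, 128, device=model_engine.device).half()
    y = torch.randint(0, 2, (32,), device=model_engine.device)
    loss = torch.nn.functional.cross_entropy(model_engine(x), y)
    model_engine.backward(loss)   # engine handles loss scaling and gradient sharding
    model_engine.step()
```

A script like this is normally launched with the deepspeed launcher, which sets up the distributed environment before initialize is called.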

LLaMA-Factory

A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language: Python | License: Apache-2.0 | Stargazers: 27521 | Issues: 185 | Issues: 4364

Qwen

The official repo of Qwen (通义千问), the chat and pretrained large language model proposed by Alibaba Cloud.

Language: Python | License: Apache-2.0 | Stargazers: 12763 | Issues: 98 | Issues: 1032

trl

Train transformer language models with reinforcement learning.

Language: Python | License: Apache-2.0 | Stargazers: 8853 | Issues: 76 | Issues: 1007
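
For context on what "training with reinforcement learning" looks like here, below is a hedged sketch based on the classic (pre-0.12) trl PPO interface; exact signatures vary across trl versions, and the constant reward is a stand-in for a learned reward model.

```python
# Hedged sketch of the classic (pre-0.12) trl PPO interface; signatures differ
# across trl versions, and the constant reward is a placeholder for a learned
# reward model's score.
import torch
from transformers import AutoTokenizer
from trl import AutoModelForCausalLMWithValueHead, PPOConfig, PPOTrainer

model = AutoModelForCausalLMWithValueHead.from_pretrained("gpt2")
ref_model = AutoModelForCausalLMWithValueHead.from_pretrained("gpt2")
tokenizer = AutoTokenizer.from_pretrained("gpt2")
tokenizer.pad_token = tokenizer.eos_token

ppo_trainer = PPOTrainer(PPOConfig(batch_size=1, mini_batch_size=1),
                         model, ref_model, tokenizer)

query = tokenizer.encode("The movie was", return_tensors="pt")[0]
generated = model.generate(query.unsqueeze(0), max_new_tokens=16,
                           pad_token_id=tokenizer.eos_token_id)
response = generated[0, query.shape[0]:]   # keep only the generated continuation
reward = torch.tensor(1.0)                 # placeholder reward score

# One PPO update from aligned lists of (query, response, reward).
stats = ppo_trainer.step([query], [response], [reward])
```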

Qwen2

Qwen2 is the large language model series developed by the Qwen team at Alibaba Cloud.

GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language: Python | License: Apache-2.0 | Stargazers: 5767 | Issues: 37 | Issues: 287

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language: Python | License: MIT | Stargazers: 4403 | Issues: 49 | Issues: 285

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language: Python | License: Apache-2.0 | Stargazers: 4257 | Issues: 111 | Issues: 124

awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large language models, implementing incremental pretraining (PT), supervised fine-tuning (SFT), RLHF, DPO, and ORPO.

Language: Python | License: Apache-2.0 | Stargazers: 3011 | Issues: 33 | Issues: 368

DecryptPrompt

A summary of Prompt & LLM papers, open-source data & models, and AIGC applications.

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language: Python | License: Apache-2.0 | Stargazers: 2134 | Issues: 26 | Issues: 54

vall-e

PyTorch implementation of VALL-E (zero-shot text-to-speech), with a reproduced demo at https://lifeiteng.github.io/valle/index.html

Language: Python | License: Apache-2.0 | Stargazers: 1951 | Issues: 50 | Issues: 125

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language: Python | License: Apache-2.0 | Stargazers: 1896 | Issues: 19 | Issues: 77
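
The core objective is compact enough to sketch directly: the minimal PyTorch snippet below computes the DPO loss from precomputed sequence log-probabilities (the random tensors are placeholders, not outputs of any particular model).

```python
# Minimal sketch of the DPO objective from precomputed sequence log-probabilities.
# Each tensor holds summed token log-probs of the chosen/rejected responses under
# the policy or the frozen reference model; the random values are placeholders.
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps, policy_rejected_logps,
             ref_chosen_logps, ref_rejected_logps, beta=0.1):
    # L = -log sigmoid(beta * [(log pi/ref)(chosen) - (log pi/ref)(rejected)])
    chosen_logratio = policy_chosen_logps - ref_chosen_logps
    rejected_logratio = policy_rejected_logps - ref_rejected_logps
    return -F.logsigmoid(beta * (chosen_logratio - rejected_logratio)).mean()

# Toy usage on a batch of 4 preference pairs.
b = 4
loss = dpo_loss(torch.randn(b), torch.randn(b), torch.randn(b), torch.randn(b))
print(float(loss))
```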

Artificial-Intelligence-Terminology-Database

A comprehensive mapping database of English to Chinese technical vocabulary in the artificial intelligence domain

Llama-X

Open Academic Research on Improving LLaMA to SOTA LLM

Language: Python | License: Apache-2.0 | Stargazers: 1582 | Issues: 41 | Issues: 21

self-rewarding-lm-pytorch

Implementation of the training framework proposed in Self-Rewarding Language Models, from Meta AI.

Language: Python | License: MIT | Stargazers: 1273 | Issues: 23 | Issues: 17

MOSS-RLHF

Language: Python | License: Apache-2.0 | Stargazers: 1235 | Issues: 34 | Issues: 51

machine-learning-interview

A summary of machine learning interview questions for algorithm engineers.

Xwin-LM

Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment

AI-interview-cards

The most complete repository of AI algorithm interview questions: 1,000 questions across 25 categories.

Paper-Reading-ConvAI

📖 Paper reading list in conversational AI (constantly updating 🤗).

LLM-Blender

[ACL 2023] We introduce LLM-Blender, an ensembling framework that attains consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cuts away weaknesses through ranking and integrates strengths through fused generation to enhance the capabilities of LLMs.

Language: Python | License: Apache-2.0 | Stargazers: 833 | Issues: 15 | Issues: 23
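
To make the rank-then-fuse idea concrete, here is a conceptual plain-Python sketch; score_pair and fuse are hypothetical stand-ins for a pairwise ranker and a fusion model, not LLM-Blender's actual API.

```python
# Conceptual sketch of a rank-then-fuse ensemble in the spirit of LLM-Blender;
# score_pair and fuse are hypothetical stand-ins for a pairwise ranker and a
# fusion model, not the library's actual API.
from itertools import combinations

def rank_candidates(question, candidates, score_pair):
    """Order candidate answers by how many pairwise comparisons they win."""
    wins = {c: 0 for c in candidates}
    for a, b in combinations(candidates, 2):
        winner = a if score_pair(question, a, b) >= 0 else b
        wins[winner] += 1
    return sorted(candidates, key=lambda c: wins[c], reverse=True)

def blend(question, candidates, score_pair, fuse, top_k=3):
    """Keep the top-k ranked candidates and fuse them into a single answer."""
    top = rank_candidates(question, candidates, score_pair)[:top_k]
    return fuse(question, top)

# Toy usage with trivial stand-ins for the ranker and the fuser.
answer = blend(
    "What is 2 + 2?",
    ["4", "four", "twenty-two"],
    score_pair=lambda q, a, b: len(b) - len(a),  # dummy: prefer shorter answers
    fuse=lambda q, top: top[0],                  # dummy: return the best-ranked one
)
print(answer)  # -> "4"
```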

alpaca_farm

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

Language: Python | License: Apache-2.0 | Stargazers: 742 | Issues: 8 | Issues: 41

KnowledgeEditingPapers

[Knowledge Editing] Must-read Papers on Knowledge Editing for Large Language Models.

Language: JavaScript | License: NOASSERTION | Stargazers: 718 | Issues: 8 | Issues: 0

Multitask-Learning

Awesome Multitask Learning Resources

UltraFeedback

A large-scale, fine-grained, diverse preference dataset (and models).

Language: Python | License: MIT | Stargazers: 285 | Issues: 10 | Issues: 13

Pixel-Navigator

Official GitHub repository for the paper "Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill" (ICRA 2024).