chenggr (123penny123)

123penny123

Geek Repo

Company:Southeast University

Github PK Tool:Github PK Tool

chenggr's starred repositories

vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Language:PythonLicense:Apache-2.0Stargazers:1921Issues:0Issues:0

Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.

Language:ShellStargazers:5855Issues:0Issues:0

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language:PythonLicense:Apache-2.0Stargazers:12420Issues:0Issues:0

grok-1

Grok open release

Language:PythonLicense:Apache-2.0Stargazers:49146Issues:0Issues:0

machine-learning-interview

算法工程师-机器学习面试题总结

Stargazers:1163Issues:0Issues:0

Artificial-Intelligence-Terminology-Database

A comprehensive mapping database of English to Chinese technical vocabulary in the artificial intelligence domain

License:NOASSERTIONStargazers:1879Issues:0Issues:0

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language:PythonLicense:Apache-2.0Stargazers:2126Issues:0Issues:0

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language:PythonLicense:MITStargazers:4386Issues:0Issues:0

alpaca_farm

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

Language:PythonLicense:Apache-2.0Stargazers:738Issues:0Issues:0

alignment-handbook

Robust recipes to align language models with human and AI preferences

Language:PythonLicense:Apache-2.0Stargazers:4173Issues:0Issues:0

UltraFeedback

A large-scale, fine-grained, diverse preference dataset (and models).

Language:PythonLicense:MITStargazers:278Issues:0Issues:0

LLM-Blender

[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cut the weaknesses through ranking and integrate the strengths through fusing generation to enhance the capability of LLMs.

Language:PythonLicense:Apache-2.0Stargazers:823Issues:0Issues:0

awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

License:Apache-2.0Stargazers:3004Issues:0Issues:0

self-rewarding-lm-pytorch

Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI

Language:PythonLicense:MITStargazers:1266Issues:0Issues:0

trl

Train transformer language models with reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:8676Issues:0Issues:0

GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language:PythonLicense:Apache-2.0Stargazers:5573Issues:0Issues:0

MOSS-RLHF

MOSS-RLHF

Language:PythonLicense:Apache-2.0Stargazers:1211Issues:0Issues:0

Multitask-Learning

Awesome Multitask Learning Resources

Stargazers:627Issues:0Issues:0

direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Language:PythonLicense:Apache-2.0Stargazers:1832Issues:0Issues:0

Xwin-LM

Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment

Language:PythonStargazers:1005Issues:0Issues:0

Llama-X

Open Academic Research on Improving LLaMA to SOTA LLM

Language:PythonLicense:Apache-2.0Stargazers:1582Issues:0Issues:0

DecryptPrompt

总结Prompt&LLM论文,开源数据&模型,AIGC应用

Stargazers:2362Issues:0Issues:0
Language:JavaScriptLicense:NOASSERTIONStargazers:686Issues:0Issues:0

MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。

Language:PythonLicense:Apache-2.0Stargazers:2932Issues:0Issues:0

Paper-Reading-ConvAI

📖 Paper reading list in dialogue systems and natural language generation (constantly updating 🤗).

Stargazers:959Issues:0Issues:0

KnowledgeEditingPapers

Must-read Papers on Knowledge Editing for Large Language Models.

License:MITStargazers:705Issues:0Issues:0

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language:PythonLicense:Apache-2.0Stargazers:25534Issues:0Issues:0

AI-interview-cards

最完整的AI算法面试题目仓库,1000道,25个类目

Stargazers:977Issues:0Issues:0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:33667Issues:0Issues:0

Pixel-Navigator

Official GitHub Repository for Paper "Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill", ICRA 2024

Language:PythonStargazers:45Issues:0Issues:0