chenggr's starred repositories
machine-learning-interview
算法工程师-机器学习面试题总结
Artificial-Intelligence-Terminology-Database
A comprehensive mapping database of English to Chinese technical vocabulary in the artificial intelligence domain
alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
alignment-handbook
Robust recipes to align language models with human and AI preferences
UltraFeedback
A large-scale, fine-grained, diverse preference dataset (and models).
LLM-Blender
[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cut the weaknesses through ranking and integrate the strengths through fusing generation to enhance the capability of LLMs.
awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Multitask-Learning
Awesome Multitask Learning Resources
direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
DecryptPrompt
总结Prompt&LLM论文,开源数据&模型,AIGC应用
MedicalGPT
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。
Paper-Reading-ConvAI
📖 Paper reading list in dialogue systems and natural language generation (constantly updating 🤗).
KnowledgeEditingPapers
Must-read Papers on Knowledge Editing for Large Language Models.
LLaMA-Factory
Unify Efficient Fine-Tuning of 100+ LLMs
AI-interview-cards
最完整的AI算法面试题目仓库,1000道,25个类目
Pixel-Navigator
Official GitHub Repository for Paper "Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill", ICRA 2024