Xu_Ruijie's starred repositories
self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Models, from Meta AI
DeepSeek-LLM
DeepSeek LLM: Let there be answers
ML-Papers-of-the-Week
🔥Highlighting the top ML papers every week.
arena-hard-auto
Arena-Hard-Auto: An automatic LLM benchmark.
DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
relative-preference-optimization
Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts
alignment-handbook
Robust recipes to align language models with human and AI preferences
DAMO-ConvAI
DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI.
text-generation-inference
Large Language Model Text Generation Inference
Self-Contrast
Extensive Self-Contrast Enables Feedback-Free Language Model Alignment
chain-of-thought-hub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
lm-evaluation-harness
A framework for few-shot evaluation of language models.
lm-human-preferences
Code for the paper "Fine-Tuning Language Models from Human Preferences"
alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
LLaMA-Factory
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
awesome-instruction-dataset
A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca)