Huizhuo Angela Yuan's repositories
alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Language:Jupyter NotebookApache-2.0000
Language:HTML000
Language:Python000
SELM
The official implementation of Self-Exploring Language Models (SELM)
Language:Python000
SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
Language:PythonApache-2.0000
TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Language:PythonApache-2.0000
trl
Train transformer language models with reinforcement learning.
Language:PythonApache-2.0000
v202
Proceedings of ICML 2023
Language:TeX000