Huizhuo Angela Yuan's repositories
Language:HTML000
Language:Python000
SELM
The official implementation of Self-Exploring Language Models (SELM)
Language:Python000
SPPO
The official implementation of Self-Play Preference Optimization (SPPO)
Language:PythonApache-2.0000
TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Language:PythonApache-2.0000
trl
Train transformer language models with reinforcement learning.
Language:PythonApache-2.0000
v202
Proceedings of ICML 2023
Language:TeX000