Huizhuo Angela Yuan's repositories

Language: HTML | Stargazers: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0

SELM

The official implementation of Self-Exploring Language Models (SELM)

Language: Python | Stargazers: 0 | Issues: 0

SPPO

The official implementation of Self-Play Preference Optimization (SPPO)

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

trl

Train transformer language models with reinforcement learning.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

v202

Proceedings of ICML 2023

Language: TeX | Stargazers: 0 | Issues: 0