OpenLLMAI / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (Support 70B+ full tuning & LoRA & Mixtral & KTO)

Home Page:https://arxiv.org/abs/2405.11143

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

OpenLLMAI/OpenRLHF Stargazers