Recipes to train reward model for RLHF.
Home Page:https://rlhflow.github.io/
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool