opendilab / awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

What is the main library to scale up RL training for LLMs?

aldopareja opened this issue

Assuming you have a reward model (say, the OpenAssistant reward model) and a target model (say, LLaMA) that you want to train at scale on a multi-node setup, what is the best codebase for this? DeepSpeed-Chat?