liziniu / ReMax

Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

liziniu/ReMax Stargazers