Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool
wanghao-007 opened this issue 6 months ago · comments
code and dataset?