Dahoas / reward-modeling

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Could you make a Google Colab? I would like to try training using the reward modeling strategy, but my coding abilities are limited.

BigSalmon2 opened this issue · comments

commented