jiamingkong / rwkv_reward

Training a reward model for RLHF using RWKV.

