Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for collecting human feedback
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool