jcolano / RLHF

Reinforcement Learning with Human Feedback

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

RLHF

Reinforcement Learning with Human Feedback

Step 1: Create the rewards model

About

Reinforcement Learning with Human Feedback

License:Apache License 2.0


Languages

Language:Jupyter Notebook 100.0%