minyang-chen / RLHF_example

Reinforcement learning from human feedback (RLHF) Movie Reviews Example

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

RLHF_example

Reinforcement learning from human feedback (RLHF) Movie Reviews Example

About

Reinforcement learning from human feedback (RLHF) Movie Reviews Example

License:Apache License 2.0


Languages

Language:Jupyter Notebook 100.0%