HITSZ-HLT / CPPO

ICLR 2024 CPPO: Continual Learning for Reinforcement Learning with Human Feedback

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

CPPO

codes of ICLR 2024 paper "CPPO: Continual Learning for Reinforcement Learning with Human Feedback"

About

ICLR 2024 CPPO: Continual Learning for Reinforcement Learning with Human Feedback


Languages

Language:Python 99.9%Language:Shell 0.1%