Official code for the paper "RLHS: Mitigating Misalignment in RLHF with Hindsight Simulation".
Authors: Kaiqu Liang, Haimin Hu, Ryan Liu, Tom Griffiths, Jaime Fernández Fisac.
Code will be coming soon!
RLHS: Mitigating Misalignment in RLHF with Hindsight Simulation
https://rl-hindsight.github.io/
Repository from Github https://github.comSafeRoboticsLab/RLHS
Official code for the paper "RLHS: Mitigating Misalignment in RLHF with Hindsight Simulation".
Authors: Kaiqu Liang, Haimin Hu, Ryan Liu, Tom Griffiths, Jaime Fernández Fisac.
Code will be coming soon!