openai/lm-human-preferences Issues
The link in readme is broke.
Updated 4Azure data path gives 404
Closed 2question related to the code
Closed 1Got an error that I can't trace
Updated 1PPO training
Closed 5
Code for the paper Fine-Tuning Language Models from Human Preferences