Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool