feyzaakyurek / rl4f

Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.
