Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using pytorch and huggingface Trainers.
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool