huggingface / trl

Train transformer language models with reinforcement learning.

Home Page:http://hf.co/docs/trl

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

huggingface/trl Watchers