jovany-wang / trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

100

jovany-wang/trlx Stargazers

Roman
Good4lien

Links

ProductDiscover

Data Powerby api.github.com. Remove your profile on the Giters? Go to settings.

Contact Site Admin: Giters.