SharathHebbar / dpo_chatgpt2

Direct Preference Optimization of ChatGPT2 using the TRL library
