princeton-nlp / SimPO

SimPO: Simple Preference Optimization with a Reference-Free Reward

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

princeton-nlp/SimPO Watchers