RockeyCoss / SPO

Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step

Home Page:https://arxiv.org/abs/2406.04314

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

RockeyCoss/SPO Stargazers