joeljang / RLPHF

Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

joeljang/RLPHF Stargazers