vicgalle / refined-dpo

Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs

vicgalle/refined-dpo Stargazers

babu111
babybirdprd
Bin Fu (BinFuPKU)
bowersjames
Ficus (Fiiicus)
Joseph Cheng (indiejoseph)
Iván Moreno (ivanvmoreno)
Jeff Carpenter (JeffCarpenter)
Jason Poulos (jvpoulos)
Mario Garcia (mexicanamerican)
Víctor Gallego (vicgalle)
