halfrot / ALaRM

[ACL 2024] Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling"

https://alarm-fdu.github.io/

halfrot/ALaRM Stargazers

Daxiong
18140663659
MMMmmm
Aureole-1210
Mengyu Bu
bingo123122121
cnxup
cnxupupup
Yuhang Lai
halfrot
Jaehyeok Lee
JaehyeokLee-119
Licong Guan
licongguan
lsjlsj35
pikepokenew
starstardd
Bob Serling
sunlibo2390
Tianbao Xie
Timothyxxx
WhitneyYan
hackaday
yqt
Zae Myung Kim
zaemyung
