Francisco Piedrahita Velez's repositories

ALaRM

Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling"

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:LuaLicense:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0