Francisco Piedrahita Velez's repositories
ALaRM
Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling"
Language:PythonApache-2.0000
Language:LuaMIT000
000
Language:Jupyter NotebookMIT000
Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling"