daveshap / RLHI

Reinforcement Learning with Heuristic Imperatives - Finetuning LLMs for Post-Conventional Moral Intuition

daveshap/RLHI Issues

Where is the code ?
Closed a year ago1
Article should list the Heuristic Imperatives
Closed a year ago1