daveshap / RLHI

Reinforcement Learning with Heuristic Imperatives - Finetuning LLMs for Post-Conventional Moral Intuition

daveshap/RLHI Issues