likenneth / q_probe

Q-Probe: A Lightweight Approach to Reward Maximization for Language Models

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

likenneth/q_probe Stargazers