likenneth / honest_llama

Inference-Time Intervention: Eliciting Truthful Answers from a Language Model

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

likenneth/honest_llama Stargazers