Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from easy questions to hard
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool