A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021.
Home Page:https://milanlproc.github.io/publication/2021-honest-hurtful-language-model/
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool