There are 0 repositories under the language-model-evaluation topic.
A library & tools to evaluate predictive language models.
Curriculum is a new format of natural language inference (NLI) benchmark for evaluating broad-coverage linguistic phenomena. Because it is organized around linguistic phenomena, the benchmark can serve as a tool for diagnosing model behavior and verifying the quality of what a model has learned.
Language Modeling