princeton-nlp / LLMBar

[ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following

Home Page:https://arxiv.org/abs/2310.07641

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

princeton-nlp/LLMBar Issues