There are 0 repository under llm-as-judge topic.
Code and data for ACL ARR 2024 paper "Benchmarking Cognitive Biases in Large Language Models as Evaluators"
Evaluate translations by either a self-hosted Embedder or using Chat-GPT as LLM-as-judge.