There are 0 repository under automatic-evaluation topic.
MONSERRATE is a dataset specifically created to evaluate Question Generation systems. It has, on average, 26 questions associated to each source sentence, attempting to be an “exhaustive” reference.
Automatic Evaluation of Textual Answers on the famous Kaggle Automated Essay Scoring (AES) dataset.
Multidimensional Evaluation for Text Style Transfer Using ChatGPT. Human Judgement as a Compass to Navigate Automatic Metrics for Formality Transfer (HumEval 2022)
Success and Failure Linguistic Simplification Annotation 💃
An AI expert system to automatically evaluate subjective answers submitted in online assessments.