Automatic evaluation for natural language generation (NLG) systems. It takes input as pairs of generated sentece and references and outputs values of metrics.
- BLEU
- Distinct-N
- METEOR
- ROUGE
- CIDEr
and more
[WIP]
[WIP]
[WIP]
This repo is based on the following repositories: