Yale-LILY / dart

Dataset for NAACL 2021 paper: "DART: Open-Domain Structured Data Record to Text Generation"

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Update Evaluation used for V.1.1.1

jordiclive opened this issue · comments

The references in /evaluation/dart_reference are not for the current version. Can you replace with the new references and share the tokenization script that is done to the predictions.

I am getting very different BLEU scores depending on tokenization, and how many references I use.
As there are up to ~30 for a few examples.

I would like to directly compare against the README leaderboard.

Upvoting this, since having the same issue here.

How to run BART in the model. Could you provide more details about running environment and python script.