data used for evaluation?
malkin1729 opened this issue
Would it be possible to share the generated text used to compute metrics for SSD-LM and baselines?
I am interested in doing some analysis of the outputs with respect to measures beyond those used in the paper and hope to avoid rerunning the full generation.
Thank you (and thank you for the generally well-documented code and interesting paper!).