Data and code for the ACL 2019 paper "Multi-News: a Large-Scale Multi-Document Summarization Dataset and Abstractive Hierarchical Model."
Update:
Google Drive link for preprocessed dataset.
Link to unprocessed data (the only changes: each \n was replaced with "NEWLINE_CHAR", and "|||||" was appended to the end of each story).
Zipped unprocessed data.
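The two markers in the unprocessed data can be undone with a few lines of Python. This is a minimal sketch, not code from this repository: it assumes the stories for one example sit together in a single string, and the function name `split_stories` and the sample string are made up for illustration.

```python
# Markers described above for the unprocessed data.
STORY_SEPARATOR = "|||||"       # appended to the end of each story
NEWLINE_TOKEN = "NEWLINE_CHAR"  # stands in for the original "\n"


def split_stories(raw_text):
    """Split one raw string into stories and restore their newlines.

    Hypothetical helper: assumes all stories of one example are
    concatenated in `raw_text`, each ending with STORY_SEPARATOR.
    """
    stories = []
    for chunk in raw_text.split(STORY_SEPARATOR):
        text = chunk.replace(NEWLINE_TOKEN, "\n").strip()
        if text:  # the trailing separator leaves an empty final chunk
            stories.append(text)
    return stories


# Made-up sample, not taken from the dataset.
example = "First story.NEWLINE_CHARSecond paragraph.|||||Another story.|||||"
for story in split_stories(example):
    print(repr(story))
```

Empty chunks are dropped because every story ends with the separator, so a plain split always yields one empty trailing piece.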