imohammad12 / GRS

An unsupervised approach to sentence simplification that combines text generation and text revision.


GRS

Code for our paper "GRS: Combining Generation and Revision in Unsupervised Sentence Simplification" accepted at Findings of the Association for Computational Linguistics: ACL 2022.

Setup

$ conda env create --file environment.yml
$ python src/main.py

You also need a CoreNLP server running on port 9000 before you run src/main.py. You can download the package from [here].
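Before launching the system, it can help to confirm that something is actually listening on port 9000. The following is a minimal sketch, not part of the GRS code base: the function name corenlp_port_open and the default host localhost are assumptions made for illustration. It only checks that a TCP connection succeeds; it does not verify that the listener is a CoreNLP server.

```python
import socket


def corenlp_port_open(host="localhost", port=9000, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds.

    Hypothetical helper (not from the GRS repository). It confirms that
    some process accepts connections on the port, nothing more.
    """
    try:
        # create_connection raises OSError (e.g. connection refused,
        # timeout) when nothing is reachable at the address.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    if corenlp_port_open():
        print("A server is listening on port 9000.")
    else:
        print("Nothing is listening on port 9000; start the CoreNLP server first.")
```

If the check fails, start the CoreNLP server and run the check again before running src/main.py.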

Constituent Models

The GRS system is composed of multiple pre-trained models, which can also be used independently of GRS. We have uploaded these models to the Hugging Face Hub. You can find them [here].

Citation

Please cite this paper if you use our code or system output.

@inproceedings{dehghan-etal-2022-grs,
    title = "{GRS}: Combining Generation and Revision in Unsupervised Sentence Simplification",
    author = "Dehghan, Mohammad  and
      Kumar, Dhruv  and
      Golab, Lukasz",
    booktitle = "Findings of the Association for Computational Linguistics: ACL 2022",
    month = may,
    year = "2022",
    address = "Dublin, Ireland",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2022.findings-acl.77",
    pages = "949--960",
    abstract = "We propose GRS: an unsupervised approach to sentence simplification that combines text generation and text revision. We start with an iterative framework in which an input sentence is revised using explicit edit operations, and add paraphrasing as a new edit operation. This allows us to combine the advantages of generative and revision-based approaches: paraphrasing captures complex edit operations, and the use of explicit edit operations in an iterative manner provides controllability and interpretability. We demonstrate these advantages of GRS compared to existing methods on the Newsela and ASSET datasets.",
}
