mounicam / hedge_trimmer

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

The project implements a set of deletion rules to simplify a sentence. The rules are taken from the paper - "Hedge Trimmer: A Parse-and-Trim Approach to Headline Generation"

To run the code, use the following format: python main.py --complex <input file with complex sentences> --simple <out file with simplified sentences>

Please make sure you run the Stanford CoreNLP server on the port 9000, before you run the script. This is needed extract the constituency parse of the sentence.

Currently, there is not final word threshold for deletion. In other words, the deletion is aggresive. To change this setting, feel free to increase the THRESHOLD in main.py file.

About


Languages

Language:Python 100.0%