A Neural Attention Model for Abstractive Sentence Summarization

Unofficial DyNet implementation of the paper A Neural Attention Model for Abstractive Sentence Summarization (EMNLP 2015) [1].

1. Requirements

  • Python 3.6.0+
  • DyNet 2.0+
  • NumPy 1.12.1+
  • scikit-learn 0.19.0+
  • tqdm 4.15.0+
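
If any of these are missing, one way to install them is via pip (assuming the PyPI package names dynet, numpy, scikit-learn, and tqdm; adjust versions to your environment):

pip install 'dynet>=2.0' 'numpy>=1.12.1' 'scikit-learn>=0.19.0' 'tqdm>=4.15.0'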

2. Prepare dataset

To download the preprocessed Gigaword corpus, run:

sh download_data.sh

3. Train

Arguments

  • --gpu: GPU ID to use. Set -1 for CPU [default: 0]
  • --n_epochs: Number of epochs [default: 3]
  • --n_train: Number of training examples to use (up to 3803957) [default: 3803957]
  • --n_valid: Number of validation examples to use (up to 189651) [default: 189651]
  • --batch_size: Mini-batch size [default: 32]
  • --vocab_size: Vocabulary size [default: 60000]
  • --emb_dim: Embedding size [default: 256]
  • --hid_dim: Hidden state size [default: 256]
  • --encoder_type: Encoder type [default: attention]
    • bow: Bag-of-words encoder.
    • attention: Attention-based encoder.
  • --c: Window size in neural language model [default: 5]
  • --q: Window size in attention-based encoder [default: 2]
  • --alloc_mem: Amount of memory to allocate [MB] [default: 8192]

Command example

python train.py --n_epochs 10
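
Flags can be combined freely; for instance, the following illustrative command (all values arbitrary, not recommendations) trains the bag-of-words encoder on CPU with a smaller vocabulary:

python train.py --gpu -1 --encoder_type bow --vocab_size 30000 --batch_size 64 --n_epochs 5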

4. Test

Arguments

  • --gpu: GPU ID to use. Set -1 for CPU [default: 0]
  • --n_test: Number of test examples to use [default: 189651]
  • --beam_size: Beam size [default: 5]
  • --max_len: Maximum length of decoding [default: 100]
  • --model_file: Trained model file path [default: ./model_e1]
  • --input_file: Test file path [default: ./data/valid.article.filter.txt]
  • --output_file: Output file path [default: ./pred_y.txt]
  • --w2i_file: Word2Index file path [default: ./w2i.dump]
  • --i2w_file: Index2Word file path [default: ./i2w.dump]
  • --alloc_mem: Amount of memory to allocate [MB] [default: 1024]

Command example

python test.py --beam_size 10
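
As with training, flags can be combined; an illustrative CPU run that spells out the default file paths listed above:

python test.py --gpu -1 --beam_size 10 --model_file ./model_e1 --input_file ./data/valid.article.filter.txt --output_file ./pred_y.txt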

5. Evaluate

You can use pythonrouge [2] to compute the ROUGE scores. An example is in evaluate.ipynb.
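
If you prefer a script over the notebook, here is a minimal sketch of scoring predictions with pythonrouge, following the Pythonrouge usage pattern from that library's README. The reference path ./data/valid.title.filter.txt is an assumption about the downloaded corpus; adjust it to your data.

from pythonrouge.pythonrouge import Pythonrouge

# System summaries: one summary per line in pred_y.txt.
with open('./pred_y.txt') as f:
    summary = [[line.strip()] for line in f]

# Gold titles: one reference per line (path is an assumption; see above).
with open('./data/valid.title.filter.txt') as f:
    reference = [[[line.strip()]] for line in f]

rouge = Pythonrouge(summary_file_exist=False,
                    summary=summary, reference=reference,
                    n_gram=2, ROUGE_SU4=False, ROUGE_L=True,
                    recall_only=False, stemming=True, stopwords=False,
                    word_level=True, length_limit=False, length=50,
                    use_cf=False, cf=95, scoring_formula='average',
                    resampling=True, samples=1000, favor=True, p=0.5)
print(rouge.calc_score())  # dict of ROUGE-1/2/L recall and F1 scores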

6. Results

6.1. Gigaword

ROUGE scores computed on 101 validation examples.

                    ROUGE-1 (F1)   ROUGE-2 (F1)   ROUGE-L (F1)
My implementation   31.56          14.56          30.02

6.2. DUC2004

Work in progress.

7. Pretrained model

To download the pretrained model, run:

sh download_pretrained_model.sh
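
Assuming the script places the files at the test defaults (./model_e1, ./w2i.dump, ./i2w.dump; this is an assumption, so check where the script actually saves them), you can then decode with the pretrained model directly:

python test.py --model_file ./model_e1 --w2i_file ./w2i.dump --i2w_file ./i2w.dump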

References

[1] Alexander M. Rush, Sumit Chopra, and Jason Weston. A Neural Attention Model for Abstractive Sentence Summarization. EMNLP 2015.
[2] pythonrouge: https://github.com/tagucci/pythonrouge
