aa1607 / allennlp_sempar

Experiments with AllenNLP on semantic parsing datasets

AllenNLP for semantic parsing

This repository contains the data, configuration files, and scripts needed to reproduce the following results on the ATIS, GEO, and JOBS semantic parsing datasets, using the AllenNLP framework:

Model                            ATIS   GEO    JOBS
S2S + attention                  79.9   68.9   71.4
S2S + attention + ELMo           83.3   75.7   77.9
S2S + attention + OpenAI GPT     83.3   76.8   83.6
S2S + attention + BERT (Base)    83.5   75.7   82.9
S2S + attention + BERT (Large)   83.0   73.2   80.7

Make sure you have AllenNLP installed first!
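
AllenNLP can be installed with pip, for example (pin the version if these configs fail to load under a newer release):

pip install allennlp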

Training models

To train a model:

make train
# ... follow the prompts to specify the path to your model config
# (e.g. experiments/atis/seq2seq.json) and serialization directory.
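
Under the hood, make train most likely wraps AllenNLP's allennlp train command. A roughly equivalent direct invocation would be the following, where the serialization directory is illustrative and the --include-package modules are the same ones used for prediction below (the Makefile's exact call may differ):

allennlp train experiments/atis/seq2seq.json \
--serialization-dir /tmp/models/atis/seq2seq/run_001 \
--include-package nlpete.data.dataset_readers \
--include-package nlpete.models \
--include-package nlpete.training.metrics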

Prediction and Evaluation

After training, to generate predictions:

allennlp predict --output-file [FILENAME] --predictor simple_seq2seq \
[SERIALIZED_MODEL] [INPUT_JSONL]

For example, to generate predictions on ATIS for a model that has been serialized to /tmp/models/atis/seq2seq/run_001:

allennlp predict --output-file predictions/atis/seq2seq.jsonl \
--predictor simple_seq2seq \
--include-package nlpete.data.dataset_readers \
--include-package nlpete.models \
--include-package nlpete.training.metrics \
/tmp/models/atis/seq2seq/run_001/model.tar.gz data/atis/atis_test.jsonl

The predictions folder already contains generated predictions for anyone who simply wants to verify the results. Once predictions have been generated, the model's accuracy against the gold outputs on the test set can be computed by following the code in results.ipynb.
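
For reference, the evaluation amounts to an exact-match comparison between predicted and gold output sequences. Below is a minimal Python sketch of that computation, assuming each prediction line carries a "predicted_tokens" list (as emitted by the simple_seq2seq predictor) and each gold line a "target" string; the actual field names and any normalization steps are defined in results.ipynb.

import json

# Hypothetical paths for illustration -- point these at your own files.
pred_file = "predictions/atis/seq2seq.jsonl"
gold_file = "data/atis/atis_test.jsonl"

def load_jsonl(path):
    with open(path) as f:
        return [json.loads(line) for line in f]

predictions = load_jsonl(pred_file)
gold = load_jsonl(gold_file)
assert len(predictions) == len(gold), "prediction/gold counts differ"

correct = 0
for pred, inst in zip(predictions, gold):
    # simple_seq2seq output includes a "predicted_tokens" list; the gold
    # field name ("target" here) is an assumption -- check results.ipynb.
    predicted = " ".join(pred["predicted_tokens"])
    correct += int(predicted.strip() == inst["target"].strip())

print(f"Exact-match accuracy: {correct / len(gold):.3f}")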
