MarkWuNLP / actor-critic-public

The source code for "An Actor Critic Algorithm for Structured Prediction"

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Actor-Critic for Sequence Prediction

The reference implementation for the paper

An Actor-Critic Algorithm for Sequence Prediction (openreview, submitted to ICLR 2017)
by Dzmitry Bahdanau, Philemon Brakel, Kelvin Xu, Anirudh Goyal, Ryan Lowe, Joelle Pineau, Aaron Courville, Yoshua Bengio

Note, that in fact it is a heavily modified speech recognizer, so please do not be surprised by the presense of speech related terms in the code. The code is provided only for replication purposes, further development is not planned. If you have questions, please contact Dzmitry Bahdanau or just create an issue here.

How to use

  • install all the dependencies (see the list below)
  • set your environment variables by calling source env.sh
  • for training use $LVSR/bin/run.py train <save-to> <config>
  • for testing use $LVSR/bin/run.py search <model-path> <config>

Please proceed to exp/ted for the instructions how to replicate our machine translation results on TED data, or to exp/billion_words in order to run our spelling correction experiments.

Dependencies

  • Python packages: pykwalify, toposort, pyyaml, numpy, pandas, picklable-itertools, theano;
  • blocks
  • blocks-extras
  • fuel

The code in this repository is known to work with master branches of the repositories listed above as of 21.11.2016

License

MIT

About

The source code for "An Actor Critic Algorithm for Structured Prediction"

License:MIT License


Languages

Language:Jupyter Notebook 94.4%Language:Python 5.0%Language:Shell 0.6%