Batch Normalized Recurrent Neural Networks
Theano code for the Penn Treebank language model experiments in the paper Batch Normalized Recurrent Neural Networks. The baseline LSTMs reproduce the results from the paper Recurrent Neural Network Regularization.
The name of the repo is, of course, based on Karpathy's char-rnn.
Requirements
Theano is required for running the experiments:
pip install Theano
Plotly is optional and is used to generate plots after training:
pip install plotly
Experiments
To run the small reference model on CPU:
python experiments/ptb_small_ref.py
To run the small normalized model on GPU:
THEANO_FLAGS=device=gpu python experiments/ptb_small_norm.py
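For readers unfamiliar with the core operation, here is a minimal NumPy sketch of batch normalization (normalizing each feature over the batch axis, then scaling and shifting), as described by Ioffe & Szegedy. The function name and toy data are illustrative only and do not come from this repo's code.

```python
import numpy as np

def batch_norm(x, gamma=1.0, beta=0.0, eps=1e-5):
    # Normalize each feature over the batch dimension (axis 0),
    # then apply a learnable scale (gamma) and shift (beta).
    mean = x.mean(axis=0, keepdims=True)
    var = x.var(axis=0, keepdims=True)
    return gamma * (x - mean) / np.sqrt(var + eps) + beta

# Toy example: a batch of 4 samples with 3 features.
x = np.random.randn(4, 3) * 5.0 + 2.0
y = batch_norm(x)
print(y.mean(axis=0))  # each feature mean is close to 0
print(y.std(axis=0))   # each feature std is close to 1
```

In the normalized recurrent models, this operation is applied per timestep to the hidden-state projections inside the LSTM rather than to a plain feed-forward activation, but the normalization itself is the same.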
References
- Ioffe, S., & Szegedy, C. (2015). Batch normalization: Accelerating deep network training by reducing internal covariate shift.
- Zaremba, W., Sutskever, I., & Vinyals, O. (2014). Recurrent neural network regularization.