hannw / sgrnn

TensorFlow implementation of synthetic gradients for RNNs (LSTM)

Home page: https://hannw.github.io/posts/synthetic-gradient-rnn


sgrnn

Synthetic Gradient for Recurrent Neural Networks

This repo is a TensorFlow implementation of synthetic gradients, also known as Decoupled Neural Interfaces (DNI), for recurrent neural networks (RNNs). The architecture is a multilayer LSTM used for word-level language modeling. For a detailed description of how synthetic gradients are applied to train this architecture, see the blog post at https://hannw.github.io/posts/synthetic-gradient-rnn.
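The core idea of DNI is to break the long backpropagation-through-time chain: at each truncation boundary, a small auxiliary model predicts the gradient of the future loss with respect to the current hidden state, and is itself trained toward a bootstrapped target (the local gradient plus the next segment's prediction). A minimal NumPy sketch of such a synthetic gradient module, with all names and the toy training loop being illustrative rather than the repo's actual code:

```python
import numpy as np

rng = np.random.default_rng(0)
H = 4  # toy hidden-state size

# Synthetic gradient module: a linear map from the hidden state to a
# prediction of dLoss_future / dHidden (a common minimal choice for DNI).
W = rng.normal(scale=0.1, size=(H, H))
b = np.zeros(H)

def predict_grad(h):
    """Predicted gradient of the future loss w.r.t. hidden state h."""
    return h @ W + b

def train_synthetic(h, target, lr=0.05):
    """One SGD step pulling the prediction toward the bootstrapped
    target; returns the squared error of this step."""
    global W, b
    err = predict_grad(h) - target  # d(MSE)/d(prediction), up to a constant
    W -= lr * np.outer(h, err)
    b -= lr * err
    return float(np.mean(err ** 2))

# Toy check: fit the module to a fixed linear "true" gradient function.
true_W = rng.normal(size=(H, H))
losses = []
for _ in range(2000):
    h = rng.normal(size=H)
    losses.append(train_synthetic(h, h @ true_W))
```

In the full training setup, `target` would be the true local gradient from the current truncated segment plus the synthetic gradient predicted at the start of the next segment, which is what lets each segment update without waiting for the rest of the sequence.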

The model is trained on the PTB dataset from Tomas Mikolov's webpage; the required files are in the data/ directory of the archive:

$ wget http://www.fit.vutbr.cz/~imikolov/rnnlm/simple-examples.tgz
$ tar xvf simple-examples.tgz
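Once extracted, the word-level text files (e.g. simple-examples/data/ptb.train.txt) are typically converted to integer ids before training. A minimal sketch of that preprocessing step, with function names that are illustrative rather than the repo's actual API:

```python
import collections

def build_vocab(text):
    """Map each word to an integer id, most frequent word first,
    breaking count ties alphabetically (a common PTB convention)."""
    counter = collections.Counter(text.split())
    words = sorted(counter, key=lambda w: (-counter[w], w))
    return {w: i for i, w in enumerate(words)}

def to_ids(text, vocab):
    """Replace each known word with its integer id."""
    return [vocab[w] for w in text.split() if w in vocab]

# Tiny stand-in for the PTB text; the real files use <eos> markers too.
sample = "the cat sat on the mat <eos> the dog sat <eos>"
vocab = build_vocab(sample)
ids = to_ids(sample, vocab)
```

The resulting id sequences are then batched and fed to the LSTM one truncated segment at a time.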

To train the RNN model:

$ python sgrnn/main.py --model=small --data_path=simple-examples/data/ \
    --num_gpus=0 --rnn_mode=BASIC --save_path=/tmp/sgrnn

Reference:

Jaderberg et al., "Decoupled Neural Interfaces using Synthetic Gradients", ICML 2017. https://arxiv.org/abs/1608.05343