rwth-i6 / returnn-benchmarks

Benchmarks / comparisons to other RNN software

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

We are doing benchmarks.

Speed benchmarks

In these experiments we used the following hardware:

  1. Intel(R) Core(TM) i5-4690K CPU @ 3.50GHz

  2. 16 Gb LDDR-3

  3. Nvidia GTX-970

OS : Ubuntu 16.10 LTS, video driver : 367.35, CUDA 7.5, cuDNN-5 R5.

Network topology : 3 layer [B]LSTM with 512 hidden units per cell followed by softmax output layer. All experiments available to reproduce using run scripts or corresponding config files.

***Please notice that number of params in cudnn lstm is bigger, becuase they also provide the bias for the Recurrent matrix.

Framework Cell Type Avg. time per epoch
RETURNN LSTM 268 sec
RETURNN BLSTM 599 sec
Torch7 + cudnn LSTM 156 sec
Torch7 + cudnn BLSTM 389 sec
Keras [Theano] LSTM 1068 sec
Keras [Theano] BLSTM 2167 sec
TensorFlow LSTM wip
TensorFlow BLSTM wip

About

Benchmarks / comparisons to other RNN software


Languages

Language:Python 84.1%Language:Shell 9.9%Language:Lua 6.0%