This repository contains all the code and documents for the presentation held within the Iași AI community.
- Deep Learning Book by Ian Goodfellow, Yoshua Bengio and Aaron Courville
- Lecture on Recurrent Neural Networks held by Andrej Karpathy
- The Unreasonable Effectiveness of Recurrent Neural Networks article written by Andrej Karpathy
- Minimal character-level language model with a Vanilla Recurrent Neural Network gist written by Andrej Karpathy
The experimental part of the project was to see whether the vanilla RNN could be trained to generate lyrics for manele. These lyrics are low quality and very simplistic.
The vanilla RNN from the presentation was trained with 9 hyperparameter configurations, as described in the table below. Each training run stopped after 1,000,000 iterations, each over a sequence of 25 characters.
Run Id | Hidden size | Learning rate | Loss |
---|---|---|---|
1 | 100 | 0.1 | N/A |
2 | 100 | 0.01 | 44.355101 |
3 | 100 | 0.001 | 59.384402 |
4 | 250 | 0.1 | 48.762288 |
5 | 250 | 0.01 | 37.229775 |
6 | 250 | 0.001 | 55.494079 |
7 | 500 | 0.1 | 52.587987 |
8 | 500 | 0.01 | 27.412387 |
9 | 500 | 0.001 | 53.266636 |
As can be seen from the results, the model got stuck in a local minimum; the best result was achieved by run 8, with `hidden_size=500` and `learning_rate=0.01`.
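To make the comparison concrete, the results table can be loaded into a short Python sketch that picks the run with the lowest loss and converts that loss to a per-character value. The per-character figure assumes that the reported loss is the cross-entropy (in nats) summed over the 25-character sequence, as in Karpathy's gist; this is an assumption, not something the results state. The names `runs`, `best_run` and `SEQ_LENGTH` are illustrative only.

```python
import math

SEQ_LENGTH = 25  # characters per training sequence, as stated above

# (run id, hidden size, learning rate, loss), copied from the results table.
# Run 1 has no reported loss, so it is stored as None.
runs = [
    (1, 100, 0.1, None),
    (2, 100, 0.01, 44.355101),
    (3, 100, 0.001, 59.384402),
    (4, 250, 0.1, 48.762288),
    (5, 250, 0.01, 37.229775),
    (6, 250, 0.001, 55.494079),
    (7, 500, 0.1, 52.587987),
    (8, 500, 0.01, 27.412387),
    (9, 500, 0.001, 53.266636),
]

# Skip runs without a loss, then take the one with the smallest loss.
scored = [run for run in runs if run[3] is not None]
best_run = min(scored, key=lambda run: run[3])

# Assumption: the loss is summed over the sequence, so dividing by the
# sequence length gives the mean cross-entropy per character.
per_char_loss = best_run[3] / SEQ_LENGTH

# exp(mean cross-entropy in nats) is the per-character perplexity.
perplexity = math.exp(per_char_loss)

print("best run:", best_run[0])
print("per-character loss:", round(per_char_loss, 3))
print("per-character perplexity:", round(perplexity, 2))
```

Under that assumption, a per-character view makes runs easier to compare with models trained on other sequence lengths, since the summed loss grows with the number of characters per sequence.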
If you're still curious, you can browse the files with generated samples for each run:
- Run 1: ni1000000-hs100-is25-lr0.1-2018-11-13-1546.txt
- Run 2: ni1000000-hs100-is25-lr0.01-2018-11-13-1552.txt
- Run 3: ni1000000-hs100-is25-lr0.001-2018-11-13-1555.txt
- Run 4: ni1000000-hs250-is25-lr0.1-2018-11-14-1217.txt
- Run 5: ni1000000-hs250-is25-lr0.01-2018-11-14-1216.txt
- Run 6: ni1000000-hs250-is25-lr0.001-2018-11-14-1206.txt
- Run 7: ni1000000-hs500-is25-lr0.1-2018-11-15-1326.txt
- Run 8: ni1000000-hs500-is25-lr0.01-2018-11-15-1253.txt
- Run 9: ni1000000-hs500-is25-lr0.001-2018-11-15-1324.txt
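
The sample file names appear to encode each run's settings. A minimal sketch of parsing them is shown below. Reading `ni` as the number of iterations, `hs` as the hidden size, `is` as the input sequence length, `lr` as the learning rate, and the trailing digits as a date plus `HHMM` time is inferred from the values in the results table, not documented in the repository. `parse_run_filename` is a hypothetical helper, not part of the project code.

```python
import re
from datetime import datetime

# Pattern for names such as ni1000000-hs100-is25-lr0.1-2018-11-13-1546.txt.
# The meaning of each field is an assumption inferred from the results table.
FILENAME_RE = re.compile(
    r"ni(?P<iterations>\d+)"
    r"-hs(?P<hidden_size>\d+)"
    r"-is(?P<seq_length>\d+)"
    r"-lr(?P<learning_rate>\d*\.?\d+)"
    r"-(?P<timestamp>\d{4}-\d{2}-\d{2}-\d{4})\.txt"
)


def parse_run_filename(name):
    """Split a sample file name into its (assumed) hyperparameters."""
    match = FILENAME_RE.fullmatch(name)
    if match is None:
        raise ValueError(f"unrecognised file name: {name}")
    return {
        "iterations": int(match["iterations"]),
        "hidden_size": int(match["hidden_size"]),
        "seq_length": int(match["seq_length"]),
        "learning_rate": float(match["learning_rate"]),
        # Assumption: the last four digits are hours and minutes.
        "timestamp": datetime.strptime(match["timestamp"], "%Y-%m-%d-%H%M"),
    }


info = parse_run_filename("ni1000000-hs500-is25-lr0.01-2018-11-15-1253.txt")
print(info["hidden_size"], info["learning_rate"], info["timestamp"])
```

Parsing the names this way would allow the sample files to be matched back to rows of the table by hidden size and learning rate instead of by hand.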