RNN LSTM layer
bhack opened this issue
@jeffdonahue @sguada Do you have code to share for your publication: http://arxiv.org/abs/1411.4389?
Thanks for your interest in our work! We definitely plan to release code, but there's quite a bit of work to do to get it into a reasonably sane state -- there will be PRs once it's ready.
@jeffdonahue Could you offer some comments on integrating Caffe with the LSTM code from karpathy? Cheers.
@bittnt Do you mean @karpathy at https://github.com/karpathy/neuraltalk?
@bhack Yes. I think someone has already hacked it together. :))
You should see this: https://github.com/dophist/kaldi-lstm (it includes CUDA code too; I think it's the best one).
https://github.com/junhyukoh/caffe-lstm is nice work from U-Mich.
@junhyukoh Do you plan to contribute back to Caffe with a PR?
@junhyukoh looking forward to your merge :D
@bhack @sunbaigui Thank you for your interest!
However, I don't think my current implementation fits Caffe perfectly.
Since I treat each mini-batch as a single training sequence (the RNN unrolled over time), my code supports only plain SGD, not mini-batch updates (one update after processing several training sequences); see the sketch after this comment.
I plan to rewrite the code and open a PR when it's ready, but I cannot guarantee the timeline.
Feel free to use my code and any comments are welcome!
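For readers unfamiliar with the distinction above, here is a minimal, hypothetical Python sketch (not code from caffe-lstm) contrasting per-sequence SGD, where the parameters are updated after every training sequence, with a mini-batch update that accumulates gradients over several sequences before a single update. The gradient function and parameters are toy stand-ins.

```python
import numpy as np

# Hypothetical illustration (not code from caffe-lstm): contrast per-sequence
# SGD with a mini-batch update. `grad_for_sequence` stands in for one
# forward/backward pass of an RNN unrolled over a whole sequence.

def grad_for_sequence(params, seq):
    # Toy gradient of the squared loss 0.5 * (params - mean(seq))^2.
    return params - np.mean(seq)

def sgd_per_sequence(params, sequences, lr=0.1):
    """One parameter update per training sequence (what the code supports)."""
    for seq in sequences:
        params = params - lr * grad_for_sequence(params, seq)
    return params

def minibatch_update(params, sequences, lr=0.1, batch_size=4):
    """One update after accumulating gradients over several sequences."""
    for i in range(0, len(sequences), batch_size):
        batch = sequences[i:i + batch_size]
        grad = np.mean([grad_for_sequence(params, s) for s in batch], axis=0)
        params = params - lr * grad  # a single update for the whole mini-batch
    return params

sequences = [np.random.randn(20) for _ in range(8)]
print(sgd_per_sequence(np.zeros(1), sequences))
print(minibatch_update(np.zeros(1), sequences))
```

The second scheme is roughly the kind of gradient accumulation that Caffe's solver exposes via its iter_size parameter.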
Closing; see #1873 for (a cleaned-up version of) the implementation we used for LRCN.