kracwarlock / action-recognition-visual-attention

Action recognition using soft attention based deep recurrent neural networks

Home Page: http://www.cs.toronto.edu/~shikhar/projects/action-recognition-attention


Location softmax

pmorerio opened this issue · comments

Hi, thanks for sharing the code!
It looks like the location softmax implemented in the conditional LSTM is not the one you describe in the paper 'Action Recognition using Visual Attention' (eq. 4), but rather the one described in eqns. 4, 5, 6, and 9 of 'Show, Attend and Tell: Neural Image Caption Generation with Visual Attention'. Could you please comment on that? Really appreciated, thanks!

Hi @kracwarlock ,

I have a question about your master's thesis: in equation 2.4 you mention that "Wi are the weights mapping to the ith element of the location softmax". I don't fully understand this. Can you explain it a little more? Thank you!

@pmorerio The location softmax in my case is the one presented in eq. (4), while in their case it is eqs. (4)+(5).
Eqs. (6) and (9) hold true for both papers.
The results in our paper are based on our paper's eq. (4).
However, the Show, Attend and Tell paper's (4)+(5) seems to produce better results, since it looks at the current frame's features as well.

@GerardoHH I just meant that W_i are the weights for l_{t,i} (the ith element of the location softmax).

@kracwarlock Thanks a lot for your answer. I must say that this does not look to be the case, since in the code the location softmax is calculated based on both the current frame's features AND the previous hidden state. The two contributions are then added together here and used to activate a softmax layer.
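For what it's worth, the difference being discussed can be sketched in a few lines of NumPy. This is only an illustration, not the repo's actual code: the dimensions (a 7x7 feature map, 512-d features, 256-d hidden state) are made up, and the Show, Attend and Tell score is simplified to a linear per-location term instead of the MLP attention function used in that paper.

```python
import numpy as np

def softmax(x):
    # numerically stable softmax over the location dimension
    e = np.exp(x - x.max())
    return e / e.sum()

# Hypothetical dimensions: K^2 = 49 locations (7x7 map), D = 512 feature dim, H = 256 hidden dim
K2, D, H = 49, 512, 256
rng = np.random.default_rng(0)

W_h = rng.standard_normal((K2, H)) * 0.01  # weights on the previous hidden state h_{t-1}
W_x = rng.standard_normal((K2, D)) * 0.01  # extra weights on the current frame's features

h_prev = rng.standard_normal(H)            # LSTM hidden state h_{t-1}
X_t = rng.standard_normal((K2, D))         # current frame's K^2 feature vectors

# Variant 1: attention depends only on h_{t-1}
# (the action-recognition paper's eq. (4))
l_paper = softmax(W_h @ h_prev)

# Variant 2: a contribution from the current frame's features is added in as well
# (in the spirit of Show, Attend and Tell's eqs. (4)-(5), simplified to a linear score)
l_sat = softmax(W_h @ h_prev + np.einsum('kd,kd->k', W_x, X_t))

# Both are valid distributions over the K^2 locations
print(l_paper.shape, round(l_paper.sum(), 6), round(l_sat.sum(), 6))
```

The point of the exchange above is that the released code computes something like `l_sat` (both terms), whereas the paper's eq. (4) describes `l_paper` (hidden state only).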