Why is log_softmax applied twice to the prediction logits?
lixinsu opened this issue · comments
Is this a special procedure for the SAN module?
It is only there to keep the API consistent: the module's output is treated as logits.
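For context on why the second call is harmless: log_softmax is idempotent, because its output already has a log-sum-exp of zero, so applying it again subtracts zero. The sketch below checks this in plain Python. It does not use the repository's code; the `log_softmax` helper and the sample values are assumptions made for illustration only.

```python
import math

def log_softmax(xs):
    """Numerically stable log-softmax over a list of floats (illustrative helper)."""
    m = max(xs)
    # log-sum-exp, shifted by the max to avoid overflow
    lse = m + math.log(sum(math.exp(x - m) for x in xs))
    return [x - lse for x in xs]

logits = [2.0, -1.0, 0.5]   # made-up example values
once = log_softmax(logits)
twice = log_softmax(once)

# The second application leaves the values unchanged up to floating-point rounding.
same = all(math.isclose(a, b, abs_tol=1e-12) for a, b in zip(once, twice))
print(same)
```

So a loss function that applies log_softmax internally would see the same values whether it receives raw logits or log-probabilities that were already normalized.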
Stochastic Answer Networks (SAN) for Machine Reading Comprehension