princeton-nlp / LM-BFF

[ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723


Question about the regression problem

Elenore1997 opened this issue

Hi, I have a question about the regression method. In Section 4.2 of your paper, why do you use a KL-divergence loss between p(y_u | x_in) and the scaled score (y − v_l)/(v_u − v_l) (i.e., `loss = loss_fct(logits.view(-1, 2), labels)`) rather than a cross-entropy loss? The logits and labels here are both probability distributions over the two polarities.
Thanks in advance!

Thanks for the question. It should be possible to use either one.
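
The two losses are indeed interchangeable for training here: since the target distribution is fixed, KL(target ‖ model) equals the cross-entropy minus the target's entropy, which is a constant with respect to the logits, so both losses give the same gradients. A minimal sketch verifying this identity numerically (the example target and logits are made up for illustration):

```python
import math

def softmax(logits):
    # numerically stable softmax over a list of logits
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    s = sum(exps)
    return [e / s for e in exps]

def cross_entropy(target, probs):
    # CE(p, q) = -sum_i p_i * log(q_i)
    return -sum(t * math.log(q) for t, q in zip(target, probs))

def kl_div(target, probs):
    # KL(p || q) = sum_i p_i * log(p_i / q_i)
    return sum(t * math.log(t / q) for t, q in zip(target, probs) if t > 0)

# Hypothetical scaled regression target: (y - v_l)/(v_u - v_l) mass on the
# "positive" polarity word, the rest on the "negative" one.
target = [0.3, 0.7]
probs = softmax([0.2, 1.1])  # model distribution over the two polarity words

entropy = -sum(t * math.log(t) for t in target)  # H(target), constant w.r.t. logits
# KL(target || probs) = CE(target, probs) - H(target)
assert abs(kl_div(target, probs) - (cross_entropy(target, probs) - entropy)) < 1e-9
```

So minimizing either loss moves the logits in the same direction; the reported loss values simply differ by H(target).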

Thanks for the quick reply!