About the loss function and the textual feature

Question

JCZ404 opened this issue 3 years ago · comments

Hi, thanks for your great work! I have two questions about the loss function in your code. First, the original paper says it uses a logistic regression loss, but your code seems to compute only a positive/negative pair loss between the sentence and the image. Second, which task does your code target? The original paper focuses on phrase grounding, but your code does not seem to handle the individual phrases in the caption and instead treats the caption as a whole. Could you explain this a little?
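For readers comparing the two formulations, here is a minimal sketch of a logistic regression loss over labeled pairs. It is not taken from the repository or the paper: the names `softplus` and `logistic_pair_loss`, the use of +1/-1 labels, and the example scores are all assumptions made for illustration. With label y and similarity score s, the per-pair loss is log(1 + exp(-y * s)), so a loss that sums a positive-pair term and a negative-pair term can itself be a logistic loss, depending on how each term is defined.

```python
import math


def softplus(x):
    # Numerically stable log(1 + exp(x)): avoids overflow for large |x|.
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def logistic_pair_loss(scores, labels):
    # scores: one similarity score per (text, image) pair (hypothetical input).
    # labels: +1 for a matching pair, -1 for a non-matching pair.
    losses = [softplus(-y * s) for s, y in zip(scores, labels)]
    return sum(losses) / len(losses)


# Illustrative values only: two confident correct pairs and one undecided pair.
scores = [2.0, -1.5, 0.0]
labels = [1, -1, 1]
loss = logistic_pair_loss(scores, labels)
```

The same function applies whether a "text" is a whole caption or a single phrase; only the granularity of the pairs changes.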

ZhanYang · Answer 1 · Tue Sep 27 2022 10:07:59 GMT+0800 (China Standard Time)

Hi. Thank you for the excellent work. Could I ask a question about your loss function? The code runs without errors, but the loss value is always NaN. Is there something wrong?
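The thread does not say what causes the NaN. In general, common sources are taking the log of zero, overflow in an exponential, or a division by zero. One way to narrow it down is to check each intermediate quantity in order and report the first one that is not finite. The sketch below assumes the intermediate values have been copied out as plain Python floats; `first_non_finite` and the stage names are hypothetical and not part of the repository.

```python
import math


def first_non_finite(named_values):
    # named_values: list of (stage_name, list_of_floats) in computation order.
    # Returns (stage_name, index, value) for the first NaN or infinity, else None.
    for name, values in named_values:
        for i, v in enumerate(values):
            if not math.isfinite(v):
                return name, i, v
    return None


# Illustrative values only: the -inf in "log_probs" is what later turns into NaN.
stages = [
    ("scores", [0.3, 1.2]),
    ("log_probs", [-0.5, float("-inf")]),
    ("loss", [float("nan")]),
]
culprit = first_non_finite(stages)
```

Here `culprit` points at the `log_probs` stage rather than the final loss, which suggests looking at the operation that produced it (for example a log applied to a probability that underflowed to zero).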