这里操作是不是有问题？

Question

这里操作是不是有问题？

pkuyilong opened this issue 4 years ago · comments

box1_xyxy[:, :2] = box1[:, :2] / 14. - 0.5 * box1[:, 2:4]
box1_xyxy[:, 2:4] = box1[:, :2] / 14. + 0.5 * box1[:, 2:4]
box2 = box_target[i].view(-1, 5)
box2_xyxy = Variable(torch.FloatTensor(box2.size()))
box2_xyxy[:, :2] = box2[:, :2] / 14. - 0.5 * box2[:, 2:4]
box2_xyxy[:, 2:4] = box2[:, :2] / 14. + 0.5 * box2[:, 2:4]

这里预测出来的xywh应该都是[0-1]，这里除以14没有意义吧

mayilong · Answer 1 · Fri May 22 2020 15:26:41 GMT+0800 (China Standard Time)

@xiongzihua 如果有空恳请交流一下。

bear · Answer 2 · Tue May 26 2020 23:37:42 GMT+0800 (China Standard Time)

@pkuyilong
box1[:, :2]是在0-1之间，预测的是相对该网格左上角的x1,y1，是相对于网格的0-1。
box1[:, 2:4]也在0-1，预测的是wh，是相对全图的0-1。
计算IOU时，除以14将x1,y1的坐标从网格视角转换到全图视角，不知道这样能不能理解，代码的意图就是这样的，不知道是否合理，你可以继续思考一下，希望对你有所帮助

Vinctor · Answer 3 · Sun May 31 2020 20:56:29 GMT+0800 (China Standard Time)

@pkuyilong do you understand the idea of the author?

Vinctor · Answer 4 · Sun May 31 2020 20:57:48 GMT+0800 (China Standard Time)

@xiongzihua 能举个例子说明这种计算的正确性吗？

mayilong · Answer 5 · Tue Jun 02 2020 18:00:19 GMT+0800 (China Standard Time)

@pkuyilong do you understand the idea of the author?

Training phrase:
the prediction generated from model is 14 x 14,
the ground truth is scaled by origin image size(maybe 480 x 480),
so when calculating the loss, we need to divide the prediction data by 14, which is equal to scale the prediction data by origin image size.

mayilong · Answer 6 · Tue Jun 02 2020 18:42:45 GMT+0800 (China Standard Time)

@pkuyilong
box1[:, :2]是在0-1之间，预测的是相对该网格左上角的x1,y1，是相对于网格的0-1。
box1[:, 2:4]也在0-1，预测的是wh，是相对全图的0-1。
计算IOU时，除以14将x1,y1的坐标从网格视角转换到全图视角，不知道这样能不能理解，代码的意图就是这样的，不知道是否合理，你可以继续思考一下，希望对你有所帮助

又看了一会代码，差不多可以理解了，谢谢🙏