TaoRuijie / Loss-Gated-Learning

ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'

Why use label in loss.py?

gancx opened this issue

Thanks for your great work. I understand that you use `label` to compute the precision in loss.py. However, you initialize the label tensor at line 81, and I wonder how and why you set `label` like this. Thank you.

Sorry, I didn't get your point. Where is line 81?

Sorry, I got the line number wrong. I mean this line in loss.py: `label = torch.from_numpy(numpy.asarray(list(range(batch_size - 1, batch_size*2 - 1)) + list(range(0, batch_size)))).cuda()`

These are the labels for training in Stage 1. Say we have 200 utterances in one minibatch, i.e. 400 segments (two segments per utterance).

The positive pairs are segments from the same utterance; negative pairs are segments from different utterances. So we can build a label matrix:

```
1, 0, 0, ..., 0
0, 1, 0, ..., 0
0, 0, 1, ..., 0
...
0, 0, 0, ..., 1
```

Something like that: the diagonal is 1 and the rest is 0.
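
For concreteness, here is a minimal sketch of that pair matrix in PyTorch, assuming (as the quoted label formula implies) that segment `i` and segment `i + batch_size` come from the same utterance:

```python
import torch

batch_size = 3                        # N utterances -> 2N segments per minibatch
n = 2 * batch_size

# pair[i, j] = 1 iff segments i and j come from the same utterance (i != j);
# segment i is paired with segment i + batch_size, matching the label formula.
pair = torch.zeros(n, n)
for i in range(batch_size):
    pair[i, i + batch_size] = 1
    pair[i + batch_size, i] = 1
```

In the first-segments-versus-second-segments view, this is exactly the identity-like matrix above.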

However, the output label may be [2, 3, 4, 0, 1, 2] if the batch size is 3. What's the relationship between this output label and the label matrix you mentioned?

As far as I remember, batch_size = 3 means 6 segments.

For each segment, the segment itself cannot be selected as a candidate, so there are 5 choices (0, 1, 2, 3, 4) per segment.

So the matrix I mentioned (similar to one-hot) can be transformed into classification labels: [2, 3, 4, 0, 1, 2] gives the class index for each segment. The order comes from how the rows are processed when computing the loss; see the sketch below.
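
A hedged reconstruction of that transformation (the variable names here are illustrative, not the repo's exact code): drop each row's diagonal entry from the 2N x 2N pair matrix, and the index of the remaining positive among the 2N - 1 candidates is the class label.

```python
import numpy
import torch

batch_size = 3
n = 2 * batch_size

# Positive-pair matrix: segment i is paired with segment i + batch_size.
pair = torch.zeros(n, n)
for i in range(batch_size):
    pair[i, i + batch_size] = 1
    pair[i + batch_size, i] = 1

# Remove the diagonal: a segment cannot pick itself, leaving 2N-1 = 5 candidates
# per row; the position of the positive among the candidates is the class label.
mask = ~torch.eye(n, dtype=torch.bool)
classes = pair[mask].view(n, n - 1).argmax(dim=1)
print(classes)                        # tensor([2, 3, 4, 0, 1, 2])

# Matches the label construction quoted from loss.py (without .cuda()):
label = torch.from_numpy(numpy.asarray(
    list(range(batch_size - 1, batch_size * 2 - 1)) + list(range(0, batch_size))))
assert torch.equal(classes, label)
```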

This part is only used to compute the training accuracy, to help understand the training process, so it does not affect the results.
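
As a rough sketch of that bookkeeping (again with placeholder names, not the repo's exact code), the reported precision is just top-1 accuracy of the diagonal-masked similarity scores against this label; no gradient flows through it:

```python
import torch

batch_size = 3
n = 2 * batch_size
label = torch.tensor([2, 3, 4, 0, 1, 2])     # classes from the formula above

scores = torch.randn(n, n - 1)               # stand-in for the real (2N, 2N-1) similarities
prec = (scores.argmax(dim=1) == label).float().mean().item() * 100
print(f"training precision: {prec:.1f}%")    # monitoring only
```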

OK, it makes sense to me now. You transform between the original one-hot label matrix and class indices. I understand. Thanks for your explanation.