ajabri / videowalk

Repository for "Space-Time Correspondence as a Contrastive Random Walk" (NeurIPS 2020)

Home Page: http://ajabri.github.io/videowalk

Why is zero-loss-cycle always part of loss computation?

vadimkantorov opened this issue · comments

https://github.com/ajabri/videowalk/blob/master/code/model.py#L171 :

```python
xents = [torch.tensor([0.]).to(self.args.device)]
loss = sum(xents) / max(1, len(xents) - 1)
```

Is it just to have something evaluated to zero if there are no walks? In what case does this happen in practice?

Thank you @ajabri !

Hi @vadimkantorov,

Yes, I believe that's the reason for the line, but it can be removed! With the existing code, this only happens if your sequence is too short and no cycles are computed (I believe with sequence length <= 2).
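To illustrate the point, here is a minimal pure-Python sketch (the hypothetical `cycle_loss` helper is for illustration only, not part of the repo). Seeding the list with a zero keeps `sum(...)` well-defined and the `max(1, ...)` denominator avoids a divide-by-zero when no cycles were computed; in the actual code the seed is a `torch.tensor` on the model's device, so the loss remains a tensor that supports `.backward()` even in that degenerate case.

```python
def cycle_loss(cycle_losses):
    # Zero seed mimics model.py L171; real per-cycle cross-entropy
    # terms would be appended after it.
    xents = [0.0] + list(cycle_losses)
    # max(1, ...) guards the denominator when no cycles exist.
    return sum(xents) / max(1, len(xents) - 1)

print(cycle_loss([]))          # 0.0 -- no cycles (sequence too short)
print(cycle_loss([1.0, 3.0]))  # 2.0 -- mean over the real terms only
```

Note that the zero seed does not bias the result: it adds nothing to the numerator, and `len(xents) - 1` excludes it from the count, so the expression is just the mean of the real cycle losses.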

Hi @ajabri, thanks a lot for sharing your code! Is there a particular reason why you enforce that the subsequence length must be > 2 for a cycle to be taken into account? I'm referring to line 147 of model.py, where the range starts from 1: `for i in list(range(1, len(A12s))): ...`. Why not start from 0?