yyliu01 / PS-MT

[CVPR'22] Perturbed and Strict Mean Teachers for Semi-supervised Semantic Segmentation

Home Page: https://arxiv.org/pdf/2111.12903.pdf

Iteration method

297774951 opened this issue · comments

Hello, I would like to ask about the iteration method used to update the two teachers in your paper (update the first teacher in one iteration, and the other teacher in the next). I saw the explanation in the paper that this is meant to increase the diversity between the two teachers. What are the benefits of updating the two teachers in this alternating way?

Hi @297774951

Instead of an iteration-wise update, we update each teacher's parameters on an epoch-wise schedule.

The teacher's parameters rely heavily on the student's, as they are updated via an exponential moving average (EMA). The SGD optimiser and the strong augmentations (including CutMix and colour jittering) encourage the student to learn different parameters in different epochs, which in turn yields different parameters for the dual teachers, leading to a relatively higher divergence between them.
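The epoch-alternating EMA schedule described above can be sketched as follows. This is a minimal, framework-free sketch, not the repository's code: plain dicts of floats stand in for network weights, and the names `ema_update` and `alternate_teachers` are illustrative.

```python
def ema_update(teacher, student, alpha=0.99):
    """EMA: teacher <- alpha * teacher + (1 - alpha) * student, per parameter."""
    return {k: alpha * v + (1.0 - alpha) * student[k] for k, v in teacher.items()}

def alternate_teachers(teacher_a, teacher_b, student_snapshots, alpha=0.99):
    """Epoch-wise alternation: even epochs update teacher A, odd epochs teacher B.

    student_snapshots: one student parameter dict per epoch; between snapshots
    the student is trained with SGD under strong augmentation, so consecutive
    snapshots differ, and each teacher accumulates a different EMA trajectory.
    """
    for epoch, student in enumerate(student_snapshots):
        if epoch % 2 == 0:
            teacher_a = ema_update(teacher_a, student, alpha)
        else:
            teacher_b = ema_update(teacher_b, student, alpha)
    return teacher_a, teacher_b
```

Because each teacher only sees every other epoch's student, the two EMA trajectories are fed different snapshots, which is what produces the divergence between them.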

Cheers,
Yuyuan

Why should the dual teachers have a relatively higher divergence?

Can you reply if you have time? Thank you so much

Is it to strengthen the perturbation of the network to improve the generalization of consistency learning?

Hi @wangmingaaaaa

Yes, the various strong perturbations cause the student network to be optimised differently in different epochs, so its updates to each teacher also differ. Please note that the comment about a "relatively higher divergence" is in comparison with the iteration-wise update method.

I believe the dual teachers will eventually fall into the same local minimum, just as normal MT does; our goal with this architecture is to obtain more reliable pseudo-labels throughout the training process.
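For reference, the CutMix perturbation mentioned above (in its generic form for segmentation, pasting a box from one sample into another and mixing the labels with the same mask) can be sketched as below. This is an illustrative sketch of the standard technique, not the repository's implementation; all function names are hypothetical.

```python
import numpy as np

def cutmix_mask(h, w, ratio=0.5, rng=None):
    """Binary box mask covering roughly `ratio` of the image area."""
    rng = rng or np.random.default_rng()
    cut_h, cut_w = int(h * ratio ** 0.5), int(w * ratio ** 0.5)
    top = rng.integers(0, h - cut_h + 1)
    left = rng.integers(0, w - cut_w + 1)
    mask = np.zeros((h, w), dtype=np.float32)
    mask[top:top + cut_h, left:left + cut_w] = 1.0
    return mask

def cutmix(img_a, img_b, lbl_a, lbl_b, mask):
    """Paste the masked box of sample B into sample A, for image and label alike.

    Images are CHW arrays; labels are HW class-index maps, so the label is
    mixed by hard selection rather than interpolation.
    """
    mixed_img = img_a * (1 - mask)[None] + img_b * mask[None]
    mixed_lbl = np.where(mask > 0, lbl_b, lbl_a)
    return mixed_img, mixed_lbl
```

Applying a different random mask each epoch is one source of the epoch-to-epoch variation in the student discussed above.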

Cheers,
Yuyuan

Thanks for your reply!

@wangmingaaaaa My pleasure!