Alibaba-MIIL / ImageNet21K

Official PyTorch implementation of the paper "ImageNet-21K Pretraining for the Masses" (NeurIPS 2021)

What is the teacher model when using semantic softmax with KD?

Phuoc-Hoan-Le opened this issue

What is the teacher model when using semantic softmax with KD? The figure in the paper does not make clear what the teacher is, and there is no code example showing how to use KD.