GMvandeVen / class-incremental-learning

PyTorch implementation of a VAE-based generative classifier, as well as other class-incremental learning methods that do not store data (DGR, BI-R, EWC, SI, CWR, CWR+, AR1, the "labels trick", SLDA).

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Question about model size

yutongred opened this issue · comments

Hi Gido,

Does this method have a drawback of a too large model size for large label space like ImageNet?

Hi, thanks for you question and interest in the code. The main point of the paper accompanying this code is to put generative classification forward as a promising strategy for class-incremental learning. This strategy of generative classification does not necessarily have the drawback of leading to a too large model size for large label spaces, because the generative models for the different classes can share parts of their networks with each other. It could be argued that the specific implementation of a generative classifier that was used in this paper as a proof-of-concept, with a separate generative model per class, has this disadvantage. Although in the comparisons in the paper this was controlled for by limiting the total number of parameters. Under a limited parameter budget, having completely separate models per class is the naive solution; by sharing parts of the models it should be possible to do better.

Thanks for your reply : )