d-li14 / dgconv.pytorch

PyTorch implementation of Dynamic Grouping Convolution and Groupable ConvNet with pre-trained G-ResNeXt models

Home Page: https://arxiv.org/abs/1908.05867

How to train the DGConv model?

longxianlei opened this issue

The regularizer term U is the number of parameters used in the convolution filter:
U_regularizer = 2**(self.K + torch.sum(self.gate))
Every DGConv block records this regularizer term, and the terms are meant to serve as complexity constraints. I am confused about how to add these terms to the final loss function. I have also read some discussions of this paper which say that the author used a NAS method to search the architecture. What do you think about this?
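To make the question concrete, here is one possible way the per-block terms could be folded into a loss. This is only a sketch under stated assumptions, not the method from the paper or this repository: `ToyGatedBlock`, `complexity_penalty`, `budget`, and `lam` are hypothetical names, the gates are assumed to be differentiable sigmoid outputs, and the hinge-style penalty on a parameter budget is an illustrative choice.

```python
import torch
import torch.nn as nn


class ToyGatedBlock(nn.Module):
    """Hypothetical stand-in for a DGConv block; it only mimics K and gate."""

    def __init__(self, K: int):
        super().__init__()
        self.K = K
        # Assumption: gates come from learnable logits passed through a sigmoid.
        self.gate_logits = nn.Parameter(torch.zeros(K))

    @property
    def gate(self) -> torch.Tensor:
        # Soft gates in (0, 1), so the regularizer stays differentiable.
        return torch.sigmoid(self.gate_logits)

    def regularizer(self) -> torch.Tensor:
        # Same expression as the one quoted in the issue.
        return 2.0 ** (self.K + torch.sum(self.gate))


def complexity_penalty(model: nn.Module, budget: float) -> torch.Tensor:
    """Sum the per-block terms and penalize only the amount above the budget."""
    terms = [m.regularizer() for m in model.modules() if isinstance(m, ToyGatedBlock)]
    total = torch.stack(terms).sum()
    return torch.relu(total - budget)


model = nn.ModuleList([ToyGatedBlock(3), ToyGatedBlock(4)])

task_loss = torch.tensor(1.0)  # placeholder for e.g. a cross-entropy loss
lam = 1e-3                     # hypothetical weight of the complexity term
penalty = complexity_penalty(model, budget=50.0)

loss = task_loss + lam * penalty
loss.backward()  # gradients reach the gate logits through the penalty
```

With a penalty of this shape, the gates are only pushed toward fewer parameters while the summed terms exceed the budget. Whether the original training used a term like this or a NAS-style search is not settled by this sketch.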

refer to #1