DenseNeck design
hao-pt opened this issue · comments
Have you tried different output channels for single projection and dense projection? Particularly, you used the same hidden channels and output channels for single mlp and dense mlp in the DenseCLNeck
impl. As I know, the projection of instance representation requires a greater number of channels than the projection of dense representation. Treating both of them equally might lose lots of useful information from instance representation. How do you think about this problem? Most instance discrimination methods also design the projector as fc-bn-relu-fc so I wonder why you drop bn
in DenseCLNeck
? Is it just for simplicity?
self.mlp = nn.Sequential(
nn.Linear(in_channels, hid_channels), nn.ReLU(inplace=True),
nn.Linear(hid_channels, out_channels))
...
self.mlp2 = nn.Sequential(
nn.Conv2d(in_channels, hid_channels, 1), nn.ReLU(inplace=True),
nn.Conv2d(hid_channels, out_channels, 1))