Placement of Activation after SEBlock

Question

Placement of Activation after SEBlock

bernardomig opened this issue 4 years ago · comments

Hi. In your code, you place the activation (hard-swish), after the SE Block

self.conv = nn.Sequential(
                # pw
                nn.Conv2d(inp, hidden_dim, 1, 1, 0, bias=False),
                nn.BatchNorm2d(hidden_dim),
                h_swish() if use_hs else nn.ReLU(inplace=True),
                # dw
                nn.Conv2d(hidden_dim, hidden_dim, kernel_size, stride, (kernel_size - 1) // 2, groups=hidden_dim, bias=False),
                nn.BatchNorm2d(hidden_dim),
                # Squeeze-and-Excite
                SELayer(hidden_dim) if use_se else nn.Identity(),
                h_swish() if use_hs else nn.ReLU(inplace=True), ## <-- HERE!!!
                # pw-linear
                nn.Conv2d(hidden_dim, oup, 1, 1, 0, bias=False),
                nn.BatchNorm2d(oup),
            )

Is it correct, or should it be placed before the SE Block?

Duo Li · Answer 1 · Tue Jun 30 2020 08:35:06 GMT+0800 (China Standard Time)

I think it's correct, please refer to figure 4 of the paper. But in my practice, it seems that placing h-swish before SE brings marginal benefit.

Placement of Activation *after* SEBlock

Placement of Activation after SEBlock