About Aggregation Module And Affinity Loss

Question

About Aggregation Module And Affinity Loss

JialeTao opened this issue 4 years ago · comments

JialeTao commented 4 years ago

Hi, first congratulations for the good work. And I have some problems in reading your paper.

In aggregation module, depth-wise separable convolutions is used. To my knowleadge, depth-wise separable convolution does not change the channel of input features while it is changed from $C_0$ to $C_1$ in paper. How it is implemented?
Does the gradient of affinity loss backward to segmentation backbone? Or only backward to context prior layer?
Noted that an ideal affinity map is used as a supervision for context prior map, how about directly applying ideal affinity map for constructing the concatenated feature? And thus context prior layer can be needless?

Thanks.

Mayy1994 · Answer 1 · Tue Apr 14 2020 14:34:51 GMT+0800 (China Standard Time)

Hi, I just read this paper and the following is my understanding.

Maybe the authors changed the number of channels by using 1x1 Conv after depth-wise separable convolutions.
Normally, the gradient should backpropagate to the segmentation backbone.
For validation and testing, there is no ideal affinity map. Hence, context prior is necessary.

Changqian · Answer 2 · Fri Apr 17 2020 11:18:43 GMT+0800 (China Standard Time)

Thanks for your attention.

In Fig.4 (a) of the paper, the Aggregation Module has one 3x3 conv and two asymmetric fully separable convs. The first 3x3 conv can reduce the channel dimension.
The gradient should back-propagate to the segmentation backbone.
The ideal affinity map is constructed from the Ground Truth. However, in the inference phase, we can not obtain the Ground Truth to construct the ideal affinity map. Therefore, we design a Context Prior Layer to mimic and approach the ideal affinity map.

JialeTao · Answer 3 · Fri Apr 17 2020 11:29:41 GMT+0800 (China Standard Time)

Hi, I just read this paper and the following is my understanding.

Maybe the authors changed the number of channels by using 1x1 Conv after depth-wise separable convolutions.

Normally, the gradient should backpropagate to the segmentation backbone.

For validation and testing, there is no ideal affinity map. Hence, context prior is necessary.

Thanks very much. The author just answered the first question and I did not notice the 3 by 3 convolution before the fully separation convolution branch.

JialeTao · Answer 4 · Fri Apr 17 2020 11:31:04 GMT+0800 (China Standard Time)

Thanks for your attention.

In Fig.4 (a) of the paper, the Aggregation Module has one 3x3 conv and two asymmetric fully separable convs. The first 3x3 conv can reduce the channel dimension.

The gradient should back-propagate to the segmentation backbone.

The ideal affinity map is constructed from the Ground Truth. However, in the inference phase, we can not obtain the Ground Truth to construct the ideal affinity map. Therefore, we design a Context Prior Layer to mimic and approach the ideal affinity map.

Thanks very much! I didn't notice that before.