AbNeil / Recommendation-for-understanding-of-semantic-segmentation-using-CNN

Easy understanding of the semantic segmentation using CNN with some recommended links.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Recommendation for the understanding of semantic segmentation using CNN

In general, CNN for the semantic segmentation consists of two essential parts for the pixel-wise classification to assign the pixel labels. The first part is encoder composed of convolutional, which is responsible for the automatic features extraction and sub-sampling layers. The purpose of the sub-sampling layers is to achieve spatial in-variance by reducing the resolution of the feature maps, which also improves the robustness of the classifier due to the elimination of the redundant features. Sub-sampling also increases the field of view of the feature maps to extract more abstract class salient features and minimizes computation time. Sometimes, dropout layer or batch normalization or even both are added after each convolution to avoid the overfitting and to make the robust pixel-wise classifier. On the other hand, the second part is decoder to semantically project the discriminating features of lower resolution learned by the encoder onto the pixel space of higher resolution to get a dense pixel-wise classification. In semantic segmentation, the encoder part is quite similar to all the CNN models but they mainly differ in decoder mechanism. Semantic segmentation not only requires discrimination at pixel level but also a decoder mechanism to project the discriminating features learned at different stages of the encoder onto the pixel space. However, the significantly reduced features map due to sub-sampling suffers spatial resolution loss, which introduces coarseness, less edge information, checkerboard noise, and over-segmentation in the semantically segmented image. When the kernel size of the deconvolution is not divisible by the up-scaling factor, the number of low-resolution features that contribute to a single high-resolution feature is not constant across the high-resolution feature maps, which is called deconvolution overlap and is one of the causes of checkerboard artifact in the segmented mask.

Useful links for understanding semantic segmentation

Understanding of U-Net

Understanding of Fully Convolutional Networks (FCN)

Understanding Region-based Fully Convolutional Networks (R-FCN) for object detection

Github Source code link

Semantic segmentation challenges in different fields

Written by-

Md. Kamrul Hasan

Erasmus Scholar on Medical Imaging and Application (MAIA) [http://maiamaster.udg.edu/]

For more details write me at kamruleeekuet@gmail.com

About

Easy understanding of the semantic segmentation using CNN with some recommended links.