aryankargwal / pytorch-mask-rcnn

Pytorch Implementation of Instance Segmentation using Mask RCNN

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

pytorch-mask-rcnn


Computer Vision is a field of Machine Learning that enables computers and automative systems to derive meaningful information from digital images, videos and other visual inputs. With other spheres of AI work to make the computers to think, Computer Vision is serves as one of the biggest components that gives them the information to do so.

Different tasks in Computer Vision


  • Image Classification: the task of assigning an input image one label from a fixed set of categories.
  • Object Localization: the task of assign a single label to an image corresponding to the “main” object.
  • Object Detection: the task of making bounding boxes around real life objects in an image or video.
  • Image Segmentation: the task of partitioning a digital image into multiple segments (sets of pixels, also known as image objects).

Image Segmentation


The purpose of segmentation is to make an image more intelligible and easier to examine by simplifying and/or changing its representation. In humans to objects and boundaries (lines, curves, etc.) in images are often located via an unconscious image segmentation.
In Layman's terms, image segmentation is the process of assigning a label to every pixel in an image such that pixels with the same label share certain characteristics.

Types of Image Segmentation

Instance Segmentation

Instance Segmentation is identifying each object instance for every known object within an image. Instance segmentation assigns a label to each pixel of the image.

Resources

PPT Link
Colab Notebook
Papers with Code
ICCV Presentation

About

Pytorch Implementation of Instance Segmentation using Mask RCNN


Languages

Language:Jupyter Notebook 100.0%