KirillAI / stanford-drone-semantic-segmentation

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

stanford-drone-semantic-segmentation

Solving semantic segmentation task for Stanford Drone Dataset

Solution Brief:

  1. To train the semantic segmentation mask prediction, bounding boxes are used as are.

  2. Loss is cross-entropy.

  3. Metric is mean IoU.

  4. Transfer learning for encoder in U-Net, freeze encoder and train only decoder.

  5. Augmentation (random flip left-right and random shift).

Examples:

For some frame, the true mask looks like the figure below.

True mask

The figure below shows the predicted mask for this frame using MobileNet U-Net model before training.

Predicted mask before training

And after training on the small data, the predicted mask looks like

Predicted mask after training

TODO:

About

License:Other


Languages

Language:Python 70.1%Language:Makefile 29.9%