computer-vision convolutional-neural-networks deep-learning deep-neural-networks image-segmentation machine-learning python rcnn recurrent-neural-networks semantic-segmentation tensorflow

Segmentation as a Scene Labeling task with Recurrent Convolutional Neural Networks

Experiments with Scene Labeling using Recurrent Convolutional Neural Networks on two different segmentation tasks:

Foraminifera image segmentation into different chambers and apertures
Segmentation of Unity simulated environments into safe and unsafe zones

Overview

This project implements a "Recurrent Convolutional Neural Network" (rCNN) for scene labeling, as described in Pinheiro et al., 2014.

Summary of files

README.md           -- README file
category_maps/      -- Text files containing category info for the "Stanford" and "Data from Games" datasets
model.py            -- Code for rCNN model
preprocessing.py    -- Code for processing input data
requirements.txt    -- Lists python package requirements
train.py            -- Script for training and evaluating model

Installation

Clone or download this repository to your computer:

git clone https://github.com/jerisalan/RCNN-Segmentation.git

Install the necessary Python requirements through pip:

pip install -r requirements.txt

This project only requires TensorFlow and PIL (for image manipulation). TensorFlow v1.3.0 was used for training and testing; later versions are expected to work as well.

Download a dataset with which to use the model. We used the Stanford Background Dataset, which has 700+ images of 320x240 pixels and 8 classes. The code also works with the Data from Games dataset, which has 25,000 images of 1914 × 1052 pixels and 38 categories.

To use another dataset, make sure it is organized like one of the two above, and specify during training and testing which dataset it is "mimicking". Specifically, both datasets keep their data in one folder, with subfolders "labels" and "images" for labels and images, respectively. The Stanford dataset stores labels as space-separated digits in a text file, while the "Data from Games" dataset stores labels as paletted images, where each color corresponds to a different label.
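As an illustration of the text label format described above, here is a minimal sketch of reading one label file into a 2-D grid. The function name `read_text_labels` is hypothetical and is not taken from preprocessing.py; the sketch assumes one image row per line, with one integer label per pixel.

```python
import io


def read_text_labels(lines):
    """Parse rows of space-separated integer labels into a list of rows.

    `lines` is any iterable of strings, e.g. an open text file.
    Hypothetical helper: the layout (one image row per line) is an assumption.
    """
    rows = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # ignore blank lines
        # int() also accepts negative values, should a file use one
        # to mark unlabeled pixels.
        rows.append([int(token) for token in line.split()])
    # A label grid must be rectangular to line up with its image.
    if rows and any(len(row) != len(rows[0]) for row in rows):
        raise ValueError("label rows have inconsistent widths")
    return rows


# Tiny in-memory example standing in for a file under "labels/".
sample = io.StringIO("0 0 1\n0 2 1\n")
labels = read_text_labels(sample)
print(len(labels), "rows x", len(labels[0]), "columns")
```

A custom dataset that mimics the Stanford layout would need its label files to parse cleanly in this way, with the grid dimensions matching the corresponding image.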
Generate a text file that maps colors to category numbers and labels. Each line of the file has five space-separated values:

R G B category_num category_id

The R, G, and B values should be in the range [0,1]. Category files for the Stanford Background and Data from Games datasets are provided in the category_maps folder.
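To make the file format concrete, the following sketch parses such a category map into a dictionary keyed by color. It is not the repository's own parser: the name `parse_category_map`, the conversion of colors to 8-bit integers, and the treatment of `category_id` as a plain string are all assumptions made for illustration.

```python
def parse_category_map(lines):
    """Map (R, G, B) colors to (category_num, category_id).

    Each non-blank line must hold five space-separated values:
    R G B category_num category_id, with R, G, B in [0, 1].
    Colors are stored as 8-bit integer tuples (an assumption) so they
    can be compared with pixels read from a paletted label image.
    """
    mapping = {}
    for line_number, line in enumerate(lines, start=1):
        parts = line.split()
        if not parts:
            continue  # ignore blank lines
        if len(parts) != 5:
            raise ValueError(f"line {line_number}: expected 5 values, got {len(parts)}")
        r, g, b = (float(value) for value in parts[:3])
        if not all(0.0 <= channel <= 1.0 for channel in (r, g, b)):
            raise ValueError(f"line {line_number}: R, G, B must be in [0, 1]")
        color = tuple(int(round(channel * 255)) for channel in (r, g, b))
        # category_id is kept as a string here; its real type is not
        # specified in this README.
        mapping[color] = (int(parts[3]), parts[4])
    return mapping


# Made-up entries for illustration, not copied from category_maps/.
example_lines = ["0 0 1 0 sky", "0 1 0 1 tree"]
category_map = parse_category_map(example_lines)
print(category_map[(0, 0, 255)])
```

Keying the dictionary by integer color tuples avoids comparing floating-point values when looking up the category of a pixel.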

Running

Training

For training, use the train.py script with the --training flag. The following command trains the model on the Stanford dataset:

python3 train.py --training --dataset stanford-bground --category_map category_maps/stanford_bground_categories.txt --data_dir train_data/ --model_save_path train_model_rcv/

Running train.py -h will show additional parameters for the script, including different hyperparameters.

Testing

For testing, use the train.py script without the --training flag. The script computes per-class accuracies for each image and writes the predicted labels as image files. The following command loads a saved model and evaluates its accuracy on the Stanford dataset:

python3 train.py --model_load_path train_model_rcv/ --category_map category_maps/stanford_bground_categories.txt --dataset stanford-bground --data_dir test_data/ --output_dir test_output_rcv/

This outputs per-class accuracies for each layer of the rCNN, and also saves the predicted labels for each layer.
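For readers unfamiliar with the metric, here is a small sketch of one common definition of per-class accuracy: for each class, the fraction of pixels whose true label is that class and whose predicted label matches. The function `per_class_accuracy` is hypothetical, and train.py may compute the metric differently.

```python
def per_class_accuracy(true_labels, predicted_labels, num_classes):
    """Return one accuracy per class, or None for classes with no pixels.

    Both inputs are 2-D grids (lists of rows) of integer labels with the
    same shape. Pixels whose true label lies outside [0, num_classes)
    are ignored (an assumption about how unlabeled pixels are handled).
    """
    correct = [0] * num_classes
    total = [0] * num_classes
    for true_row, predicted_row in zip(true_labels, predicted_labels):
        for true_label, predicted_label in zip(true_row, predicted_row):
            if 0 <= true_label < num_classes:
                total[true_label] += 1
                if true_label == predicted_label:
                    correct[true_label] += 1
    return [c / n if n else None for c, n in zip(correct, total)]


# Toy 2x3 example with three classes present and a fourth that is absent.
truth = [[0, 0, 1], [1, 1, 2]]
prediction = [[0, 1, 1], [1, 0, 2]]
accuracies = per_class_accuracy(truth, prediction, num_classes=4)
for class_index, accuracy in enumerate(accuracies):
    print(class_index, accuracy)
```

Returning None for absent classes keeps them distinguishable from classes that are present but never predicted correctly.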

Credits

The code for the original project may be found at: https://github.com/NP-coder/CLPS1520Project
