JohnUiterwyk / GeneticVision

Genetic Vision is a machine learning application that evolves computer vision programs using RmitGP and OpenCV

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

GeneticVision

by John Uiterwyk

GeneticVision is a C++ machine learning application that evolves computer vision programs.

This application utilises the following libraries:

  • RmitGP - a genetic programming library developed at RMIT University .
  • OpenCV - an open source computer vision library
  • jsoncpp - a json library

Version 1.2 Release Notes

  • Added multi-threading and --numOfThreads option
  • Updated to C++11 (n.b. gcc 4.9+ and Cmake 3.1+ are required now)
  • Filename of outputted images now zero fills the generation id portion of the file name
  • Bug fixes and other minor improvements
  • Added more examples

Version 1.1 Release Notes

  • Added test flag flag which enables performance stats and color annotated result images
  • Runs now create a timestamped directory in the output folder
  • Bug fixes and other minor improvements

Table of Contents

Supported Platforms

  • Linux
  • OS X

The application has been tested on OS X 10.10, CentOS 6.3, and Amazon Linux AMI 2015.03

Prerequisites

  • OpenCV 2.4.x
  • CMake 3.1+
  • GCC 4.9+

Downloading

You can download a zip of the the latest version of GeneticVision from github Alternatively you can clone the repo from github

Using GeneticVision

The simplest way to run GeneticVision (GV for short) is as follows:

$ ./GeneticVision --images path/to/images

This will evolve a population using the images provided and with default settings.

Additional Examples

Load a population and continue evolving

$ ./GeneticVision --population path/to/population.gen --images path/to/images

Load a population and test it with a set of images, saving color annotated result images

$ ./GeneticVision --test --population path/to/population.gen --images path/to/images --saveResultImages

Load a config file

$ ./GeneticVision --config path/to/config.json

Configure a run with command line options

$ ./GeneticVision --images path/to/images --populationSize 300 --generations 1000 --targetFitness 0.1

Preparing Training Images

The Genetic Vision application works by taking a source image and trying to evolve an application that predicts a target image. Ideally you provide the application with a large set of source/target image pairs to train with. Genetic Vision expects a set of images to be contained withing a directory and using a specific naming convention. The basic rule is the target image should have the same name as the source file with -mask added to the filename. So for example, if you have a source image example.png, the target image should be named example-mask.png. The application accepts the following file types:

  • .png
  • .jpg
  • .bmp
  • .tif

Output

When you run GeneticVision, it will create a folder in the current directory names output/ . Each run creates a timestamped directory in the output directory. Each timestamped run directory incldues run.log, gen files, and result images. The timestamped directory uses the pattern gv-run-yyyy-mm-dd-hh-mm-ss. For example output/gv-run-2015-07-02-14-45-41.:

  • output/gv-run-2015-07-02-14-45-41/populations where it will write population files
  • output/gv-run-2015-07-02-14-45-41/images directory where result image will be written.
  • output/gv-run-2015-07-02-14-45-41/run.log a log output of the evolution runs

You can change the output directory via the outputPath option.

Options:

The only required option is the images option. Without images, nothing can be done. Options can be set via config file or via command line option. Command line options will override config file settings. Loading a population file will override both command line and config file settings.

Evolve

Evolve will evolve a population of programs.
Command line usage: --evolve
Config file usage: "evolve":"true" Option argument type: bool Default: true

Test

Test output averaged accuracy/error information of the best program.

  • If evolve is false, this will just test the starting population. This is intended to be used to test a population file.
  • If evolve is true, this test will be performed every X generations where X is the logFrequency. Note: Evolve defaults to false when the test flag is included, you must explicitly include the evolve flag when using the test flag.

The test flag is used to assess the performance of producing a given black and white target mask. Black is considered a positive, and white is considered a negative.

This will output the following stats to stdout:

  • Accuracy
  • Error rate
  • True positive rate
  • False positive rate
  • True negative rate
  • False negative rate

This will write images to disk with true/false positive/negative coloration. Performance assesment images use the following color coding:

  • Black: true positive
  • White: true negative
  • Green: false positive
  • Red: false negative

This will output a window of true/false positive/negative colored result image using OpenCV HighGui (if enabled).

Command line usage: --test
Config file usage: "test":"true"
Option argument type: bool Default: false

Example: Load a population, test the best program, and save the resulting predicted images with performance coloration:

$ ./GeneticVision --test --population path/to/population.gen --images path/to/images --saveResultImages

Load Images

Load a set of images for the purpose of training, testing, or running. This option requires that images sets be prepared using a naming convention. Training is performed using pairs of images: a source image and a target image. The target image is what we want the evolved program to output given the source images. Target images should be have the same filename as the source images only with the string '-mask' inserted before the file extension. For example:

easy-square0001.png
easy-square0001-mask.png
easy-square0002.png
easy-square0003-mask.png
...

Be sure to include the trailing slash.

Command line usage: --images path/to/images/ (path relative to working directory) Config file usage: "images":"path/to/images/" (path relative to config file) Option argument type: string path to images directory

Maximum Generations

Maximum number of generations to evolve a population. The evolution is terminated when a program reaches the target fitness or when the maximum number of generations has been reached.

Command line usage: --generations 3000
Config file usage: "maxGenerations": 3000 Option argument type: integer Default: 100

Population Size

The number of programs in the population.

Command line usage: --populationSize 100
Config file usage: "populationSize": 100
Option argument type: integer Default: 100

Mutation Rate

The rate of mutation when producing a new generation from a population.

Command line usage: --mutation 0.70
Config file usage: "mutation": 0.70
Option argument type: float between 0 and 1
Default: 0.70

Crossover Rate

The rate of crossover when producing a new generation from a population.

Command line usage: --crossover 0.28
Config file usage: "crossover": 0.28
Option argument type: float between 0 and 1
Default: 0.28

Elitism Rate

The rate of elitism when producing a new generation from a population.

Command line usage: --elitism 0.02
Config file usage: "elitism": 0.02
Option argument type: float between 0 and 1
Default: 0.02

Minimum Depth

The minimum depth of a program tree.

Command line usage: --minDepth 2
Config file usage: "minDepth": 2
Option argument type: integer
Default: 0.02

Maximum Depth

The maximum depth of a program tree. Increasing this can result in an exponential increase in run time for a generation.

Command line usage: --maxDepth 5
Config file usage: "maxDepth": 5
Option argument type: integer
Default: 5

Target Fitness

The target fitness for the evolution run. If the best program of a population is less than this number, then the evolution run is terminated.

Command line usage: --targetFitness 0.02
Config file usage: "targetFitness": 0.02
Option argument type: float between 0 and 1
Default: 0.02

Log Frequency

This sets how frequently logging information is outputed. This option affects the frequency of the following:

  • writing summary information for the current generation to stdout
  • writing result images to disk for the current generation
  • writing gen files to disk

Command line usage: --logFrequency 10
Config file usage: "logFrequency": 10
Option argument type: integer
Default: 20

Output Path

Each run creates a timestamped directory in the output directory. Each timestamped run directory incldues run.log, gen files, and result images. The timestamped directory uses the pattern gv-run-yyyy-mm-dd-hh-mm-ss. For example output/gv-run-2015-07-02-14-45-41.

Command line usage: --outputPath output/
Config file usage: "outputPath":"output/"
Option argument type: string path to output directory
Default: output/

Save Result Images

If this option is enabled, every X generations, an image will be written to disk for each image pair in the training set. The frequency of image writing is determined by the logFrequency option. The file pattern will be filenamekey-gen-000001.png. Image files will be written to the images/ directory in the output directory. If the --test flag is used images images will include the performance coloration.

Number of threads

GeneticVision supports multithreading.

Command line usage: --numOfThreads Config file usage: "numOfThreads": 16 Option argument type: integer Default: 2 x number reported by std::thread::hardware_concurrency.

Load Population File

Load a population from a *.gen file. The value for this option should be the path to the gen file. Loading a population file will overide most population specific settings.

Command line usage: --population path/to/file.gen (path relative to working directory) Config file usage: "population": "path/to/file.gen" (path relative to config file) Option argument type: string

Load Configuration File

Load settings from a JSON formatted configuration file. Please note that image and population paths within a config file are relative to the config file containing directory unless the cat Path option is set.

Command line usage: --config path/to/config
Option argument type: string

Example Config File:
    {  
        "maxGenerations":3000,  
        "populationSize": 150,  
        "mutation":0.28,  
        "crossover":0.70,  
        "elitism":0.02,  
        "minDepth": 2,  
        "maxDepth":5,  m
        "targetFitness":0.01,  
        "logFrequency":100,  
        "outputPath":"output/",  
        "saveResultImages": false,  
        "images":"images/"  
    }

Compiling

The basic process for compiling GeneticVision is as follows:

  • Install OpenCV; GeneticVision requires that OpenCV has been installed correctly.
  • Clone the GeneticVision project from the GitHub repo
  • Compile the project using CMake 2.8 or higher
  • Optional: You may need to set the path to a newer version of gcc if you are having issues compiling. You can do this by $ export CXX=path/to/g++
  • Prep a directory of images with proper naming conventions.
  • Execute an evolution run using the compiled binary and the prepared images

Step By Step Guide

  • Make the project directory $ mkdir ~/genetic-vision/
  • Change to the project directory $ cd ~/genetic-vision/
  • Clone the project from the github repo: $ git clone https://github.com/JohnUiterwyk/genetic-vision.git .
  • Make a directory for the build files $ mkdir ~/genetic-vision/build
  • Make a directory for the bin files $ mkdir ~/genetic-vision/bin
  • Change to the build directory $ cd ~/genetic-vision/build
  • Compile the project with CMake, passing the path to the parent folder $ cmake ~/genetic-vision . This is performing what is known as an 'out of source build'. It helps to keep the resulting file/folder structure clean.
  • Make the executable $ make
  • Move the executable to the bin directory $ mv GeneticVision ~/genetic-vision/bin
  • Delete the build directory $ rm -rf ~/genetic-vision/build
  • Change to the data working directory $ cd ~/my-test-data/
  • Create a folder for the images $ mkdir training-images/ and add source/target pairs of images using the convention described in the "Preparing Training Images" section.
  • Run GV with the images directory $ ~/genetic-vision/bin/GeneticVision --evolve --images ./training-images

GUI CMake Option

To compile the application with support for an output to a window, compile with the Cmake option "-D GUI=ON". You will need to have installed OpenCV with HighGui support enabled.

Developer Guide

The code has been thoroughly commented and a reference has been generated using Doxygen. You can view the doxygen output here: (TODO)

Project Road Map

You can view the project roadmap task board here

Release Log

  • 1.1
    • Added test flag which enables preformance stats and color annotated result images
    • Runs now create a timestamped directory in the output folder
    • Bug fixes and other minor improvements
  • 1.0
    • Initial release

About

Genetic Vision is a machine learning application that evolves computer vision programs using RmitGP and OpenCV


Languages

Language:C++ 98.1%Language:CMake 0.7%Language:Shell 0.6%Language:C 0.4%Language:Makefile 0.2%