NNNCVXF - Neural Network training via Non-ConVeX Feasibility

Neural Network training via Non-ConVeX Feasibility

This repository contains scripts to reproduce the examples from:

@article{Peters2021OutputConstrainedNetworks,
  title={Point-to-set distance functions for output-constrained neural networks},
  author={Peters, Bas},
  year={2021}
}

This software is not intended as a general neural network toolbox. Some generalizations of the current software are planned in the near future.

Installation for Julia 1.5:

add https://github.com/PetersBas/NNNCVXF.git

NNNCVXF also depends on the packages:

Examples:

Time-Lapse HyperSpectral land-use segmentation Segment two 3D data volumes of hyperspectral data to identify land use. Assumes just 20 point annotations for class one, no annotation for class 2, and prior knowledge on the percentage of surface area that experienced land-use change. (set up for GPU)
CamVid street scenes Segment 2D RGB images into different object types. In this modified CamVid experiment, we use just 47 images for training and 15 for validation, with 8 point annotations per class per image. We also assume approximate information on the anisotropic total-variation of the training images. (set up for GPU)
Single image segmentation with corruption Zebra Bike These experiments train a neural network to segment a single image based on a bounding box and some prior knowledge. No pre-training was used. The network is trained from scratch for each image. Images contain coherent corruption in the form of missing rows. Prior knowledge consists of a 'simple' image description using a Minkowski set that is the sum of monotonically increasing and decreasing image components, as well as rough bounds on the size of the object. (set up for CPU, although GPU would be faster). See also Bike (Grabcut,Python) and Zebra (Grabcut,Python)

Required Data:

Data is provided for the single image segmentation with corruption examples Zebra Bike. Data files are larger for Time-Lapse HyperSpectral land-use segmentation and CamVid street scenes. Download instructions are included in those two scripts.

Basic Code Functionality:

All examples follow the same workflow

Set up training (and possibly validation) data and labels (if any are available).
Set up projection operators that project onto the intersection of constraint sets, implemented by SetIntersectionProjection
Set up the neural network. This code fixes the network to be a fully reversible (invertible) hyperbolic network, implemented by InvertibleNetworks. You can still set the length, kernel sizes, width, number of input and output channels.
Train the network. This uses gradient descent (based) methods. The loss and gradient are computed via the squared distance of the neural network output to the intersection of constraint sets. This work shows that the gradient computation is possible via standard adjoint-state/backpropagation: 1) forward propagation of the input data through the network. 2) compute loss and final Lagrangian multiplier. 3) interleave backward propagation to obtain the other Lagrangian multipliers in reverse order, with the gradient computation for network parameters. 4) update network parameters at the end
Predict and plot results. After training, we obtain a prediction (that does not use any constraints) by forward propagating the input data though the network. The output is then the predicted probability per class for each pixel. Plotting shows things like loss function per iteration, network output per class, thresholded network ouput showing the most likely class for each pixel, and data + predition plotted on top of each other.

PetersBas / NNNCVXF

NNNCVXF - Neural Network training via Non-ConVeX Feasibility

Installation for Julia 1.5:

Examples:

Required Data:

Basic Code Functionality:

About

Languages