shreyassaxena / convolutional-neural-fabrics

Architecture learning for CNN's

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

convolutional-neural-fabrics

Introduction

In our work, we propose a “fabric” that embeds an exponentially large number of CNN architectures. The fabrics sidesteps the tedious process of specifying, training and testing individual networks in order to find good architectures. The fabric circumvents 8 out of 10 hyperparameters of the CNN architecture and has only 2 hyperparameters.

Detailed description of the system is provided in our arXiv technical report: https://arxiv.org/abs/1606.02492 [To appear at NIPS16]. Collected resources are available at our Project Page.

Citation

If you're using this code in a publication, please cite our paper.

@InProceedings{saxena2016convolutional,
  title={Convolutional Neural Fabrics},
  author={Saxena, Shreyas and Verbeek, Jakob},
  BookTitle={NIPS},
  year={2016}
}

System Requirements

  1. This software is tested on Fedora release 21 (64bit).
  2. MATLAB (tested with 2013b on 64-bit Linux)
  3. Prerequisites for caffe (http://caffe.berkeleyvision.org/installation.html#prequequisites).

Getting started

  1. Code: Our caffe based implementation is a modified version of - https://github.com/HyeonwooNoh/caffe.git. You need to recompile caffe for other platforms.

Training CNF

** Demo on MNIST

  1. Data: You need to download and post-process MNIST for the demo code (See ./MNIST/demo_mnist.txt for further instructions)
  2. Follow the steps in demo_mnist.txt

Third-party re-implementations

  1. PyTorch, training code for CIFAR10: code

About

Architecture learning for CNN's


Languages

Language:Jupyter Notebook 45.5%Language:C++ 43.5%Language:Python 4.7%Language:Cuda 2.7%Language:MATLAB 1.2%Language:CMake 0.8%Language:Protocol Buffer 0.7%Language:Makefile 0.3%Language:Shell 0.3%Language:CSS 0.2%Language:HTML 0.1%