huangleiBuaa / CenteredWN

This project is the Torch implementation of our ICCV 2017 paper: Centered Weight Normalization in Accelerating Training of Deep Neural Networks

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Centered Weight Normalization

This project is the Torch implementation of the paper: Centered Weight Normalization in Accelerating Training of Deep Neural Networks ( ICCV 2017).

  • bibtex:
@INPROCEEDINGS{Huang2017ICCV,
    author = {Lei Huang and Xianglong Liu and Yang Liu and  Bo Lang and Dacheng Tao},
    title = {Centered Weight Normalization  in Accelerating Training of Deep Neural Networks},
    booktitle = {ICCV},
    year = {2017}}

Updates

Requirements and Dependency

  • Install Torch with CUDA GPU
  • Install cudnn v5
  • Install dependent lua packages optnet by run: luarocks install optnet

Experiments in the paper

1. MLP architecture over SVHN dataset

  • Dataset prepraration: We can get the preprocessed SVHN dataset for MLP architecture by running:
  cd dataset
   th preProcess_div256.lua

Note that this script is based on the Torch script for SVHN

  • Execute:
 th exp_MLP.lua 
  • To reproduce the experimental results, you can run the script below, which include all the information of experimental configurations:
 bash 1_execute_MLP_svhn.sh  
 bash 1_execute_MLP_svhn_adam.sh  

2. VGG-A architecture over Cifar-10 dataset

  • Dataset preparations: the dataset is based on the preprocessed script on: https://github.com/szagoruyko/cifar.torch, and you should put the data file in the directory: './dataset/cifar_provider.t7'

  • Execute:

 th exp_vggA.lua –dataPath './dataset/cifar_provider.t7'
  • To reproduce the experimental results, you can run the script below, which include all the information of experimental configurations:
 bash   2_execute_Conv_CIFAR10_vggA.sh

3. GoogLeNet architecture over Cifar datasets

th exp_GoogleNet_dataWhitening.lua –dataPath './dataset/cifar100_whitened.t7'
  • To reproduce the experimental results, you can run the script below, which include all the information of experimental configurations:
 3_execute_Conv_CIFAR100_GoogLeNet.sh 

The GoogLeNet model is based on the project on: https://github.com/soumith/imagenet-multiGPU.torch

4. Residual network architecture over Cifar datasets

  • Dataset preparations: The dataset is based on original CIFAR datasets, and the data file should be put in the directory: ./dataset/cifar_original.t7.
  • Execute:
th exp_res_dataNorm.lua –dataPath './dataset/cifar10_original.t7'
  • To reproduce the experimental results, you can run the script below, which include all the information of experimental configurations:
4_execute_Conv_CIFAR10_resnet.sh

The normlization of Cifar dataset is in the script th exp_res_dataNorm.lua. The residual network model and respective script are based on facebook ResNet.

5. GoogLeNet over ImageNet

This experiment is based on the project at: https://github.com/soumith/imagenet-multiGPU.torch.
The proposed model are in: './models/imagenet/'

Contact

huanglei@nlsde.buaa.edu.cn, Any discussions and suggestions are welcome!

About

This project is the Torch implementation of our ICCV 2017 paper: Centered Weight Normalization in Accelerating Training of Deep Neural Networks

License:BSD 2-Clause "Simplified" License


Languages

Language:Lua 97.6%Language:Shell 1.3%Language:Python 1.1%