SlimNets

A comparison of popular methods for creating more efficient (smaller and faster) neural networks.

Framework: PyTorch

Dataset: CIFAR10

Models: VGG-11, VGG-16, VGG-19

Methods (minimal sketches of each follow the list):

  1. Gradual Pruning (sparse)
  2. Low-Rank Factorization
  3. Knowledge Distillation
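
The sketches below are minimal, hypothetical PyTorch illustrations of each method; they are not the repo's actual implementations.

Gradual pruning repeatedly zeroes the smallest-magnitude weights so that sparsity rises over the course of training. A single magnitude-pruning step for one layer might look like:

```python
import torch
import torch.nn as nn

def magnitude_prune_(layer: nn.Linear, sparsity: float) -> None:
    """Zero the smallest-magnitude weights of a layer in place (illustrative only)."""
    w = layer.weight.data
    k = int(sparsity * w.numel())
    if k == 0:
        return
    # kthvalue gives the k-th smallest absolute weight, used as the cutoff.
    threshold = w.abs().flatten().kthvalue(k).values
    w[w.abs() <= threshold] = 0.0
```

Low-rank factorization replaces a weight matrix with the product of two thinner matrices. A truncated-SVD split of a linear layer, as one possible approach:

```python
def factorize_linear(layer: nn.Linear, rank: int) -> nn.Sequential:
    """Approximate one Linear with two low-rank Linears via truncated SVD."""
    U, S, Vh = torch.linalg.svd(layer.weight.data, full_matrices=False)
    sqrt_s = S[:rank].sqrt()
    first = nn.Linear(layer.in_features, rank, bias=False)
    second = nn.Linear(rank, layer.out_features, bias=layer.bias is not None)
    first.weight.data = torch.diag(sqrt_s) @ Vh[:rank]     # shape (rank, in)
    second.weight.data = U[:, :rank] @ torch.diag(sqrt_s)  # shape (out, rank)
    if layer.bias is not None:
        second.bias.data = layer.bias.data.clone()
    return nn.Sequential(first, second)
```

Knowledge distillation trains a small student to match a large teacher's softened outputs; the standard Hinton-style loss:

```python
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, targets,
                      T: float = 4.0, alpha: float = 0.9):
    """Blend softened-teacher KL divergence with hard-label cross-entropy."""
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=1),
        F.softmax(teacher_logits / T, dim=1),
        reduction="batchmean",
    ) * (T * T)  # rescale gradients after temperature softening
    hard = F.cross_entropy(student_logits, targets)
    return alpha * soft + (1 - alpha) * hard
```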

Absolute Metrics (see the measurement sketch after this list):

  1. Model size (weights)
  2. Test Accuracy
  3. Training Time (h)
  4. Inference Time (s)
  5. Runtime model size
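
A rough sketch of how the absolute metrics might be collected for one of the VGG variants (hypothetical helper code assuming torchvision's vgg11, not the repo's own measurement script):

```python
import io
import time

import torch
from torchvision.models import vgg11

model = vgg11(num_classes=10).eval()  # CIFAR-10 has 10 classes

# Model size: parameter count plus the size of the serialized weights.
n_params = sum(p.numel() for p in model.parameters())
buf = io.BytesIO()
torch.save(model.state_dict(), buf)
size_mb = buf.getbuffer().nbytes / 1e6

# Inference time: wall-clock latency for one CIFAR-10-sized batch.
x = torch.randn(128, 3, 32, 32)
with torch.no_grad():
    start = time.perf_counter()
    model(x)
    infer_s = time.perf_counter() - start

print(f"{n_params:,} parameters, {size_mb:.1f} MB serialized, {infer_s:.3f} s/batch")
```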

Relative Metrics (see the sketch after this list):

  1. Model compression rate
  2. Training and inference speedups
  3. Accuracy of the compressed model relative to the original
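
The relative metrics are simple ratios of the corresponding absolute measurements; the numbers below are made up purely for illustration:

```python
def compression_rate(original_mb: float, compressed_mb: float) -> float:
    """How many times smaller the compressed model is."""
    return original_mb / compressed_mb

def speedup(baseline_s: float, compressed_s: float) -> float:
    """How many times faster training or inference runs."""
    return baseline_s / compressed_s

def relative_accuracy(small_acc: float, large_acc: float) -> float:
    """Fraction of the large model's accuracy the small model retains."""
    return small_acc / large_acc

# Hypothetical example values:
print(compression_rate(507.0, 36.0))   # ~14x smaller
print(speedup(1.20, 0.35))             # ~3.4x faster
print(relative_accuracy(0.89, 0.92))   # retains ~97% of the accuracy
```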

References:

  1. https://github.com/chengyangfu/pytorch-vgg-cifar10
  2. https://github.com/pytorch/vision/blob/master/torchvision/models/vgg.py
  3. https://github.com/jacobgil/pytorch-pruning/blob/master/finetune.py
  4. https://github.com/bearpaw/pytorch-classification/blob/master/models/cifar/alexnet.py
  5. https://github.com/jiecaoyu/pytorch-nin-cifar10/blob/master/original.py
  6. https://github.com/wanglouis49/pytorch-weights_pruning

Gradual Pruning Progression Curves based on:

M. Zhu and S. Gupta, "To prune, or not to prune: exploring the efficacy of pruning for model compression," arXiv preprint arXiv:1710.01878, Oct. 2017.
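
The paper's schedule ramps sparsity from an initial value s_i to a final value s_f over n pruning steps with a cubic curve, so pruning is aggressive early and tapers off as the network recovers. A direct transcription of the schedule:

```python
def zhu_gupta_sparsity(t: int, t0: int, n: int, dt: int,
                       s_i: float = 0.0, s_f: float = 0.9) -> float:
    """Cubic sparsity schedule from Zhu & Gupta (2017):

        s_t = s_f + (s_i - s_f) * (1 - (t - t0) / (n * dt))**3

    evaluated at pruning steps t = t0, t0 + dt, ..., t0 + n * dt.
    """
    assert t0 <= t <= t0 + n * dt
    return s_f + (s_i - s_f) * (1.0 - (t - t0) / (n * dt)) ** 3
```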


VGG-19 Loss Progression with Gradual Pruning
