Sam Greydanus. 2018. MIT License.
Written in PyTorch
A set of self-contained Jupyter notebooks aimed at providing quick and easy-to-modify baselines. Specifically:
- mnist-np: a 3-layer dense ReLU network implemented entirely in NumPy (test acc: 96.8%)
- mnist-fc: a 3-layer dense ReLU network implemented in PyTorch (test acc: 96.8%) (see the sketch after this list)
- mnist-cnn: a 3-layer ReLU CNN implemented in PyTorch (test acc: 98.7%)
- mnist-seq: a 3-layer dense recurrent (GRU) network trained on a sequential MNIST task
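As a quick reference, here is a minimal sketch of roughly what the mnist-fc baseline looks like. The hidden size, optimizer, and learning rate below are illustrative guesses, not the notebook's exact values:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Illustrative guess at the mnist-fc architecture: three dense layers with ReLU.
# The hidden size (256) is an assumption, not the notebook's exact value.
class DenseNet(nn.Module):
    def __init__(self, input_dim=784, hidden_dim=256, output_dim=10):
        super(DenseNet, self).__init__()
        self.fc1 = nn.Linear(input_dim, hidden_dim)
        self.fc2 = nn.Linear(hidden_dim, hidden_dim)
        self.fc3 = nn.Linear(hidden_dim, output_dim)

    def forward(self, x):
        x = x.view(x.size(0), -1)        # flatten 28x28 images to 784-dim vectors
        h = F.relu(self.fc1(x))
        h = F.relu(self.fc2(h))
        return self.fc3(h)               # raw logits; pair with nn.CrossEntropyLoss

model = DenseNet()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
criterion = nn.CrossEntropyLoss()
```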
I trained these models for only 1-3 epochs on MNIST, so they could probably do a little better. I wanted these baselines to be:
- something I could train in ~2 minutes on my laptop
- something that captured the general idea (of backprop, of neural nets, of cnns, of seq2seq models respectively)
- something very reproducible (I save training stats and model checkpoints along the way; a sketch of this appears below)
- something easy to visualize/understand
- The code is minimal
- I plot training stats and a few examples at the end
- Each notebook is self-contained (minimal dependencies)
- Note: I don't include pretrained models for the MNIST baselines because each notebook runs in under 5 minutes on a 2014 MacBook.
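To illustrate the "save training stats and models along the way" point, a training loop in this style might log and checkpoint roughly like this. The logging interval, file name, and function names are assumptions, not what the notebooks actually use:

```python
import torch
import matplotlib.pyplot as plt

# Hypothetical bookkeeping sketch: log the loss and checkpoint the model
# periodically during training. Interval and checkpoint path are assumptions.
def train(model, optimizer, criterion, loader, epochs=2, ckpt_path='model.pt'):
    stats = {'step': [], 'loss': []}
    step = 0
    for epoch in range(epochs):
        for x, y in loader:
            optimizer.zero_grad()
            loss = criterion(model(x), y)
            loss.backward()
            optimizer.step()
            step += 1
            if step % 100 == 0:
                stats['step'].append(step)
                stats['loss'].append(loss.item())
                torch.save(model.state_dict(), ckpt_path)  # checkpoint along the way
    return stats

# Plot training stats at the end, as the notebooks do:
# stats = train(model, optimizer, criterion, train_loader)
# plt.plot(stats['step'], stats['loss'])
# plt.xlabel('step'); plt.ylabel('loss'); plt.show()
```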
- All code is written in Python 3.6. You will need:
- NumPy
- Matplotlib
- PyTorch 0.3+: easier to write and debug than TensorFlow :)
- Jupyter
- maybe my Excitation Backprop code
- Pure NumPy classifier
- Dense classifier (PyTorch)
- Convolutional classifier (PyTorch)
- Sequential model (PyTorch) (see the sketch below)
- Visualizing the sequential model (PyTorch)
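For the sequential model, here is a rough sketch of what a GRU classifier on sequential MNIST might look like. The hidden size (128) and the row-by-row reading of each image are my assumptions, not necessarily what the notebook does:

```python
import torch
import torch.nn as nn

# Rough guess at the mnist-seq architecture: a GRU that reads each 28x28 image
# as a sequence of 28 rows, then classifies from the final hidden state.
class SeqClassifier(nn.Module):
    def __init__(self, input_dim=28, hidden_dim=128, output_dim=10):
        super(SeqClassifier, self).__init__()
        self.gru = nn.GRU(input_dim, hidden_dim, batch_first=True)
        self.fc = nn.Linear(hidden_dim, output_dim)

    def forward(self, x):
        x = x.view(x.size(0), 28, 28)   # (batch, seq_len=28 rows, 28 pixels per row)
        out, h = self.gru(x)
        return self.fc(out[:, -1, :])   # classify from the last time step
```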