wgrathwohl/JEM Issues
Model training terminated
Updated 6
Training time 36 hours
Updated 2
How to train JEM with batch norm
Updated
Pretrained Model
Closed 1
Dealing with divergence
Updated 5
Volatile accuracy
Updated
Estimate log p(x,y)
Updated
Project site for "Your Classifier is Secretly an Energy-Based Model and You Should Treat it Like One"