fahos / hessian-for-basicDL

can calculate the Hessian matrix and/or its spectrum for simple neural nets

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

hessian-for-basicDL

Developed in collaboration with vug. The repo can calculate the Hessian matrix and/or its spectrum for simple neural nets. It can also find the largest and smallest few eigenvalues without finding the full Hessian.

References

Some results using this code has been used in the following work:

[1] Levent Sagun, Utku Evci, V. Ugur Guney, Yann Dauphin, Leon Bottou, Empirical Analysis of the Hessian of Over-Parametrized Neural Networks

Relevant related works are:

[2] Levent Sagun, Leon Bottou, Yann LeCun, Eigenvalues of the Hessian in Deep Learning: Singularity and Beyond

[3] Pratik Chaudhari, Anna Choromanska, Stefano Soatto, Yann LeCun, Carlo Baldassi, Christian Borgs, Jennifer Chayes, Levent Sagun, Riccardo Zecchina, Entropy-SGD: Biasing Gradient Descent Into Wide Valleys

About

can calculate the Hessian matrix and/or its spectrum for simple neural nets


Languages

Language:Python 99.6%Language:Shell 0.4%