Mohammad8921 / UniformStabilityOfAdam

All codes of my master thesis conducted under computational learning theory

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Uniform Stability of Adam Algorithm and its Effect on Generalization Error of Deep Neural Networks

Info

This repo contains all of the codes of my theoretical thesis in MSc supervised by Dr. Amin Gheibi, conducted at Amirkabir University of Technology (Tehran Polytechnic).

Abstract

  • We theoretically connected the Lipschitz constant and maximum value of a loss function to the generalization error of deep learning models trained by the Adam and AdamW optimizers under the uniform stability theory.
  • Using the theoretical results, we proposed a novel loss function for training deep classification models to improve the generalization performance and overcome the over-fitting issue.
  • We assessed our theorems in human age estimation based on face images.
  • We trained deep neural networks using our new loss function in the image and node classification problems in order to stabilize the output models and increase their accuracy.
  • Requirements:
    • Python 3.7.8
    • SciPy 1.7.3
    • OpenCV 4.5.5
    • PyTorch 1.11
    • CUDA 11.3
    • Torchvision 0.12
    • PyTorch Geometric 2.1

Paper

About

All codes of my master thesis conducted under computational learning theory


Languages

Language:Jupyter Notebook 100.0%