vishal3477 / Reverse_Engineering_GMs

Official Pytorch implementation of paper "Reverse Engineering of Generative Models: Inferring Model Hyperparameters from Generated Images"

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

question about the Neural network explosion

zhangtzq opened this issue · comments

Hi, I have reproduced the code for the image attribution. I get the neural network explosion during training. Surprisingly, I got an accuracy of about 85% before the explosion. I want to know why. I observe that when the explosion happens, the weights of the net obtain nan and the CE loss increases. Can you help me to solve the issue? I used the Adam optimizer.

Hi,
Can you provide more details about the error? Usually, because of using fft functions of pytorch. I would advise you to restart from the last good checkpoint with a lower learning rate to not allow the model change much.