grigorisg9gr / polynomial_nets

Official Implementation of the CVPR'20 paper 'Π-nets: Deep Polynomial Neural Networks' and its T-PAMI-21 extension.

Cannot reproduce results on image_generation_pytorch

k4ntz opened this issue · comments

Hi, I have tried to reproduce the paper's image-generation results on CIFAR10.
I launched main.py with the default parameters, varying only the activation_fn param.
I get an IS of:

  • 5.6 with the activation function enabled;
  • 3.7 with the same model but no activation function.

Do you have any recommendation on the parameters needed to reproduce the reported results, and/or any advice if I try to build a PolyNet to classify CIFAR10 images? I don't get very good results on that task either.

Hi,

thanks for your interest in our work. I believe the model you are using is simply the conversion of the DCGAN generator into a polynomial, right? In addition, the FID/IS scores reported in papers are typically computed with the TensorFlow Inception network; the PyTorch version does not produce exactly matching scores.
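For comparing numbers across codebases, one common workaround is to compute IS with a library that ports the TensorFlow Inception weights to PyTorch. Below is a minimal sketch assuming the torch-fidelity package; the sample directory path is hypothetical:

```python
# Hedged sketch: computing IS with torch-fidelity, whose Inception network
# is ported from the TensorFlow weights used in most papers.
import torch_fidelity

metrics = torch_fidelity.calculate_metrics(
    input1='generated_samples/',  # hypothetical folder of generated CIFAR10 images
    isc=True,                     # compute the Inception Score
    cuda=True,
)
print(metrics['inception_score_mean'], metrics['inception_score_std'])
```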

In other words, it depends on what exactly you are looking to replicate; the IS > 8 results we report in the paper were not obtained with DCGAN-polynomials, but with a custom polynomial architecture. I could share more details if you believe this would be helpful. Unfortunately, the corresponding experiments were done in Chainer, so we do not have the exact network in PyTorch.
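For reference, the core recursion behind a Π-net polynomial (the CCP model from the paper) can be sketched in a few lines. The layer widths, degree, and fully-connected form below are illustrative assumptions, not the authors' custom architecture:

```python
import torch
import torch.nn as nn

class CCPPolynomial(nn.Module):
    """Sketch of the Pi-nets CCP recursion:
    x_1 = U_1 z,  x_n = (U_n z) * x_{n-1} + x_{n-1},  out = C x_N.
    No activation functions are used; expressivity comes from the
    Hadamard products that raise the polynomial degree in z."""
    def __init__(self, z_dim=128, hidden=256, out_dim=3 * 32 * 32, degree=4):
        super().__init__()
        self.U = nn.ModuleList([nn.Linear(z_dim, hidden) for _ in range(degree)])
        self.C = nn.Linear(hidden, out_dim)

    def forward(self, z):
        x = self.U[0](z)
        for U_n in self.U[1:]:
            x = U_n(z) * x + x  # Hadamard product injects z at every degree
        return self.C(x)
```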

Let us know if something is unclear.

@grigorisg9gr Hi, are the poly-nets used in the ImageNet experiments free of any non-linear activations such as ReLU?

(edit) OK, I saw the residual blocks are "normalized" by tanh.

Hi,
which experiments are you referring to, classification or generation? In both cases, for the ImageNet experiments we modified standard architectures (e.g. StyleGAN in the generative case), so we kept their activation functions.

Hi, @grigorisg9gr. It's the classification experiments on ImageNet. Are tanh functions used in the ProdPoly ResNet? Specifically, in every residual block?

If you are referring to the version in T-PAMI (i.e. https://arxiv.org/pdf/2006.13026.pdf), yes, we do have a tanh in each block. It was added to stabilise the training.
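For intuition, here is a hedged sketch of what such a tanh-stabilised second-order residual block might look like; the convolution shapes and the exact tanh placement are assumptions, not the exact T-PAMI block:

```python
import torch
import torch.nn as nn

class PolyResidualBlock(nn.Module):
    """Sketch of a second-order residual block where a tanh bounds the
    polynomial term before the skip connection, for training stability.
    Illustrative only; the published ProdPoly block differs in detail."""
    def __init__(self, channels):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, 3, padding=1)
        self.conv2 = nn.Conv2d(channels, channels, 3, padding=1)

    def forward(self, x):
        h = self.conv1(x)
        h = h * self.conv2(x) + h   # second-order (Hadamard) interaction
        return torch.tanh(h) + x    # tanh keeps the polynomial term bounded
```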