Migrating from a Simple CNN Architecture to AdderNet

Question

alarst13 opened this issue a year ago · comments

Hi. I have a CNN architecture that I trained on CIFAR-10 with and without AdderNet. I reached over 80% accuracy without AdderNet, but with AdderNet it got stuck at 10% accuracy. Is there anything wrong with my implementation? All I did was replace nn.Conv2d with adder.adder2d. Isn't it supposed to work like this? How do you suggest I migrate from a simple CNN architecture to AdderNet? Thank you!

Ali Arastehfard · Answer 1 · Tue Dec 27 2022 08:20:41 GMT+0800 (China Standard Time)

During train_test, the accuracy stayed at 10% for 100 epochs.

Ali Arastehfard · Answer 2 · Thu Feb 09 2023 03:55:14 GMT+0800 (China Standard Time)

The problem was that I wasn't using batch normalization, which the paper instructs us to use.
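
For anyone hitting the same wall, here is a rough sketch of why the missing normalization matters. It assumes the adder filter response is the negative L1 distance between an input patch and the filter weights, as described in the AdderNet paper, so the raw outputs are never positive and a following ReLU zeroes all of them. The helper names (`adder_response`, `batch_normalize`) are made up for illustration and are not part of the AdderNet repository. The normalization here is a plain standardization without the learned scale and shift of a real batch-norm layer.

```python
import random
import statistics

def adder_response(patch, weights):
    """Hypothetical stand-in for one adder filter: negative L1 distance."""
    return -sum(abs(x - w) for x, w in zip(patch, weights))

def relu(v):
    return max(0.0, v)

def batch_normalize(values, eps=1e-5):
    """Standardize a batch to zero mean and unit variance (no learned parameters)."""
    mean = statistics.fmean(values)
    var = statistics.pvariance(values, mu=mean)
    return [(v - mean) / (var + eps) ** 0.5 for v in values]

rng = random.Random(0)
weights = [rng.uniform(-1, 1) for _ in range(27)]  # one 3x3x3 filter, flattened
batch = [[rng.uniform(-1, 1) for _ in range(27)] for _ in range(64)]

raw = [adder_response(p, weights) for p in batch]   # every value is <= 0
without_bn = [relu(v) for v in raw]                 # ReLU wipes out the whole batch
with_bn = [relu(v) for v in batch_normalize(raw)]   # re-centering lets some values through

print("all raw responses non-positive:", all(v <= 0 for v in raw))
print("active units without normalization:", sum(v > 0 for v in without_bn))
print("active units with normalization:", sum(v > 0 for v in with_bn))
```

In a PyTorch model, the likely equivalent is to put an nn.BatchNorm2d layer after each adder.adder2d layer and before the ReLU, rather than only swapping nn.Conv2d for adder.adder2d.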