huawei-noah / AdderNet

Code for the paper "AdderNet: Do We Really Need Multiplications in Deep Learning?"

Migrating from a Simple CNN Architecture to AdderNet

alarst13 opened this issue

Hi. I have a CNN architecture that I trained on CIFAR-10 with and without AdderNet. Without AdderNet I could reach an accuracy of over 80%, but with AdderNet it got stuck at 10%. Is there anything wrong with my implementation? All I did was replace nn.Conv2d with adder.adder2d. Isn't it supposed to work like this? How do you suggest I migrate from a simple CNN architecture to AdderNet? Thank you!

During train_test, the accuracy stayed at 10% for all 100 epochs.

The problem was that I wasn't using batch normalization, which the paper instructs us to use.
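For anyone hitting the same wall, here is a rough sketch of why the missing batch normalization can matter so much. As far as I understand the paper, an adder filter computes a negative L1 distance between the input patch and the weights, so its raw output is never positive, and a ReLU placed directly after it zeroes everything. The snippet uses plain Python instead of PyTorch so it runs without the repo; `adder_response`, `batch_norm`, and the random patches are hypothetical stand-ins for illustration, not the repo's `adder.adder2d` or `nn.BatchNorm2d`.

```python
import random
import statistics

def adder_response(patch, weights):
    """Adder-style filter response: negative L1 distance, so never positive."""
    return -sum(abs(x - w) for x, w in zip(patch, weights))

def relu(v):
    return max(0.0, v)

def batch_norm(values, eps=1e-5):
    """Standardize one channel over the batch (no learnable scale or shift)."""
    mean = statistics.fmean(values)
    var = statistics.pvariance(values)
    return [(v - mean) / (var + eps) ** 0.5 for v in values]

random.seed(0)
# Hypothetical data: one flattened 3x3 filter and a batch of 8 input patches.
weights = [random.uniform(-1, 1) for _ in range(9)]
batch = [[random.uniform(-1, 1) for _ in range(9)] for _ in range(8)]

raw = [adder_response(p, weights) for p in batch]

# Adder -> ReLU: every activation is clamped to zero.
without_bn = [relu(v) for v in raw]

# Adder -> BatchNorm -> ReLU: outputs are re-centered, so some stay positive.
with_bn = [relu(v) for v in batch_norm(raw)]

print("adder -> ReLU:      ", without_bn)
print("adder -> BN -> ReLU:", with_bn)
```

In a real model the equivalent change would presumably be to follow each adder layer with a batch normalization layer before the activation, rather than only swapping the convolution for the adder layer.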