fudan-zvg / SeaFormer

[ICLR 2023] SeaFormer: Squeeze-enhanced Axial Transformer for Mobile Semantic Segmentation

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Loss go NAN when running classification model of Seaformer_B

shiyutang opened this issue · comments

I run the following command as described in the readme, but the loss goes NAN. I wonder why this happens.
image

commented

Try to train the model with 8 gpus.

Thanks a lot, I must forget to change to 8 after test.

hi,I trained the model with 8 gpus, but also goes nan.
image
The --resume arg is not specified in the startup command, will this not affect it?

Thanks a lot, I must forget to change to 8 after test.

Have you solved this problem?

Yes, It needs to be trained with 8 gpus.

Yes, It needs to be trained with 8 gpus.

Thanks.