wangkiw / ICLR23-MEMO

The code repository for "A Model or 603 Exemplars: Towards Memory-Efficient Class-Incremental Learning" (ICLR'23) in PyTorch

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

About outputting as nan

zhangziyi1670 opened this issue · comments

I encountered a problem during the process of reproducing this code. In the second training stage, the output of the old model for new data was nan. I debugged the code and found that the distribution of the model's output data was significantly different, ultimately leading to numerical overflow. May I ask what tricks were used in your implementation process to avoid such issues.