caiyuanhao1998 / MST-plus-plus

"MST++: Multi-stage Spectral-wise Transformer for Efficient Spectral Reconstruction" (CVPRW 2022) & (Winner of NTIRE 2022 Spectral Recovery Challenge) and a toolbox for spectral reconstruction

Home Page:https://arxiv.org/abs/2204.07908

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Performance sensitive to Pytorch and CUDA environment.

darkbird1 opened this issue · comments

Hi, thanks for such an interesting work. I observed that the model performs well and good with pytorch 1.8 . However, when I try doing the same in latest pytorch 1.12 and CUDA 11.7, the same model starts overfitting on the training data, and test MRAE does not go down below 0.42. Similarily in pytorch 1.2, the training MRAE starts oscillating around 0.5 and does not change much. Does it mean that the performance of model is highly sensitive to the pytorch or cuda version??

Thanks for your appreciation. This is because our MST++ is implemented by torch 1.8. Some operations and functions change with the pytorch version. For instance, different versions use different methods to handle the singular value. Thus, it is better to train and test our models in our suggested environment.

Thanks for the response. From my own understanding, It seems that though there are some differences in the implementation aspects of different versions (after checking the updates in different versions), the corresponding effect on the performance is significant. It will be really interesting to theoretically investigate the sensitivity of different components to address the reproducibility on different platforms. Also, thanks for providing different baseline models' weights for comparison. I will do the similar analysis for other baselines in different environments

OK. Hope you find something interesting. Good luck.