check qadam NAN problem
wangraying opened this issue · comments
Rui Wang commented
qadam algorithm occasionally failed in CI using baguasys/bagua:master-pytorch-1.9.1-cuda11.1-cudnn8
image
Bagua Speeds up PyTorch
wangraying opened this issue · comments
qadam algorithm occasionally failed in CI using baguasys/bagua:master-pytorch-1.9.1-cuda11.1-cudnn8
image