MNIST training hangs in ApplyAdam kernel

Question

Aetf opened this issue 7 years ago · comments

This happens regardless of whether the executor is using the GPU.

Test passes

The executor blocks waiting for the kernel to finish. In the meantime, GPU utilization is zero.
The hang always occurs in the ApplyAdam operation.

Aetf · Answer 1 · Sat Aug 05 2017 04:38:53 GMT+0800 (China Standard Time)

The bug was introduced in f1b9331, found with git bisect.
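
The thread does not say how the bisect was actually run. As a rough sketch only, a hang like this one can be bisected automatically with `git bisect run` and a small wrapper that turns a timeout into a "bad" verdict. The script name `bisect_hang.py`, the timeout value, and the test command are all hypothetical, not taken from this issue.

```python
import subprocess
import sys

# Exit codes understood by `git bisect run`: 0 marks the commit good,
# 1 marks it bad.
GOOD, BAD = 0, 1


def classify(cmd, timeout_s):
    """Run cmd; treat a hang (timeout) or a nonzero exit as a bad commit."""
    try:
        result = subprocess.run(cmd, timeout=timeout_s)
    except subprocess.TimeoutExpired:
        # subprocess.run kills the child before raising, so nothing is left hanging.
        return BAD
    return GOOD if result.returncode == 0 else BAD


# Hypothetical invocation: python bisect_hang.py <timeout seconds> <test command...>
if __name__ == "__main__" and len(sys.argv) > 2:
    sys.exit(classify(sys.argv[2:], float(sys.argv[1])))
```

After `git bisect start` and marking one known-bad and one known-good commit, something like `git bisect run python bisect_hang.py 300 <test command>` would then walk the history unattended. Note that this sketch also counts build failures as bad; returning exit code 125 for those would tell git to skip the commit instead.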

Aetf · Answer 2 · Mon Aug 07 2017 05:56:07 GMT+0800 (China Standard Time)