CyberZHG / keras-self-attention

Attention mechanism for processing sequential data that considers the context for each timestamp.

Home Page: https://pypi.org/project/keras-self-attention/


The gradient is missing sometimes

SUNBERG010 opened this issue

Dear Zhao,
Recently I tried your self-attention layer in my work, and I really appreciate it.

However, training sometimes gets stuck right from the beginning: the loss stops decreasing, as if the gradient were vanishing. I have tried a few workarounds, but none of them is stable. Could you please give some suggestions?
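For reference, here is a minimal sketch of the kind of workarounds I tried (a sigmoid attention activation, as shown in the README, plus gradient clipping). The model, data shapes, and hyperparameters below are purely illustrative, not my actual setup:

```python
# Illustrative sketch only, not my real model. It shows the two
# workarounds I experimented with: a sigmoid activation on the
# attention scores and gradient clipping in the optimizer.
import os
os.environ['TF_KERAS'] = '1'  # make keras-self-attention use tensorflow.keras

from tensorflow import keras
from keras_self_attention import SeqSelfAttention

model = keras.models.Sequential([
    keras.layers.Embedding(input_dim=10000, output_dim=128),
    keras.layers.Bidirectional(keras.layers.LSTM(64, return_sequences=True)),
    # Sigmoid instead of the default activation on the attention scores.
    SeqSelfAttention(attention_activation='sigmoid'),
    keras.layers.GlobalMaxPooling1D(),
    keras.layers.Dense(1, activation='sigmoid'),
])

# Clip gradient norms so early updates cannot explode or stall training.
model.compile(
    optimizer=keras.optimizers.Adam(learning_rate=1e-3, clipnorm=1.0),
    loss='binary_crossentropy',
    metrics=['accuracy'],
)
model.summary()
```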

Best wishes,
Sunberg

Same issue here, please help.

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.