Is this assertion for mask wrong?
yinfangchen opened this issue
Yinfang Chen commented
I got "AssertionError: Mask is silently ignored due to the use of a custom kernel" when training GPT-2 with examples/pretrain_gpt.sh.
This line leads to the assertion error:
Is this assertion necessary? And is it even correct?
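For context, one plausible reading of the message is that the fused kernel applies its own causal (upper-triangular) mask, so any mask passed in by the caller would never be read. The sketch below is a pure-Python illustration of that pattern, not the repository's code: `causal_softmax` and its arguments are hypothetical names, and it only shows why a function of this kind might refuse an explicit mask instead of dropping it quietly.

```python
import math


def causal_softmax(scores, mask=None):
    """Hypothetical stand-in for a fused causal-softmax kernel (illustration only)."""
    # The function builds the causal mask itself, so a caller-supplied mask
    # would never be used. Failing loudly avoids silently ignoring it.
    assert mask is None, "Mask is silently ignored due to the use of a custom kernel"
    out = []
    for i, row in enumerate(scores):
        visible = row[: i + 1]  # position i may attend to positions 0..i only
        peak = max(visible)  # subtract the max for numerical stability
        exps = [math.exp(x - peak) for x in visible]
        total = sum(exps)
        # Masked (future) positions get probability 0.
        out.append([e / total for e in exps] + [0.0] * (len(row) - i - 1))
    return out


scores = [[0.0, 1.0, 2.0], [0.0, 0.0, 5.0], [1.0, 1.0, 1.0]]
probs = causal_softmax(scores)
```

Under this reading the assertion is a guard rather than a bug: calling `causal_softmax(scores, mask=some_mask)` raises the same `AssertionError`, and the caller would be expected to pass no mask when the causal path is taken. Whether that matches the intent of the actual line is the open question here.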
LordEdison commented
I'm puzzled by the same thing.