THUDM / GLM

GLM (General Language Model)

Is the attention_mask dimension wrong?

zhangyipin opened this issue and commented:

attention_mask[self.max_src_length:, self.max_src_length:] = np.tril(

The attention mask produced here should have 3 dimensions, but it has only 2. See the screenshot:
[image]
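
To make the shape mismatch concrete, here is a minimal NumPy sketch. It is not the GLM code: the function name `build_attention_mask`, the sizes, the argument passed to `np.tril`, and the rule that every position may attend to all source tokens are assumptions made for illustration, and the truncated line quoted above is not reconstructed. It only shows that filling a square mask this way yields a 2-D array, and that a leading axis has to be added explicitly to get a 3-D one.

```python
import numpy as np


def build_attention_mask(max_src_length, max_tgt_length):
    """Hypothetical helper: returns a 2-D (seq_len, seq_len) mask."""
    total = max_src_length + max_tgt_length
    mask = np.zeros((total, total), dtype=np.int64)
    # Assumption: every position may attend to all source tokens.
    mask[:, :max_src_length] = 1
    # Target block is lower-triangular (causal), as suggested by np.tril.
    mask[max_src_length:, max_src_length:] = np.tril(
        np.ones((max_tgt_length, max_tgt_length), dtype=np.int64)
    )
    return mask


mask_2d = build_attention_mask(max_src_length=3, max_tgt_length=2)
# Add a leading axis so the mask becomes (1, seq_len, seq_len).
mask_3d = mask_2d[None, :, :]

print(mask_2d.shape)  # (5, 5)
print(mask_3d.shape)  # (1, 5, 5)
```

Whether the extra axis belongs in the mask-building code or in the caller depends on what the model expects, which this sketch does not settle.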

Thank you for pointing that out. Already fixed.