XLNet: Generalized Autoregressive Pretraining for Language Understanding
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool
guotong1988 opened this issue 4 years ago · comments
I used to guess reuse_len and mem_len should be same.
reuse_len
mem_len
#59