Transformer-XL with byte-pair encoding (BPE) on large datasets. Adapted from https://github.com/kimiyoung/transformer-xl
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool