Try to reproduce the transformer model described in the "Attention is all you need" paper using tensor2tensor
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool