Implementing SYNTHESIZER: Rethinking Self-Attention in Transformer Models using Pytorch
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool
CheerM opened this issue 3 years ago · comments
Hi, just wondering how is the reimplemented performance for dense/random/hybrid synthesizers?