leaderj1001 / Synthesizer-Rethinking-Self-Attention-Transformer-Models

Implementing SYNTHESIZER: Rethinking Self-Attention in Transformer Models using PyTorch

How is the reproduced performance?

CheerM opened this issue

Hi, just wondering how well the reimplemented dense/random/hybrid synthesizers perform?