The code for the Subformer, from the EMNLP 2021 Findings paper: "Subformer: Exploring Weight Sharing for Parameter Efficiency in Generative Transformers", by Machel Reid, Edison Marrese-Taylor, and Yutaka Matsuo
Home Page:https://arxiv.org/abs/2101.00234
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool