machelreid / subformer

The code for the Subformer, from the EMNLP 2021 Findings paper "Subformer: Exploring Weight Sharing for Parameter Efficiency in Generative Transformers" by Machel Reid, Edison Marrese-Taylor, and Yutaka Matsuo.

Home Page: https://arxiv.org/abs/2101.00234
