ofirpress / sandwich_transformer

This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer Models by Reordering their Sublayers.

Home Page:https://ofir.io/sandwich_transformer.pdf

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

ofirpress/sandwich_transformer Issues