heartcored98 / transformer_anatomy

Official PyTorch implementation of "Roles and Utilization of Attention Heads in Transformer-based Neural Language Models" (ACL 2020)
