Reproducing the Linear Multihead Attention introduced in Linformer paper (Linformer: Self-Attention with Linear Complexity)
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool