As it says on the tin, this repo has a simple implementation of a transformer model, with some borrowed efficiency improvements. The purpose is mainly pedagogical.
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool