Inefficiencies
aryamanarora opened this issue · comments
GPT-2 medium is extremely slow on M2 Macbook Air. Why?
GPT-2 medium runs decently on my machine now, but probably could be made more efficient. induction.py
uses some tricks with caching part of the forward pass to speed up interchanges.