aryamanarora / nano-causal-interventions

Extremely simple implementation of path patching (aka causal scrubbing) in PyTorch.

Inefficiencies

aryamanarora opened this issue

GPT-2 medium is extremely slow on an M2 MacBook Air. Why?

GPT-2 medium now runs decently on my machine, but it could probably be made more efficient. induction.py uses some caching tricks, storing part of the forward pass, to speed up interchanges.
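
For anyone reading along, here is a rough sketch of what caching part of the forward pass can look like for interchange interventions. It is not the code in induction.py: the toy layer stack and the names `run_layers`, `interchange_cached`, and `interchange_naive` are all made up for illustration. The idea is that everything upstream of the intervention site is the same for every interchange, so it only needs to be computed once, and each interchange then reruns only the layers after the site.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Toy stand-in for a transformer: a plain stack of layers (hypothetical, not GPT-2).
layers = nn.ModuleList(
    [nn.Sequential(nn.Linear(8, 8), nn.Tanh()) for _ in range(4)]
)
SITE = 2  # intervene on the activation that enters layers[SITE]


@torch.no_grad()
def run_layers(x, start, end):
    """Apply layers[start:end] to the activation x."""
    for layer in layers[start:end]:
        x = layer(x)
    return x


base = torch.randn(1, 8)     # one base input
sources = torch.randn(5, 8)  # five source inputs to interchange from

# Cached once: activations at the intervention site for base and sources.
base_act = run_layers(base, 0, SITE)
source_acts = run_layers(sources, 0, SITE)


@torch.no_grad()
def interchange_cached(dims):
    """Patch `dims` from each source into the base, reusing cached prefixes."""
    patched = base_act.expand(source_acts.shape[0], -1).clone()
    patched[:, dims] = source_acts[:, dims]
    # Only the layers after the intervention site are recomputed.
    return run_layers(patched, SITE, len(layers))


@torch.no_grad()
def interchange_naive(dims):
    """Same intervention, but recomputing the prefix on every call."""
    b = run_layers(base, 0, SITE)
    s = run_layers(sources, 0, SITE)
    patched = b.expand(s.shape[0], -1).clone()
    patched[:, dims] = s[:, dims]
    return run_layers(patched, SITE, len(layers))


out = interchange_cached([0, 1, 2])
```

Both functions should give the same outputs; the cached one just skips the first `SITE` layers on every call after the first. This only holds when nothing upstream of the intervention site changes between interchanges, so interventions at several depths would need one cached activation per site.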