WhiteBox Hackathon: Replicating We Found An Neuron and Extending to Subject-Verb Agreement
This repository contains experiments that replicate "We Found An Neuron" and then test whether its methodology also applies to other tasks, such as subject-verb agreement (e.g., predicting whether the next word should be "has" or "have").
Please see the presentation for more details.
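
As a rough illustration of the subject-verb agreement task described above, the sketch below builds a few prompts and scores them with a logit difference between the agreeing and non-agreeing verb form. This is a minimal sketch under stated assumptions, not the code used in this repository: the names `AgreementPrompt`, `make_prompts`, and `logit_diff` are hypothetical, the example subjects are made up, and the logits are toy numbers standing in for a language model's next-token scores.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AgreementPrompt:
    """Hypothetical container for one subject-verb agreement example."""
    text: str       # prompt that ends right before the verb
    correct: str    # verb form that agrees with the subject
    incorrect: str  # verb form that does not agree


def make_prompts(subjects):
    """Build one prompt per (subject, is_plural) pair."""
    prompts = []
    for subject, is_plural in subjects:
        if is_plural:
            correct, incorrect = " have", " has"
        else:
            correct, incorrect = " has", " have"
        prompts.append(AgreementPrompt(f"The {subject}", correct, incorrect))
    return prompts


def logit_diff(logits, prompt):
    """Score of the correct verb minus score of the incorrect verb.

    `logits` maps candidate next tokens to scores. In a real experiment
    these would be read from the model's output at the final position;
    here they are supplied by hand (an assumption for illustration).
    """
    return logits[prompt.correct] - logits[prompt.incorrect]


# Illustrative subjects, not taken from the repository's data.
prompts = make_prompts([("dog", False), ("dogs", True)])

# Made-up scores that favor " has" regardless of the subject.
toy_logits = {" has": 2.0, " have": 0.5}
diffs = [logit_diff(toy_logits, p) for p in prompts]
```

A positive difference means the scores favor the agreeing verb form; with the toy scores above, the singular prompt scores positive and the plural prompt scores negative.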