fdalvi / NeuroX

A Python library that encapsulates various methods for neuron interpretation and analysis in Deep NLP models.

Home Page: https://neurox.qcri.org


Neuron Ablation in Model M

Andrea-de-Varda opened this issue · comments

My apologies for bothering you again. In your paper "What Is One Grain of Sand in the Desert? Analyzing Individual Neurons in Deep NLP Models" you describe the results obtained by ablating N% of the neurons in the model and evaluating its performance (BLEU, perplexity) without a probe. However, I cannot find any documentation related to that part. Are you planning to share it?

Thank you!

Thanks for reaching out! Do you mean the code for ablating in the machine translation model (OpenNMT-py) itself?

Yes. I actually wanted to do that with BERT, but that would be a good place to start.

The ablation experiments in the paper were specifically run using the code in https://github.com/dabbler0/seq2seq-attn-modify as far as I recall, but that code targets a (now) very old Lua version of seq2seq. We are working on adding the ability to ablate/set neurons in transformer models, but unfortunately haven't had the resources to finish the implementation yet.
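
To give a rough idea of what this could look like, here is a minimal sketch of one way to zero out ("ablate") selected neurons in a Hugging Face BERT model using a PyTorch forward hook. The layer index and neuron IDs below are arbitrary examples, and this is only an illustration of the general technique, not a finalized NeuroX API:

```python
# Illustrative sketch: ablating specific neurons in a transformer layer
# via a PyTorch forward hook. Layer index and neuron IDs are assumptions.
import torch
from transformers import BertModel, BertTokenizer

model = BertModel.from_pretrained("bert-base-uncased")
tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model.eval()

# Suppose we want to ablate neurons 10, 42, and 300 of the hidden states
# produced by encoder layer 5 (0-indexed).
layer_idx = 5
neurons_to_ablate = [10, 42, 300]

def ablation_hook(module, inputs, output):
    # A BertLayer returns a tuple; the first element is the hidden states
    # of shape (batch, seq_len, hidden_size).
    hidden = output[0]
    hidden[:, :, neurons_to_ablate] = 0.0  # zero out the chosen neurons
    return (hidden,) + output[1:]

handle = model.encoder.layer[layer_idx].register_forward_hook(ablation_hook)

inputs = tokenizer("Neuron ablation in transformers.", return_tensors="pt")
with torch.no_grad():
    ablated_output = model(**inputs)

handle.remove()  # remove the hook to restore normal behavior
```

The same pattern can be used to set neurons to a fixed value (e.g. their mean activation) instead of zero, and the ablated model can then be evaluated downstream (BLEU, perplexity, etc.) as in the paper.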

Is this something you might be potentially interested in implementing? We are happy to provide support and help out!

I would really love to help, but unfortunately I don't think I am capable of implementing something like that. I am very sorry!

No worries, I understand. Keep an eye on the CHANGELOG (https://github.com/fdalvi/NeuroX/blob/master/CHANGELOG.md) for when we add it (hopefully soon in the coming months!)