
Reproducing and Modularizing The Building Blocks of Interpretability

A quarter-long project for Northwestern University's Deep Learning for Practitioners course. We attempt to reproduce the paper The Building Blocks of Interpretability (Olah et al., 2018) with the aim of making the original authors' code usable, modular, and extensible.

You can view our Tips and Tricks for reproducing the work of Olah et al., as well as our Reproducibility Report for the paper.

Team: Nayan Mehta, KJ Schmidt, Aristana Scourtas, Neelanshi Varia

Paper: The Building Blocks of Interpretability

Setup

Lucid, the primary visualization library, does not support TensorFlow 2 and targets Python 3.6.

This repo uses Lucid 0.3.8, TensorFlow 1.15, and Python 3.7, run on macOS Mojave. All instructions are for macOS.

All dependencies can easily be installed using pipenv; if you do not have pipenv installed, install it via:

pip install pipenv

To install all dependencies, run

pipenv install

If you pull an updated Pipfile or Pipfile.lock, be sure to run pipenv install to update all dependencies accordingly. The Pipfile should lock automatically whenever you run the install command.
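
To confirm that the environment resolved to the pinned versions, you can run a quick check inside the environment. This is a minimal sketch; the file name check_env.py is just a suggestion, and the expected values assume the versions listed above:

    # check_env.py -- run with `pipenv run python check_env.py`
    import sys

    import pkg_resources
    import tensorflow as tf

    print("Python:", sys.version.split()[0])                          # expect 3.7.x
    print("TensorFlow:", tf.__version__)                              # expect 1.15.x
    print("Lucid:", pkg_resources.get_distribution("lucid").version)  # expect 0.3.8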

Troubleshooting

If you receive an error about the tbb package while locking, run pipenv lock --clear to clear the cache; the lock should then succeed. Once you see a "Success" message, your environment is good to go.

If you receive an "SSL: CERTIFICATE_VERIFY_FAILED" error upon execution, run the following to install the certifi package:

/Applications/Python\ 3.7/Install\ Certificates.command

This is a known issue with Python 3.6 on macOS.

NOTE: This repo currently uses Python 3.7 because Homebrew does not have a formula for Python 3.6 above 3.6.5; if this poses an issue with Lucid, we may downgrade to Python 3.6.
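
Once the environment is set up, a quick way to verify that Lucid, TensorFlow 1.x, and the pretrained GoogLeNet graph all load correctly is to render a single feature visualization. This is a minimal sketch based on Lucid's quickstart; the layer/channel name (mixed4a_pre_relu:476) is only an illustrative example, not specific to our pipeline:

    # Minimal Lucid sanity check (sketch based on the Lucid 0.3.x quickstart).
    import lucid.modelzoo.vision_models as models
    import lucid.optvis.render as render

    # Load the pretrained GoogLeNet (Inception v1) graph used in the paper.
    model = models.InceptionV1()
    model.load_graphdef()

    # Optimize an input image to excite one channel of one layer.
    images = render.render_vis(model, "mixed4a_pre_relu:476", verbose=False)
    print(images[0].shape)  # numpy array of rendered visualization(s)

Note that load_graphdef downloads the frozen graph over HTTPS, so if you hit the SSL certificate error described above, apply the certificate fix first.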

Paper info

Title: The Building Blocks of Interpretability
Link: https://distill.pub/2018/building-blocks/
Author: Chris Olah et al. (Google Brain)
Author contact: christopherolah.co@gmail.com
Code repo: https://github.com/distillpub/post--building-blocks

Literature review

Most deep learning papers treat their models as black boxes with respect to interpretability, decision making, bias, and related concerns [1]. The models produce strong results, but the papers are rarely backed by an explanation of the model's reasoning behind those results/decisions and mainly discuss the mathematical derivations that led to the architecture. Apart from a few papers that focus primarily on interpretability, it is usually difficult to find a discussion of interpretability embedded in papers [4]. We also found a book on interpretable machine learning that explains its importance, methods, and other aspects in detail [10]. The following are a few papers in the area:

Understanding Neural Networks Through Deep Visualization [2] This paper introduces two visualization tools for interpreting neural nets: the first visualizes the activations produced at each layer of a trained convolutional network as it processes an image or video, and the second visualizes the features learned by each layer of a DNN via regularized optimization.

Using Artificial Intelligence to Augment Human Intelligence [3] With the advancement of interactive visualization environments, many dynamic explanations have been emerging. This article explains how GANs work and how their intermediate steps can be interpreted for creative applications. The authors frame this under a larger umbrella of questions: what computers are for, and how this relates to intelligence augmentation.

Other related papers we found quite interesting:

  • An Evaluation of the Human-Interpretability of Explanation [5]
  • “Why Should I Trust You?” Explaining the Predictions of Any Classifier [6]
  • Visualising Deep Neural Network Decisions: Prediction Difference Analysis [7]
  • Human-in-the-Loop Interpretability Prior [8]
  • Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps [9]

Here is the original paper for GoogLeNet (aka Inception v1), which is the model used in our selected paper:

Going Deeper with Convolutions [11] Describes the implementation of a CNN trained on ImageNet that was, at the time (2015), considered state of the art for object detection and classification. "The main hallmark of this architecture is the improved utilization of the computing resources inside the network." Since this paper, there have been two subsequent improved implementations of GoogLeNet: Inception v2 and Inception v3.

Codebase search

Other useful resources

Reproducibility review

We include here the statistically significant features that corresponded to paper reproducibility from (Raff, 2019).

  • Rigor vs Empirical: empirical (which is the more reproducible outcome)
  • Readability: “Excellent.” We feel that we will be able to reproduce the code in a single read.
  • Algorithm Difficulty: Medium
  • Pseudo Code: Not present in the article, code contains comments but no explicit pseudocode. Absence of pseudocode was associated with reproducible outcomes.
  • Primary Topic: Interpretability of Neural Nets; was very easy to identify
  • Hyperparameters Specified: N/A
  • Compute Needed: Runs easily on Google Colab, but does require a GPU
  • Authors Reply: Not checked yet, but they encourage opening issues on GitHub, which suggests they are likely to reply and address issues. They have also replied to previous reviews.
  • Number Equations: 0 (in the article), which corresponds to better reproducibility
  • Number Tables: 4, although we’re defining table loosely here as most of the “tables” are interactive visualizations. However, we think this meets the spirit of the feature.

Other reproducibility notes

Overall reproducibility

The paper seems highly reproducible given the criteria set forth by (Raff, 2019). Of note, we would need GPU compute power to run this, but we can do so in Colab. Rewriting the code into Python modules would take some work and create the need for a new compute resource, but this is manageable. Pretrained versions of GoogLeNet and its subsequent improvements (Inception v2 and v3) are available via PyTorch for ease of use. The paper is very easy to read and understand, and the visualizations provided are well documented and very clear.
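
As a sketch of what loading those pretrained weights might look like, the snippet below uses torchvision's model zoo, which exposes GoogLeNet and Inception v3; the function names and availability depend on your torchvision version, and this is not part of the original authors' code:

    # Sketch: loading pretrained Inception-family models via torchvision.
    import torchvision.models as models

    googlenet = models.googlenet(pretrained=True)          # GoogLeNet / Inception v1
    inception_v3 = models.inception_v3(pretrained=True)    # Inception v3

    # Switch to inference mode before extracting activations.
    googlenet.eval()
    inception_v3.eval()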

Timeline

Week 3 (4/20-4/26):

Pick project paper
Project proposal writeup

Week 4 (4/27-5/3):

Read the paper completely and get a general overview
Note down the unclear major concepts and discuss

Week 5 (5/4-5/10):

Figure out which notebooks are dependent on which
Set up project (e.g. create environment, establish project structure, etc.)
Each person starts converting one notebook to Python modules

Week 6 (5/11-5/17):

Finish converting notebooks from last week to modules
Hook up all components to create end-to-end pipeline, with a simple script for running

Week 7 (5/18-5/24):

Improve abstraction/modularity and run script from Week 6
Make the code model-agnostic, try it with different models, record results

Week 8 (5/25-5/31):

Handle overflow work from earlier weeks, fix bugs, etc.
Start web-component visualizations via Svelte (may be tackled in earlier weeks as well)

Week 9 (6/1-6/7):

Continue web-component visualizations via Svelte
Wrap up

Finals Week (6/8-6/13):

Compare and report results
Conclusion

References

[1] Yu, R., & Alì, G. (2019). What's Inside the Black Box? AI Challenges for Lawyers and Researchers. Legal Information Management, 19(1), 2-13. doi:10.1017/S1472669619000021
[2] http://yosinski.com/media/papers/Yosinski__2015__ICML_DL__Understanding_Neural_Networks_Through_Deep_Visualization__.pdf
[3] https://distill.pub/2017/aia/
[4] https://arxiv.org/pdf/1710.04806.pdf
[5] https://arxiv.org/pdf/1902.00006.pdf
[6] https://arxiv.org/pdf/1602.04938.pdf
[7] https://arxiv.org/pdf/1702.04595.pdf
[8] https://arxiv.org/pdf/1805.11571.pdf
[9] https://arxiv.org/pdf/1312.6034.pdf
[10] https://christophm.github.io/interpretable-ml-book/
[11] https://arxiv.org/pdf/1409.4842.pdf
