wangyongjie-ntu / awesome-counterfactual-explanations

awesome counterfactual explanations

Repository from Github https://github.comwangyongjie-ntu/awesome-counterfactual-explanationsRepository from Github https://github.comwangyongjie-ntu/awesome-counterfactual-explanations

Counterfactual Explanation (Actionable Recourse)

Counterfactual explanations mainly target to find the mimimum perturbation which changes the original prediction(Ususlly from an undesirable prediction to ideal one). The perturbation itself is a valid instance following the real data distribution as the training samples. It has broad applications, E.g., finance, education, health care ect. Specifically, what should I do to get the credit card approved if I received the rejection. This task can be viewed as extracting knowledge/solutions from the black-box models. It belongs to the instance-level explanation. Quite interesting to dive deeper!!!

The two use cases of counterfactual explanations:

counterfactual explanations

Survey papers

Counterfactuals and Causability in Explainable Artificial Intelligence: Theory, Algorithms, and Applications, Arxiv preprint 2021

A Survey of Contrastive and Counterfactual Explanation Generation Methods for Explainable Artificial Intelligence, IEEE Access 2021

Good Counterfactuals and Where to Find Them: A Case-Based Technique for Generating Counterfactuals for Explainable AI (XAI), Arxiv preprint 2020

Counterfactual Explanations for Machine Learning: A Review, Arxiv preprint 2020

A survey of algorithmic recourse: definitions, formulations, solutions, and prospects, Arxiv preprint 2020

On the computation of counterfactual explanations -- A survey, Arxiv preprint 2019

Issues with post-hoc counterfactual explanations: a discussion, ICML Workshop 2019

Counterfactuals in Explainable Artificial Intelligence (XAI): Evidence from Human Reasoning, IJCAI 2019

Papers

A Few Good Counterfactuals: Generating Interpretable, Plausible & Diverse Counterfactual Explanations, Arxiv preprint 2021

GeCo: Quality Counterfactual Explanations in Real Time, Arxiv preprint 2021

Generating Interpretable Counterfactual Explanations By Implicit Minimisation of Epistemic and Aleatoric Uncertainties, AISTATS 2021, code

FIMAP: Feature Importance by Minimal Adversarial Perturbation, AAAI 2021

Generate Your Counterfactuals: Towards Controlled Counterfactual Generation for Text, AAAI 2021

Counterfactual Fairness with Disentangled Causal Effect Variational Autoencoder, AAAI 2021

Counterfactual Explanations for Oblique Decision Trees: Exact, Efficient Algorithms, AAAI 2021

Ordered Counterfactual Explanation by Mixed-Integer Linear Optimization, AAAI 2021

Interpretability Through Invertibility: A Deep Convolutional Network With Ideal Counterfactuals And Isosurfaces

Counterfactual Generative Networks, ICLR 2021, code

Learning "What-if" Explanations for Sequential Decision-Making, ICLR 2021

GRACE: Generating Concise and Informative Contrastive Sample to Explain Neural Network Model’s Prediction, KDD 2020 code

An ASP-Based Approach to Counterfactual Explanations for Classification, RuleML + PR 2020

DECE: Decision Explorer with Counterfactual Explanations for Machine Learning Models, TVCG 2020

Good Counterfactuals and Where to Find Them: A Case-Based Technique for Generating Counterfactuals for Explainable AI (XAI), ICCBR 2020

Learning Global Transparent Models from Local Contrastive Explanations, NeurIPS 2020

Decisions, Counterfactual Explanations and Strategic Behavior, NeurIPS 2020

Algorithmic recourse under imperfect causal knowledge a probabilistic approach, NeurIPS 2020

Beyond Individualized Recourse: Interpretable and Interactive Summaries of Actionable Recourses, NeurIPS 2020

Learning What Makes a Difference from Counterfactual Examples and Gradient Supervision, ECCV 2020

Counterfactual Vision-and-Language Navigation via Adversarial Path Sampler, ECCV 2020

CERTIFAI: Counterfactual Explanations for Robustness, Transparency, Interpretability, and Fairness of Artificial Intelligence models, AAAI/AIES 2020

FACE: Feasible and Actionable Counterfactual Explanation, AAAI/AIES 2020

Counterfactual Explanations & Adversarial Examples, Arxiv preprint 2020

Instance-Based Counterfactual Explanations for Time Series Classification, Arxiv preprint 2020

On Relating 'Why?' and 'Why Not?' Explanations, Arxiv preprint 2020

Model extraction from counterfactual explanations, Arxiv preprint 2020

Efficient computation of counterfactual explanations of LVQ models, Arxiv preprint 2020

Plausible Counterfactuals: Auditing Deep Learning Classifiers with Realistic Adversarial Examples, Arxiv preprint 2020

ViCE: Visual Counterfactual Explanations for Machine Learning Models, IUI 2020

Model extraction from counterfactual explanations, Arxiv 2020

On the Fairness of Causal Algorithmic Recourse, Arxiv preprint 2020

Scaling Guarantees for Nearest Counterfactual Explanations, Arxiv preprint 2020

PermuteAttack: Counterfactual Explanation of Machine Learning Credit Scorecards, Arxiv preprint 2020

Decisions, Counterfactual Explanations and Strategic Behavior, Arxiv preprint 2020

FOCUS: Flexible Optimizable Counterfactual Explanations for Tree Ensembles, Arxiv preprint 2020

Interpretable and Interactive Summaries of Actionable Recourses, Arxiv preprint 2020

Counterfactual Explanation Based on Gradual Construction for Deep Networks, Arxiv preprint 2020

CounteRGAN: Generating Realistic Counterfactuals with Residual Generative Adversarial Nets, Arxiv preprint 2020

EXPLAINABLE IMAGE CLASSIFICATION WITH EVIDENCE COUNTERFACTUAL, Arxiv preprint 2020

Model-Agnostic Counterfactual Explanations for Consequential Decisions, AISTATS 2020

Explaining Data-Driven Decisions made by AI Systems: The Counterfactual Approach, Arxiv preprint 2020

On the Fairness of Causal Algorithmic Recourse, Arxiv preprint 2020

On Counterfactual Explanations under Predictive Multiplicity, UAI 2020

EXPLANATION BY PROGRESSIVE EXAGGERATION, ICLR 2020

Algorithmic Recourse: from Counterfactual Explanations to Interventions, Arxiv preprint 2020

Learning Model-Agnostic Counterfactual Explanations for Tabular Data, ACM WWW 2020, code

The hidden assumptions behind counterfactual explanations and principal reasons, ACM Facct 2020

Explaining Machine Learning Classifiers through Diverse Counterfactual Explanations, ACM Facct 2020

The philosophical basis of algorithmic recourse, ACM Facct 2020

Why Does My Model Fail? Contrastive Local Explanations for Retail Forecasting, ACM Facct 2020

Convex Density Constraints for Computing Plausible Counterfactual Explanations, ICANN 2020

Fast Real-time Counterfactual Explanations, ICML 2020 Workshop

CRUDS: Counterfactual Recourse Using Disentangled Subspaces, ICML 2020 Workshop

DACE: Distribution-Aware Counterfactual Explanation by Mixed-Integer Linear Optimization, IJCAI 2020

Relation-Based Counterfactual Explanations for Bayesian Network Classifiers, IJCAI 2020

PRINCE: Provider-side Interpretability with Counterfactual Explanations in Recommender Systems, WSDM 2020

CoCoX: Generating Conceptual and Counterfactual Explanations via Fault-Lines, AAAI 2020

SCOUT: Self-aware Discriminant Counterfactual Explanations, CVPR 2020, code

Multi-Objective Counterfactual Explanations, PPSN 2020

EMAP: Explanation by Minimal Adversarial Perturbation, AAAI 2020

Random forest explainability using counterfactual sets, Information Fusion 2020

The Dangers of Post-hoc Interpretability: Unjustified Counterfactual Explanations, IJCAI 2019

Multimodal Explanations by Predicting Counterfactuality in Videos, CVPR 2019

Unjustified Classification Regions and Counterfactual Explanations In Machine Learning, ECML-PDKK 2019

Factual and Counterfactual Explanations for Black Box Decision Making, IEEE Intelligent Systems 2019

Model Agnostic Contrastive Explanations for Structured Data, Arxiv preprint 2019

Counterfactuals uncover the modular structure of deep generative models, Arxiv preprint 2019

Actionable Interpretability through Optimizable Counterfactual Explanations for Tree Ensembles, arxiv preprint 2019

Towards Realistic Individual Recourse and Actionable Explanations in Black-Box Decision Making Systems, Arxiv preprint 2019

Generative Counterfactual Introspection for Explainable Deep Learning, Arxiv preprint 2019

Generating Counterfactual and Contrastive Explanations using SHAP, Arxiv preprint 2019

Interpretable Counterfactual Explanations Guided by Prototypes, Arxiv preprint 2019

Counterfactual Visual Explanations, ICML 2019

Counterfactual Explanation Algorithms for Behavioral and Textual Data, Arxiv preprint 2019

Synthesizing Action Sequences for Modifying Model Decisions, Arxiv preprint 2019

Measurable Counterfactual Local Explanations for Any Classifier, Arxiv preprint 2019

Generating Contrastive Explanations with Monotonic Attribute Functions, arxiv preprint 2019

The What-If Tool: Interactive Probing of Machine Learning Models, TVCG 2019

Preserving Causal Constraints in Counterfactual Explanations for Machine Learning Classifiers, NeurIPS Workshop 2019

Actionable Recourse in Linear Classification, ACM Facct 2019

EQUALIZING RECOURSE ACROSS GROUPS, Arxiv preprint 2019

Efficient Search for Diverse Coherent Explanations, ACM Facct 2019

Counterfactual explanations of machine learning predictions: opportunities and challenges for AI safety, SafeAI@AAAI 2019

Explaining image classifiers by counterfactual generation, ICLR 2019

Counterfactual Explanations without Opening the Black Box: Automated Decisions and the GDPR, Hardvard Journal of Law & Technology 2018 (strong recommend)

Explanations based on the Missing: Towards Contrastive Explanations with Pertinent Negatives, NIPS 2018, code

Interpreting Neural Network Judgments via Minimal, Stable, and Symbolic Corrections, NIPS 2018

Interpretable Credit Application Predictions With Counterfactual Explanations, Arxiv preprint 2018

Comparison-based Inverse Classification for Interpretability in Machine Learning, IPMU 2018

Generating Counterfactual Explanations with Natural Language, ICML 2018 Workshop

Grounding Visual Explanations, ECCV 2018

Interpreting Neural Network Judgments via Minimal, Stable, and Symbolic Corrections, NeurIPS 2018

Inverse Classification for Comparison-based Interpretability in Machine Learning, Arxiv 2017

Interpretable Predictions of Tree-based Ensembles via Actionable Feature Tweaking, KDD 2017

When Worlds Collide: Integrating Different Counterfactual Assumptions in Fairness, NeurIPS 2017

A budget-constrained inverse classification framework for smooth classifiers, ICDMW 2017

Generalized Inverse Classification, SIAM 2017

Explaining Data-Driven Document Classifications, MIS Quarterly 2014

The Inverse Classification Problem, Journal of Computer Science and Technology 2010

The cost-minimizing inverse classification problem: a genetic algorithm approach, Decision Support Systems 2000

Github Repos

Alibi: https://github.com/SeldonIO/alibi

actionable-recourse: https://github.com/ustunb/actionable-recourse, Scikit-Learn

CEML: https://github.com/andreArtelt/ceml, Pytorch, Keras, Tensorflow, Scikit-Learn

Dice: https://github.com/interpretml/DiCE.git, Pytorch, TensorFlow

ContrastiveExplanation: https://github.com/MarcelRobeer/ContrastiveExplanation, scikit-learn,

cf-feasibility: https://github.com/divyat09/cf-feasibility, Pytorch, Tensorflow, Scikit-Learn,

Mace: https://github.com/amirhk/mace, Scikit-Learn,

Strategic-Decisions: https://github.com/Networks-Learning/strategic-decisions, Scikit-learn

Contrastive-Explanation-Method: https://github.com/IBM/Contrastive-Explanation-Method, Tensorflow

About

awesome counterfactual explanations

License:Apache License 2.0