
Training Data Influence Analysis & Estimation Resources


This repository collects resources related to training-set influence analysis. For a full taxonomy and comparison of influence analysis methods and estimators, please see our survey paper.

We will update this repository regularly as new research is published. To have your paper included, send me an email at zayd.hammoudeh@gmail.com, submit a pull request, or fill out this Google form.

What is Training Data Influence Analysis?

Influence analysis studies how to apportion credit (and blame) for specific model behavior to individual training instances or groups of them. Important tasks that influence analysis contributes to include the following (a minimal code sketch follows the list):

  • Understanding why a trained model behaves in a specific way
  • Identifying limitations/vulnerabilities in the training data
  • Determining whether a prediction is well supported by the training set
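
The simplest formalization of influence is leave-one-out (LOO): retrain the model without a given training instance and measure how the loss on a test example changes. Below is a minimal sketch; the scikit-learn logistic regression and log-loss are purely illustrative stand-ins for your own model and loss.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def loo_influence(X_train, y_train, x_test, y_test, index):
    """Leave-one-out influence of training point `index` on a test example.

    Positive values mean the point reduced the test loss (helpful);
    negative values mean it increased the loss (harmful).
    """
    def test_loss(model):
        # Negative log-likelihood of the true test label
        # (assumes integer class labels 0..k-1).
        proba = model.predict_proba(x_test.reshape(1, -1))[0]
        return -np.log(proba[y_test])

    full_model = LogisticRegression().fit(X_train, y_train)
    mask = np.arange(len(X_train)) != index
    ablated_model = LogisticRegression().fit(X_train[mask], y_train[mask])
    # Influence = loss without the point minus loss with it.
    return test_loss(ablated_model) - test_loss(full_model)
```

Computing LOO influence exactly requires one retraining per training instance, which is intractable for modern models; much of the literature below is about approximating quantities like this efficiently.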

Citation

If you find this repo or our survey useful, please cite our paper as follows:

```bibtex
@article{Hammoudeh:2022:InfluenceSurvey,
  author = {Zayd Hammoudeh and
            Daniel Lowd},
  title = {Training Data Influence Analysis and Estimation: A Survey},
  archivePrefix = {arXiv},
  eprint = {2212.04612},
  primaryClass = {cs.LG},
  year = {2022},
}
```

Survey

  • Training Data Influence Analysis and Estimation: A Survey [link]
    • Zayd Hammoudeh and Daniel Lowd. Machine Learning, 2024.

  • Unifying Corroborative and Contributive Attributions in Large Language Models [link]
    • Theodora Worledge, Judy Hanwen Shen, Nicole Meister, Caleb Winston, and Carlos Guestrin. NeurIPS Attribution Workshop, 2023.

Retraining-Based Methods

Click on the arrow next to the year to show/hide the corresponding papers.
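
Many methods in this section estimate Shapley values by averaging each point's marginal contribution over sampled permutations, in the spirit of Data Shapley (Ghorbani and Zou, 2019, below). Here is a minimal sketch of that sampling loop; `utility` is a placeholder for your own retrain-and-evaluate pipeline:

```python
import numpy as np

def permutation_shapley(n_points, utility, num_permutations=100, seed=0):
    """Monte Carlo (permutation-sampling) estimate of Data Shapley values.

    `utility(indices)` must return model performance (e.g., validation
    accuracy) after retraining on the given subset of training indices.
    """
    rng = np.random.default_rng(seed)
    values = np.zeros(n_points)
    for _ in range(num_permutations):
        perm = rng.permutation(n_points)
        prev = utility(np.array([], dtype=int))  # empty-set utility
        for i, idx in enumerate(perm):
            cur = utility(perm[:i + 1])
            values[idx] += cur - prev  # marginal contribution of `idx`
            prev = cur
    return values / num_permutations
```

Practical variants truncate each permutation once marginal contributions become negligible and add convergence checks; the papers below largely differ in how they tame this retraining cost.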

2024
  • The Mirrored Influence Hypothesis: Efficient Data Influence Estimation by Harnessing Forward Passes [link] [code]
    • Myeongseob Ko, Feiyang Kang, Weiyan Shi, Ming Jin, Zhou Yu, and Ruoxi Jia. 2024.

  • Thresholding Data Shapley for Data Cleansing Using Multi-Armed Bandits [link]
    • Hiroyuki Namba, Shota Horiguchi, Masaki Hamamoto, and Masashi Egi. 2024.
2023
  • Threshold KNN-Shapley: A Linear-Time and Privacy-Friendly Approach to Data Valuation [link]
    • Jiachen T. Wang, Yuqing Zhu, Yu-Xiang Wang, Ruoxi Jia, and Prateek Mittal. 2023.

  • 2D-Shapley: A Framework for Fragmented Data Valuation [link] [code]
    • Liu Zhihong, Hoang Anh Just, Xiangyu Chang, Xi Chen, and Ruoxi Jia. ICML, 2023.

  • From Shapley Values to Generalized Additive Models and Back [link] [code]
    • Sebastian Bordt and Ulrike von Luxburg. AISTATS, 2023.

  • Faith-Shap: The Faithful Shapley Interaction Index [link]
    • Che-Ping Tsai, Chih-Kuan Yeh, and Pradeep Ravikumar. JMLR, 2023.

  • Training Data Attribution for Diffusion Models [link] [code]
    • Zheng Dai and David K Gifford. 2023.

  • A Bayesian Perspective On Training Data Attribution [link]
    • Elisa Nguyen, Minjoon Seo, and Seong Joon Oh. 2023.

  • A Note on "Efficient Task-Specific Data Valuation for Nearest Neighbor Algorithms" [link]
    • Jiachen T. Wang and Ruoxi Jia. 2023.

  • A Note on "Towards Efficient Data Valuation Based on the Shapley Value" [link]
    • Jiachen T. Wang and Ruoxi Jia. 2023.

  • Data Banzhaf: A Robust Data Valuation Framework for Machine Learning [link]
    • Jiachen T. Wang and Ruoxi Jia. AISTATS, 2023.

  • ModelPred: A Framework for Predicting Trained Model from Training Data [link] [video]
    • Yingyan Zeng, Jiachen T. Wang, Si Chen, Hoang Anh Just, Ran Jin, and Ruoxi Jia. SaTML, 2023.

  • FaShapley: Fast and Approximated Shapley Based Model Pruning Towards Certifiably Robust DNNs [link] [code]
    • Mintong Kang, Linyi Li, and Bo Li. SaTML, 2023.
2022
  • Datamodels: Predicting Predictions from Training Data [link] [blog] [data]
    • Andrew Ilyas, Sung Min Park, Logan Engstrom, Guillaume Leclerc, and Aleksander Madry. ICML, 2022.

  • Measuring the Effect of Training Data on Deep Learning Predictions via Randomized Experiments [link]
    • Jinkun Lin, Anqi Zhang, Mathias Lecuyer, Jinyang Li, Aurojit Panda, and Siddhartha Sen. ICML, 2022.

  • Beta Shapley: A Unified and Noise-Reduced Data Valuation Framework for Machine Learning [link] [code] [video]
    • Yongchan Kwon and James Zou. AISTATS, 2022.

  • Private Data Valuation and Fair Payment in Data Marketplaces [link]
    • Zhihua Tian, Jian Liu, Jingyu Li, Xinle Cao, Ruoxi Jia, Jun Kong, Mengdi Liu, and Kui Ren. 2022.
2021
  • Counterfactual Memorization in Neural Language Models [link]
    • Chiyuan Zhang, Daphne Ippolito, Katherine Lee, Matthew Jagielski, Florian Tramèr, and Nicholas Carlini. 2021.

  • Characterizing Structural Regularities of Labeled Data in Overparameterized Models [link] [code] [video]
    • Ziheng Jiang, Chiyuan Zhang, Kunal Talwar, and Michael C Mozer. ICML, 2021.

  • If You Like Shapley Then You’ll Love the Core [link] [video]
    • Tom Yan and Ariel D. Procaccia. AAAI, 2021.

  • Scalability vs. Utility: Do We Have to Sacrifice One for the Other in Data Importance Quantification? [link] [code]
    • Ruoxi Jia, Fan Wu, Xuehui Sun, Jiacen Xu, David Dao, Bhavya Kailkhura, Ce Zhang, Bo Li, and Dawn Song. CVPR, 2021.
2020
  • The Shapley Taylor Interaction Index [link] [video]
    • Kedar Dhamdhere, Ashish Agarwal, and Mukund Sundararajan. ICML, 2020.

  • What Neural Networks Memorize and Why: Discovering the Long Tail via Influence Estimation [link] [code] [video]
    • Vitaly Feldman and Chiyuan Zhang. NeurIPS, 2020.

  • Data Valuation using Reinforcement Learning [link] [code] [video]
    • Jinsung Yoon, Sercan Arik, and Tomas Pfister. ICML, 2020.

  • A Distributional Framework for Data Valuation [link] [code] [video]
    • Amirata Ghorbani, Michael P. Kim, and James Zou. ICML, 2020.

  • Does Learning Require Memorization? A Short Tale about a Long Tail [link]
    • Vitaly Feldman. STOC, 2020.
2019
  • Data Shapley: Equitable Valuation of Data for Machine Learning [link] [code] [video]
    • Amirata Ghorbani and James Zou. ICML, 2019.

  • Towards Efficient Data Valuation Based on the Shapley Value [link] [technical note]
    • Ruoxi Jia, David Dao, Boxin Wang, Frances Ann Hubis, Nick Hynes, Nezihe Merve Gurel, Bo Li, Ce Zhang, Dawn Song, and Costas J. Spanos. AISTATS, 2019.
2016
  • "Influence Sketching": Finding Influential Samples in Large-Scale Regressions [link]
    • Mike Wojnowicz, Ben Cruz, Xuan Zhao, Brian Wallace, Matt Wolff, Jay Luan, and Caleb Crable. BigData, 2016.
Older
  • The Shapley Value: Essays in Honor of Lloyd S. Shapley [link]
    • Alvin E. Roth (editor). ISBN 052136177X. 1988.

  • Residuals and Influence in Regression [link]
    • R. Dennis Cook and Sanford Weisberg. ISBN 041224280X. 1982.

  • Detection of Influential Observations in Linear Regression [link]
    • R. Dennis Cook. Technometrics, 1977.

  • On the Uniqueness of the Shapley Value [link]
    • Pradeep Dubey. International Journal of Game Theory, 1975.

  • A Value for n-Person Games [link]
    • Lloyd S. Shapley. Contributions to the Theory of Games II, 1953.

Gradient-Based Estimators

Click on the arrow next to the year to show/hide the corresponding papers.
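
These estimators replace retraining with gradient information from the trained model. As one concrete example, TracIn (Pruthi et al., 2020, below) approximates a training example's influence on a test example as a learning-rate-weighted sum of gradient dot products over checkpoints saved during training. A minimal PyTorch sketch, with all names as placeholders for your own pipeline:

```python
import torch

def tracin_influence(model, loss_fn, z_train, z_test, checkpoints, lrs):
    """TracIn-style influence of one training example on one test example.

    `z_train`/`z_test` are (input, label) pairs; `checkpoints` is a list
    of state dicts saved during training, with matching learning rates
    `lrs`.
    """
    def flat_grad(z):
        x, y = z
        loss = loss_fn(model(x), y)
        grads = torch.autograd.grad(loss, list(model.parameters()))
        return torch.cat([g.reshape(-1) for g in grads])

    influence = 0.0
    for state_dict, lr in zip(checkpoints, lrs):
        model.load_state_dict(state_dict)
        influence = influence + lr * torch.dot(flat_grad(z_train), flat_grad(z_test))
    return float(influence)
```

Influence functions (Koh and Liang, 2017, below) instead use a single trained model but require an inverse-Hessian-vector product, whose approximation and scaling is the focus of several papers in this section.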

2024
  • Data Attribution for Diffusion Models: Timestep-induced Bias in Influence Estimation [link]
    • Tong Xie, Haoyu Li, Andrew Bai, and Cho-Jui Hsieh. 2024.

  • Intriguing Properties of Data Attribution on Diffusion Models [link] [code]
    • Xiaosen Zheng, Tianyu Pang, Chao Du, Jing Jiang, and Min Lin. ICLR, 2024.
2023
  • Studying Large Language Model Generalization with Influence Functions [link]
    • Roger Grosse, Juhan Bae, Cem Anil, Nelson Elhage, Alex Tamkin, Amirhossein Tajdini, Benoit Steiner, Dustin Li, Esin Durmus, Ethan Perez, Evan Hubinger, Kamilė Lukošiūtė, Karina Nguyen, Nicholas Joseph, Sam McCandlish, Jared Kaplan, and Samuel R. Bowman. 2023.

  • Representer Point Selection for Explaining Regularized High-dimensional Models [link]
    • Che-Ping Tsai, Jiong Zhang, Eli Chien, Hsiang-Fu Yu, Cho-Jui Hsieh, and Pradeep Ravikumar. ICML, 2023.

  • Theoretical and Practical Perspectives on what Influence Functions Do [link]
    • Andrea Schioppa, Katja Filippova, Ivan Titov, and Polina Zablotskaia. 2023.

  • TRAK: Attributing Model Behavior at Scale [link] [code]
    • Sung Min Park, Kristian Georgiev, Andrew Ilyas, Guillaume Leclerc, and Aleksander Madry. ICML, 2023.
2022
  • Identifying a Training-Set Attack’s Target Using Renormalized Influence Estimation [link] [code]
    • Zayd Hammoudeh and Daniel Lowd. CCS, 2022.

  • Cross-Loss Influence Functions to Explain Deep Network Representations [link] [code]
    • Andrew Silva, Rohit Chopra, and Matthew Gombolay. AISTATS, 2022.

  • If Influence Functions are the Answer, Then What is the Question? [link]
    • Juhan Bae, Nathan Ng, Alston Lo, Marzyeh Ghassemi, and Roger Grosse. 2022.

  • First is Better Than Last for Training Data Influence [link]
    • Chih-Kuan Yeh, Ankur Taly, Mukund Sundararajan, Frederick Liu, and Pradeep Ravikumar. NeurIPS, 2022.

  • Influence Functions for Sequence Tagging Models [link] [code]
    • Sarthak Jain, Varun Manjunatha, Byron C. Wallace, and Ani Nenkova. Findings of EMNLP, 2022.

  • Scaling Up Influence Functions [link] [code] [video]
    • Andrea Schioppa, Polina Zablotskaia, David Vilar Torres, and Artem Sokolov. AAAI, 2022.

  • Rethinking Influence Functions of Neural Networks in the Over-Parameterized Regime [link]
    • Rui Zhang and Shihua Zhang. AAAI, 2022.

  • TracInAD: Measuring Influence for Anomaly Detection [link] [code]
    • Hugo Thimonier, Fabrice Popineau, Arpad Rimmel, Bich-Liên Doan, and Fabrice Daniel. IJCNN, 2022.
2021
  • Influence Functions in Deep Learning Are Fragile [link] [video]
    • Samyadeep Basu, Phil Pope, and Soheil Feizi. ICLR, 2021.

  • HyDRA: Hypergradient Data Relevance Analysis for Interpreting Deep Neural Networks [link] [code] [video]
    • Yuanyuan Chen, Boyang Li, Han Yu, Pengcheng Wu, and Chunyan Miao. AAAI, 2021.

  • On Memorization in Probabilistic Deep Generative Models [link] [code] [video]
    • Gerrit J. J. van den Burg and Christopher K. I. Williams. NeurIPS, 2021.

  • Understanding Instance-based Interpretability of Variational Auto-Encoders [link] [code] [video]
    • Zhifeng Kong and Kamalika Chaudhuri. NeurIPS, 2021.

  • FastIF: Scalable Influence Functions for Efficient Model Interpretation and Debugging [link] [code] [video]
    • Han Guo, Nazneen Fatema Rajani, Peter Hase, Mohit Bansal, and Caiming Xiong. EMNLP, 2021.

  • Influence Estimation for Generative Adversarial Networks [link] [code] [video]
    • Naoyuki Terashita, Hiroki Ohashi, Yuichi Nonaka, and Takashi Kanemaru. ICLR, 2021.

  • Revisiting Methods for Finding Influential Examples [link]
    • Karthikeyan K and Anders Søgaard. 2021.

  • Finding High-Value Training Data Subset through Differentiable Convex Programming [link] [code]
    • Soumi Das, Arshdeep Singh, Saptarshi Chatterjee, Suparna Bhattacharya, and Sourangshu Bhattacharya. ECML-PKDD, 2021.

  • Representer Point Selection via Local Jacobian Expansion for Post-hoc Classifier Explanation of Deep Neural Networks and Ensemble Models [link] [code] [video]
    • Yi Sui, Ga Wu, and Scott Sanner. NeurIPS, 2021.

  • Simple, Attack-Agnostic Defense Against Targeted Training Set Attacks Using Cosine Similarity [link] [code]
    • Zayd Hammoudeh and Daniel Lowd. UDL, 2021.
2020
  • Estimating Training Data Influence by Tracing Gradient Descent [link] [code] [video]
    • Garima Pruthi, Frederick Liu, Satyen Kale, and Mukund Sundararajan. NeurIPS, 2020.

  • RelatIF: Identifying Explanatory Training Samples via Relative Influence [link] [video]
    • Elnaz Barshan, Marc-Etienne Brunet, and Gintare Karolina Dziugaite. AISTATS, 2020.

  • On Second-Order Group Influence Functions for Black-Box Predictions [link] [video]
    • Samyadeep Basu, Xuchen You, and Soheil Feizi. ICML, 2020.

  • Efficient Estimation of Influence of a Training Instance [link] [slides]
    • Sosuke Kobayashi, Sho Yokoi, Jun Suzuki, and Kentaro Inui. SustaiNLP, 2020.
2019
  • On the Accuracy of Influence Functions for Measuring Group Effects [link] [code]
    • Pang Wei Koh, Kai-Siang Ang, Hubert H. K. Teo, and Percy Liang. NeurIPS, 2019.

  • Data Cleansing for Models Trained with SGD [link] [code]
    • Satoshi Hara, Atsushi Nitanda, and Takanori Maehara. NeurIPS, 2019.
2018
  • Representer Point Selection for Explaining Deep Neural Networks [link] [code]
    • Chih-Kuan Yeh, Joon Sik Kim, Ian E.H. Yen, and Pradeep Ravikumar. NeurIPS, 2018.
2017
  • Understanding Black-box Predictions via Influence Functions [link] [code] [video]
    • Pang Wei Koh and Percy Liang. ICML, 2017. (Best paper award winner)

  • A Closer Look at Memorization in Deep Networks [link]
    • Devansh Arpit, Stanislaw Jastrzebski, Nicolas Ballas, David Krueger, Emmanuel Bengio, Maxinder S. Kanwal, Tegan Maharaj, Asja Fischer, Aaron Courville, Yoshua Bengio, and Simon Lacoste-Julien. ICML, 2017.
Older
  • The Influence Curve and its Role in Robust Estimation [link]
    • Frank R. Hampel. Journal of the American Statistical Association, 1974.

  • The Infinitesimal Jackknife [link]
    • Louis A. Jaeckel. 1972.

Non-Parametric Methods

Click on the arrow next to the year to show/hide the corresponding papers.
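
For some model classes, influence has a closed form. For an unweighted K-nearest-neighbors classifier, Jia et al. (2019, below) derive an exact recursion that yields every training point's Shapley value for a single test example in O(N log N) time. A minimal sketch, assuming Euclidean distance and integer class labels:

```python
import numpy as np

def knn_shapley(X_train, y_train, x_test, y_test, K=5):
    """Exact Shapley values for an unweighted K-NN classifier on one
    test point, via the closed-form recursion of Jia et al. (2019)."""
    n = len(X_train)
    # Sort training points by distance to the test point (closest first).
    order = np.argsort(np.linalg.norm(X_train - x_test, axis=1))
    match = (y_train[order] == y_test).astype(float)  # label agreement

    s = np.zeros(n)  # Shapley values, in sorted (distance) order
    s[n - 1] = match[n - 1] / n
    for i in range(n - 2, -1, -1):  # recurse from farthest to closest
        s[i] = s[i + 1] + (match[i] - match[i + 1]) / K * min(K, i + 1) / (i + 1)

    values = np.zeros(n)
    values[order] = s  # map back to the original training-set order
    return values
```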

2023
  • Adapting and Evaluating Influence-Estimation Methods for Gradient-Boosted Decision Trees [link] [code]
    • Jonathan Brophy, Zayd Hammoudeh, and Daniel Lowd. JMLR, 2023.
2019
  • Efficient Task-Specific Data Valuation for Nearest Neighbor Algorithms [link] [code] [technical note]
    • Ruoxi Jia, David Dao, Boxin Wang, Frances Ann Hubis, Nezihe Merve Gurel, Bo Li, Ce Zhang, Costas J. Spanos, and Dawn Song. PVLDB, 2019.
2018
  • Finding Influential Training Samples for Gradient Boosted Decision Trees [link] [code]
    • Boris Sharchilev, Yury Ustinovskiy, Pavel Serdyukov, and Maarten de Rijke. ICML, 2018.

Applications of Influence Analysis

Click on the arrow next to the application to show/hide the corresponding papers.

Active Learning
  • Influence Selection for Active Learning [link] [code]
    • Zhuoming Liu, Hao Ding, Huaping Zhong, Weijia Li, Jifeng Dai, and Conghui He. ICCV, 2021.

  • RIM: Reliable Influence-based Active Learning on Graphs [link] [code]
    • Wentao Zhang, Yexin Wang, Zhenbang You, Meng Cao, Ping Huang, Jiulong Shan, Zhi Yang, and Bin Cui. NeurIPS, 2021.
Adversarial Attacks
  • Membership Inference Attack Using Self Influence Functions [link] [code]
    • Gilad Cohen and Raja Giryes. 2022.

  • Influence Function based Data Poisoning Attacks to Top-N Recommender Systems [link]
    • Minghong Fang, Neil Zhenqiang Gong, and Jia Liu. WWW, 2020.
Adversarial Defenses
  • Identifying a Training-Set Attack’s Target Using Renormalized Influence Estimation [link] [code] [video]
    • Zayd Hammoudeh and Daniel Lowd. CCS, 2022.

  • Detecting Adversarial Samples Using Influence Functions and Nearest Neighbors [link] [code]
    • Gilad Cohen, Guillermo Sapiro, and Raja Giryes. CVPR, 2020.
Data Augmentation
  • Influence-guided Data Augmentation for Neural Tensor Completion [link] [code]
    • Sejoon Oh, Sungchul Kim, Ryan A. Rossi, and Srijan Kumar. CIKM, 2021.

  • Learning Augmentation Network via Influence Functions [link] [video]
    • Donghoon Lee, Hyunsin Park, Trung Pham, and Chang D. Yoo. CVPR, 2020.
Data Cleaning
  • Resolving Training Biases via Influence-based Data Relabeling [link] [code] [video]
    • Shuming Kong, Yanyan Shen, and Linpeng Huang. ICLR, 2022.

  • Influence Based Re-Weighing for Labeling Noise in Medical Imaging [link]
    • Joschka Braun, Micha Kornreich, JinHyeong Park, Jayashri Pawar, James Browning, Richard Herzog, Benjamin Odry, and Li Zhang. ISBI, 2022.

  • Fortifying Toxic Speech Detectors Against Veiled Toxicity [link] [code] [video]
    • Xiaochuang Han and Yulia Tsvetkov. EMNLP, 2020.
Fairness and Explainability
  • Fair Classifiers Without Fair Training: An Influence-Guided Data Sampling Approach [link]
    • Jinlong Pang, Jialu Wang, Zhaowei Zhu, Yuanshun Yao, Chen Qian, and Yang Liu. 2024.

  • Exploring Practitioner Perspectives On Training Data Attribution Explanations [link]
    • Elisa Nguyen, Evgenii Kortukov, Jean Song, and Seong Joon Oh. 2023.

  • Achieving Fairness at No Utility Cost via Data Reweighing with Influence [link] [code]
    • Peizhao Li and Hongfu Liu. ICML, 2022.

  • An Empirical Comparison of Instance Attribution Methods for NLP [link] [video] [code]
    • Pouya Pezeshkpour, Sarthak Jain, Byron C. Wallace, and Sameer Singh. NAACL, 2021.

  • Leave-One-Out Unfairness [link] [video]
    • Emily Black and Matt Fredrikson. FAccT, 2021.

  • The Many Shapley Values for Model Explanation [link]
    • Mukund Sundararajan and Amir Najmi. ICML, 2020.

  • RelatIF: Identifying Explanatory Training Samples via Relative Influence [link] [video]
    • Elnaz Barshan, Marc-Etienne Brunet, and Gintare Karolina Dziugaite. AISTATS, 2020.
Subsampling
  • Less Is Better: Unweighted Data Subsampling via Influence Function [link] [code]
    • Zifeng Wang, Hong Zhu, Zhenhua Dong, Xiuqiang He, and Shao-Lun Huang. AAAI, 2020.

  • Optimal Subsampling with Influence Functions [link]
    • Daniel Ting and Eric Brochu. NeurIPS, 2018.

Credits

The structure of this repository is inspired by and adapted from the backdoor-learning-resources repository.

License

This repository is released under the MIT License.