adversarial-attacks regional-adversarial-attacks localized-adversarial-perturbation lp-norms

Regional Adversarial Perturbation

This repository contains the bare bone implementation of the regional adversarial attack used in the paper "Regional Image Perturbation Reduces Lp Norms of Adversarial Examples While Maintaining Model-to-model Transferability" published in the 2020 International Conference on Machine Learning, Workshop in Uncertainty & Robustness in Deep Learning (Link to the paper).

General Information

We experimented with a simple and general method of localizing adversarial perturbation and used Cross-Entropy Sign (used by FGS, IFGS, and PGD) to generate adversarial examples with regional perturbation. Experimented regions are indicated below.

Some of the produced adversarial examples are provided below. We observed that regional perturbation reducing Lp, with p in {0, 2, \infty}, norms of adversarial perturbation while maintaining model-to-model transferability, thus posing a serious threat to defenses that claim robustness under certain Lp norms.

Citation

If you find the code in this repository useful for your research, consider citing our paper.

@inproceedings{ozbulak2020regional,
    title={Regional Image Perturbation Reduces Lp Norms of Adversarial Examples While Maintaining Model-to-model Transferability},
    author={Ozbulak, Utku and Peck, Jonathan and De Neve, Wesley and Goossens, Bart and Saeys, Yvan and Van Messem, Arnout},
    booktitle={2020 International Conference on Machine Learning (ICML), Workshop in Uncertainty and Robustness in Deep Learning (UDL)},
    year={2020}
}

Requirements

python > 3.5
torch >= 0.4.0
torchvision >= 0.1.9
numpy >= 1.13.0
PIL >= 1.1.7

About

Code and materials for the paper "Regional Image Perturbation Reduces Lp Norms of Adversarial Examples While Maintaining Model-to-model Transferability" (ICML 2020, UDL).

adversarial-attacks regional-adversarial-attacks localized-adversarial-perturbation lp-norms

MIT License

Languages

Language:Python 100.0%