
MIAdefenseSELENA

By Xinyu Tang, Saeed Mahloujifar, Liwei Song, Virat Shejwalkar, Milad Nasr, Amir Houmansadr, Prateek Mittal

Code for "Mitigating Membership Inference Attacks by Self-Distillation Through a Novel Ensemble Architecture" in USENIX Security 2022.

Update 08/2022: We earned all badges (available, functional, reproduced) in the USENIX artifact evaluation.

Files

├── MIAdefenseSELENA
|    ├── memguard             # pretrained NN MIA attack models for MemGuard
|    ├── env.yml              # specify root_dir and src_dir (ending with MIAdefenseSELENA)
|    ├── requirement.txt
|    ├── utils.py
|    ├── cifar_utils.py
|    ├── prepare_dataset.py   # prepare purchase100 (X.npy, Y.npy) and texas100 (feats.npy, labels.npy)
|    ├── early_stopping.py    # load the checkpoints saved at each epoch of undefended training, launch direct single-query attacks, and plot Figure 4 in the paper
|    ├── attack
|    |    ├── dsq_attack.py               # direct single-query attacks on purchase100/texas100/cifar100
|    |    ├── binary_flip_noise_attack.py # label-only attacks on purchase100/texas100
|    |    ├── Aug_Attack.py               # augmentation attacks (data augmentation attacks, label-only) on cifar100
|    |    ├── CW_Attack.py                # CW attacks (boundary attacks, label-only) on cifar100
|    |    └── adaptive_attack.py          # adaptive attacks for SELENA on purchase100/texas100/cifar100
|    ├── models
|    |    ├── purchase.py # model for Purchase100
|    |    ├── texas.py    # model for Texas100
|    |    └── resnet.py   # model for CIFAR100
|    ├── purchase
|    |    ├── data_partition.py # generate *.npy files for member/nonmember sets to train/eval MIA attacks
|    |    ├── Undefend
|    |    |    ├── train.py # train the undefended model
|    |    |    └── eval.py  # eval the undefended model via direct single-query attacks and label-only attacks
|    |    ├── MemGuard
|    |    |    ├── prepare_for_memguard.py # save the predictions and logits as inputs for MemGuard
|    |    |    ├── memguard_run.py         # MemGuard defense; outputs the perturbed predictions
|    |    |    └── eval_memguard.py        # eval the MemGuard-perturbed predictions via direct single-query attacks
|    |    ├── AdvReg
|    |    |    ├── train.py # train the model via adversarial regularization
|    |    |    └── eval.py  # eval the AdvReg model via direct single-query attacks and label-only attacks
|    |    └── SELENA
|    |         ├── generation10.py # generate non_model indices for the defender's Split-AI
|    |         ├── Split_AI
|    |         |    ├── train.py # train the Split-AI models
|    |         |    └── eval.py  # eval Split-AI via direct single-query attacks (expect ~50% attack accuracy)
|    |         ├── Distillation
|    |         |    ├── train.py # train a new model via distillation from Split-AI
|    |         |    └── eval.py  # eval the distilled model (i.e., the final protected model) via direct single-query attacks and label-only attacks
|    |         └── adaptive_attack
|    |              ├── generation10.py # generate non_model indices for the attacker's shadow Split-AI
|    |              ├── train.py        # train the attacker's shadow Split-AI
|    |              └── eval.py         # eval the distilled model (i.e., the final protected model) via adaptive attacks
|    ├── texas    # same file structure as MIAdefenseSELENA/purchase
|    ├── cifar100
|    |    ├── data_partition.py # generate *.npy files for member/nonmember sets to train/eval MIA attacks
|    |    ├── Undefend
|    |    |    ├── train.py    # train the undefended model
|    |    |    ├── eval.py     # eval the undefended model via direct single-query attacks
|    |    |    ├── eval_aug.py # eval the undefended model via augmentation attacks (data augmentation attacks, label-only)
|    |    |    └── eval_cw.py  # eval the undefended model via CW attacks (boundary attacks, label-only)
|    |    ├── MemGuard
|    |    |    ├── prepare_for_memguard.py # save the predictions and logits as inputs for MemGuard
|    |    |    ├── memguard_run.py         # MemGuard defense; outputs the perturbed predictions
|    |    |    └── eval_memguard.py        # eval the MemGuard-perturbed predictions via direct single-query attacks
|    |    ├── AdvReg
|    |    |    ├── train.py    # train the model via adversarial regularization
|    |    |    ├── eval.py     # eval the AdvReg model via direct single-query attacks
|    |    |    ├── eval_aug.py # eval the AdvReg model via augmentation attacks (data augmentation attacks, label-only)
|    |    |    └── eval_cw.py  # eval the AdvReg model via CW attacks (boundary attacks, label-only)
|    |    └── SELENA
|    |         ├── generation10.py # generate non_model indices for the defender's Split-AI
|    |         ├── Split_AI
|    |         |    ├── train.py # train the Split-AI models
|    |         |    └── eval.py  # eval Split-AI via direct single-query attacks (expect ~50% attack accuracy)
|    |         ├── Distillation
|    |         |    ├── train.py    # train a new model via distillation from Split-AI
|    |         |    ├── eval.py     # eval the distilled model (i.e., the final protected model) via direct single-query attacks
|    |         |    ├── eval_aug.py # eval the distilled model via augmentation attacks (data augmentation attacks, label-only)
|    |         |    └── eval_cw.py  # eval the distilled model via CW attacks (boundary attacks, label-only)
|    |         └── adaptive_attack
|    |              ├── generation10.py # generate non_model indices for the attacker's shadow Split-AI
|    |              ├── train.py        # train the attacker's shadow Split-AI
|    |              └── eval.py         # eval the distilled model (i.e., the final protected model) via adaptive attacks
└── MIA_root_dir
     ├── memguard    # attack models for the optimization process of MemGuard
     |    ├── purchase_MIA_model.h5
     |    ├── texas_MIA_model.h5
     |    └── cifar100_MIA_model.h5
     ├── purchase
     |    ├── data
     |    |    ├── random_r_purchase100
     |    |    ├── X.npy
     |    |    ├── Y.npy
     |    |    ├── memguard
     |    |    |    ├── defense_results # output of MemGuard
     |    |    |    └── prediction      # predictions and logits used as inputs to MemGuard
     |    |    └── partition
     |    |         ├── *.npy # npy files for member/nonmember sets to train/eval MIA attacks
     |    |         └── K_L
     |    |              └── 25_10
     |    |                   ├── defender # non_model indices for the defender's Split-AI
     |    |                   └── attacker # non_model indices for the attacker's shadow Split-AI
     |    └── checkpoints
     |         ├── undefend # checkpoint of the undefended model
     |         ├── AdvReg   # checkpoint of the adversarial-regularization model
     |         └── K_L
     |              └── 25_10
     |                   ├── split_ai # models of the defender's Split-AI
     |                   ├── selena   # the defender's distilled model, i.e., the final output model
     |                   └── shadow   # models of the attacker's shadow Split-AI
     ├── texas
     |    ├── data
     |    |    ├── random_r_texas100
     |    |    ├── feats.npy
     |    |    ├── labels.npy
     |    |    └── remaining files # same structure as MIA_root_dir/purchase/data (created by the corresponding scripts)
     |    └── checkpoints # same structure as MIA_root_dir/purchase/checkpoints (created by the corresponding scripts)
     └── cifar100
          ├── data
          |    ├── random_r_cifar100
          |    ├── cifar-100-python
          |    └── remaining files # same structure as MIA_root_dir/purchase/data (created by the corresponding scripts)
          └── checkpoints # same structure as MIA_root_dir/purchase/checkpoints (created by the corresponding scripts)
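
For orientation, the SELENA folders above implement a two-stage pipeline: Split-AI trains K sub-models (K=25, L=10 in the K_L/25_10 directories), with every training sample excluded from L of them, and Distillation then trains the single released model on Split-AI's outputs. Below is a minimal, hypothetical PyTorch sketch of that idea; sub_models, train_x, and non_model_indices are placeholder names, and the actual scripts use the index files written by generation10.py and the architectures in models/.

import torch
import torch.nn.functional as F

K, L = 25, 10  # K sub-models; each training sample is held out of L of them

def split_ai_soft_labels(sub_models, train_x, non_model_indices):
    # For every training sample, average the softmax outputs of the L
    # sub-models that did NOT train on it (its non_model indices).
    soft_labels = []
    with torch.no_grad():
        for i in range(train_x.shape[0]):
            x = train_x[i].unsqueeze(0)
            preds = [F.softmax(sub_models[m](x), dim=1) for m in non_model_indices[i]]
            soft_labels.append(torch.cat(preds, dim=0).mean(dim=0))
    return torch.stack(soft_labels)

def distillation_loss(student_logits, soft_labels):
    # Self-distillation: fit the final (released) model to Split-AI's soft
    # labels (shown here with a soft cross-entropy loss as one possible choice).
    return -(soft_labels * F.log_softmax(student_logits, dim=1)).sum(dim=1).mean()

The direct single-query, label-only, and adaptive attacks listed above are all evaluated against this final distilled model.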

Getting Started

Before running the code, follow these three steps:

  • Specify your root_dir and src_dir in env.yml. root_dir is the root directory where data and checkpoints are saved (corresponding to MIA_root_dir in Files). src_dir is the root directory of the source code (it should end with this repository name, MIAdefenseSELENA). A hypothetical sketch of reading this file appears after these steps.

  • Install the required packages. The code is tested with Python 3.8.5, PyTorch 1.11.0 (for most experiments), and TensorFlow 2.9.1 (for MemGuard). The complete list of required packages is available in requirement.txt and can be installed with pip install -r requirement.txt.

  • Prepare the datasets and the pretrained NN MIA attack model for MemGuard. We use three datasets: Purchase100 [link], Texas100 [link], and CIFAR100 [link]. You can prepare all three by running the following command (it also moves the pretrained NN MIA attack model for MemGuard to the assumed path MIA_root_dir/memguard):

python prepare_dataset.py
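
For reference, here is a hypothetical way to consume env.yml; the repository's scripts handle this themselves, and the exact parsing (and any keys beyond root_dir and src_dir) may differ:

# Assumed env.yml contents (paths are placeholders):
#   root_dir: /path/to/MIA_root_dir
#   src_dir: /path/to/MIAdefenseSELENA
import yaml  # PyYAML

with open("env.yml") as f:
    cfg = yaml.safe_load(f)

root_dir = cfg["root_dir"]  # root directory for data and checkpoints (MIA_root_dir in Files)
src_dir = cfg["src_dir"]    # source directory; should end with MIAdefenseSELENA
assert src_dir.rstrip("/").endswith("MIAdefenseSELENA"), "src_dir must end with the repo name"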

Usage

  • Refer to the file structure and comments in Files.
  • See misc/reproducibility.md for instructions on reproducing all results in the main body of the paper.

Notes

  • The initial repo selects the model that performs best on the test set for the final evaluation. We have checked that this bias is not significant on the evaluated datasets (Purchase100/Texas100/CIFAR100). For further usage, it is advisable to select the model on a separate validation set and report final results on the test set to avoid potential bias.

  • If memory issues occur in Distillation/train.py (example below), consider decreasing test_batch_size (the large default test_batch_size is intended to maximize GPU utilization and save time on Split-AI inference).

ERROR: Unexpected bus error encountered in worker. This might be caused by insufficient shared memory (shm).
  • Some variable names are not consistent across scripts (and can be confusing). The following names are equivalent:
    • train_member_label/know_train_label/train_label_tr: member sets used to train MIA models.
    • test_member_label/unknow_train_label/train_label_te: member sets used to eval MIA models.
    • train_nonmember_label/ref_label/attack_label: nonmember sets used to train MIA models.
    • test_nonmember_label/test_label: nonmember sets used to eval MIA models.

Reference Repository

Citations

If you find our work useful in your research, please consider citing:

@inproceedings{tang2022miadefenseselena,
  title={Mitigating Membership Inference Attacks by Self-Distillation Through a Novel Ensemble Architecture},
  author={Tang, Xinyu and Mahloujifar, Saeed and Song, Liwei and Shejwalkar, Virat and Nasr, Milad and Houmansadr, Amir and Mittal, Prateek},
  booktitle = {31st {USENIX} Security Symposium ({USENIX} Security)},
  year={2022}
}


License: MIT License

