Alvov1 / Model-Inversion-Attack-with-GAN

Implementation of the model inversion attack on the Gated-Recurrent-Unit neural network

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Model inversion attack on neural network

Implementation of the model inversion attack on the Gated-Recurrent-Unit neural network (GRU) using Generative adversarial network (GAN) for samples generation. May be used to determine the membership of training images or to determine the complete set of training data.

GAN is used for creation proxy images for feeding the target system as input. Further, images classified with a high degree of confidence by the target system can be assigned to a member of the training sample used to train this system. A neural network of several GRU layers targetted to classification problem is used as a target system, for which, a mechanism for protecting random data noise is implemented in addition to the standard functionality.

The cifar10 dataset was used to train the target system from which only images of cats and dogs were pre-filtered. The same dataset with filtered dog images was used to train the generative gun network used to generate the images used to carry out the attack.

Results. Generated images confidently classified by the target system (left) and their possible prototypes from the training set of the target system (right):

generated_plot_319_26 dog2

generated_plot_319_41 dog7

About

Implementation of the model inversion attack on the Gated-Recurrent-Unit neural network


Languages

Language:Python 100.0%