Targeted universal adversarial perturbation

This repository contains Keras implementation of our simple iterative method for generating a targeted universal adversarial perturbation (UAP), which causes deep natural networks to classify most input images into a specific class, as described in the following paper:

Hirano H and Takemoto K (2020) Simple iterative method for generating targeted universal adversarial perturbations. Algorithms 13, 268 (2020). arXiv:1911.06502

Our method is also available in Adversarial Robustness Toolbox, a Python library for machine learning security.

In this repository, we used the VGG-20 model for the CIFAR-10 dataset obtained from a GitHub repository GuanqiaoDing/CNN-CIFAR10

Usage

Install the targeted UAP method.

pip install git+https://github.com/hkthirano/adversarial-robustness-toolbox

Generate a targeted UAP.

python generate_noise.py

# === Targeted UAP ===
# norm2: 4.8 %
# targeted_success_rate_train: 79.4 %
# targeted_success_rate_test: 79.0 %
# === Random Noise ===
# norm2_rand: 4.8 %
# targeted_success_rate_train_rand: 9.7 %
# targeted_success_rate_test_rand: 9.7 %

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
model		model
LICENSE		LICENSE
README.md		README.md
cifar10_example.jpg		cifar10_example.jpg
generate_noise.py		generate_noise.py
noise.npy		noise.npy
vgg_model.py		vgg_model.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Targeted universal adversarial perturbation

Usage

About

Releases

Packages

Contributors 2

Languages

License

hkthirano/targeted_UAP_CIFAR10

Folders and files

Latest commit

History

Repository files navigation

Targeted universal adversarial perturbation

Usage

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages