Not All Poisons are Created Equal: Robust Training against Data Poisoning (ICML 2022)

Overview

EPIC (Effective Poison Identification) is an efficient defense mechanism that significantly reduces the attack success rate of various data poisoning attacks by iteratively find and drop the isolated points in low-density gradient regions.

Not all poisons are created equal - only effective poisons are responsible for the success of an attack. They are closer to the target in the gradient space.
Effective poisons are isolated in the gradient space and thus can be found and dropped by EPIC iteratively during the training.

Updates

[Aug 6th, 2022] We released the code of EPIC.

[July 20th, 2022] We presented our paper at ICML 2022.

Install requirements

pip install -r requirements.txt

Prepare the Data

Precomputed poisoned datasets that are publicly available can be downloaded with the links below.

Gradient Matching (official repo): eps=8, eps=16
Bullseye Polytope (official repo): CIFAR-10 transfer, CIFAR-10 fine-tuning, TinyImageNet from scratch
Feature Collision (official repo): CIFAR-10 transfer
Sleeper Agent

Usage

See examples.sh for example usages.

Acknowledgements

Some code in this repo comes from the following repositories:

We thank these authors for making their code open-source.

Citation

Please cite our paper if you find the results or our code useful. 🍻

@inproceedings{yang2022not,
  title={Not All Poisons are Created Equal: Robust Training against Data Poisoning},
  author={Yang, Yu and Liu, Tian Yu and Mirzasoleiman, Baharan},
  booktitle={International Conference on Machine Learning},
  pages={25154--25165},
  year={2022},
  organization={PMLR}
}

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
dataset		dataset
models		models
utils		utils
LICENSE		LICENSE
README.md		README.md
effective_poisons.png		effective_poisons.png
examples.sh		examples.sh
requirements.txt		requirements.txt
train_poison_epic.py		train_poison_epic.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Not All Poisons are Created Equal: Robust Training against Data Poisoning (ICML 2022)

Overview

Updates

Install requirements

Prepare the Data

Usage

Acknowledgements

Citation

About

Releases

Packages

Languages

License

BigML-CS-UCLA/EPIC

Folders and files

Latest commit

History

Repository files navigation

Not All Poisons are Created Equal: Robust Training against Data Poisoning (ICML 2022)

Overview

Updates

Install requirements

Prepare the Data

Usage

Acknowledgements

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages