Autoregressive Perturbations for Data Poisoning

Sandoval-Segura, Pedro; Singla, Vasu; Geiping, Jonas; Goldblum, Micah; Goldstein, Tom; Jacobs, David W.

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2206

Computer Science > Machine Learning

Title: Autoregressive Perturbations for Data Poisoning

Authors: Pedro Sandoval-Segura, Vasu Singla, Jonas Geiping, Micah Goldblum, Tom Goldstein, David W. Jacobs

(Submitted on 8 Jun 2022 (v1), last revised 13 Oct 2022 (this version, v3))

Abstract: The prevalence of data scraping from social media as a means to obtain datasets has led to growing concerns regarding unauthorized use of data. Data poisoning attacks have been proposed as a bulwark against scraping, as they make data "unlearnable" by adding small, imperceptible perturbations. Unfortunately, existing methods require knowledge of both the target architecture and the complete dataset so that a surrogate network can be trained, the parameters of which are used to generate the attack. In this work, we introduce autoregressive (AR) poisoning, a method that can generate poisoned data without access to the broader dataset. The proposed AR perturbations are generic, can be applied across different datasets, and can poison different architectures. Compared to existing unlearnable methods, our AR poisons are more resistant against common defenses such as adversarial training and strong data augmentations. Our analysis further provides insight into what makes an effective data poison.

Comments:	Accepted to NeurIPS 2022. Code available at this https URL
Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR)
Cite as:	arXiv:2206.03693 [cs.LG]
	(or arXiv:2206.03693v3 [cs.LG] for this version)

Submission history

From: Pedro Sandoval-Segura [view email]
[v1] Wed, 8 Jun 2022 06:24:51 GMT (8768kb,D)
[v2] Wed, 15 Jun 2022 06:44:17 GMT (8769kb,D)
[v3] Thu, 13 Oct 2022 20:20:22 GMT (8771kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2206.03693

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Autoregressive Perturbations for Data Poisoning

Submission history