Uncovering Adversarial Risks of Test-Time Adaptation

Wu, Tong; Jia, Feiran; Qi, Xiangyu; Wang, Jiachen T.; Sehwag, Vikash; Mahloujifar, Saeed; Mittal, Prateek

(Submitted on 29 Jan 2023 (v1), last revised 4 Feb 2023 (this version, v2))

Abstract: Recently, test-time adaptation (TTA) has been proposed as a promising solution for addressing distribution shifts. It allows a base model to adapt to an unforeseen distribution during inference by leveraging the information from the batch of (unlabeled) test data. However, we uncover a novel security vulnerability of TTA based on the insight that predictions on benign samples can be impacted by malicious samples in the same batch. To exploit this vulnerability, we propose Distribution Invading Attack (DIA), which injects a small fraction of malicious data into the test batch. DIA causes models using TTA to misclassify benign and unperturbed test data, providing an entirely new capability for adversaries that is infeasible in canonical machine learning pipelines. Through comprehensive evaluations, we demonstrate the high effectiveness of our attack on multiple benchmarks across six TTA methods. In response, we investigate two countermeasures to robustify the existing insecure TTA implementations, following the principle of "security by design". Together, we hope our findings can make the community aware of the utility-security tradeoffs in deploying TTA and provide valuable insights for developing robust TTA approaches.
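
The central insight of the abstract, that predictions on benign samples can be affected by other samples in the same batch, can be illustrated with a toy sketch. It assumes something the abstract does not state: that the adaptation step re-estimates normalization statistics from the current test batch, which is one common way TTA uses batch information. The function name, the one-dimensional feature, and the fixed threshold classifier are hypothetical and chosen only for illustration. This is not the authors' DIA, which crafts its malicious inputs deliberately rather than using an arbitrary outlier.

```python
import numpy as np


def predict_with_batch_stats(batch, eps=1e-5):
    """Toy TTA-style predictor (hypothetical, not from the paper).

    Normalizes a 1-D feature with the mean and variance of the test batch
    itself, then labels a sample 1 if its normalized value is positive.
    """
    batch = np.asarray(batch, dtype=float)
    mean = batch.mean()
    var = batch.var()
    normalized = (batch - mean) / np.sqrt(var + eps)
    return (normalized > 0).astype(int)


# Four benign, unperturbed test samples.
benign = [1.0, 2.0, 3.0, 4.0]
clean_preds = predict_with_batch_stats(benign)

# The same benign samples, with one extreme sample appended to the batch.
# The benign inputs are untouched; only the batch statistics change.
malicious = [100.0]
attacked_preds = predict_with_batch_stats(benign + malicious)
attacked_benign_preds = attacked_preds[: len(benign)]

print("benign-only batch:", clean_preds)            # [0 0 1 1]
print("with injected sample:", attacked_benign_preds)  # [0 0 0 0]
```

In this sketch the last two benign samples change label even though their values are identical in both runs, because the injected sample shifts the batch mean. A model that normalizes with fixed training-time statistics would not show this dependence, which is the contrast the abstract draws with canonical pipelines.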

Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR)
Cite as:	arXiv:2301.12576 [cs.LG]
	(or arXiv:2301.12576v2 [cs.LG] for this version)

Submission history

From: Tong Wu
[v1] Sun, 29 Jan 2023 22:58:05 GMT (6525kb,D)
[v2] Sat, 4 Feb 2023 16:44:31 GMT (6537kb,D)
