Current browse context:
stat.ML
Change to browse by:
References & Citations
Statistics > Machine Learning
Title: Black-box Certification and Learning under Adversarial Perturbations
(Submitted on 30 Jun 2020 (this version), latest version 22 Feb 2022 (v2))
Abstract: We formally study the problem of classification under adversarial perturbations, both from the learner's perspective, and from the viewpoint of a third-party who aims at certifying the robustness of a given black-box classifier. We analyze a PAC-type framework of semi-supervised learning and identify possibility and impossibility results for proper learning of VC-classes in this setting. We further introduce and study a new setting of black-box certification under limited query budget. We analyze this for various classes of predictors and types of perturbation. We also consider the viewpoint of a black-box adversary that aims at finding adversarial examples, showing that the existence of an adversary with polynomial query complexity implies the existence of a robust learner with small sample complexity.
Submission history
From: Vinayak Pathak [view email][v1] Tue, 30 Jun 2020 04:12:59 GMT (60kb)
[v2] Tue, 22 Feb 2022 15:38:06 GMT (57kb)
Link back to: arXiv, form interface, contact.