Who is Real Bob? Adversarial Attacks on Speaker Recognition Systems

Chen, Guangke; Chen, Sen; Fan, Lingling; Du, Xiaoning; Zhao, Zhe; Song, Fu; Liu, Yang

Full-text links:

Download:

Current browse context:

eess.AS

< prev | next >

new | recent | 1911

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: Who is Real Bob? Adversarial Attacks on Speaker Recognition Systems

Authors: Guangke Chen, Sen Chen, Lingling Fan, Xiaoning Du, Zhe Zhao, Fu Song, Yang Liu

(Submitted on 3 Nov 2019 (v1), last revised 24 Apr 2020 (this version, v2))

Abstract: Speaker recognition (SR) is widely used in our daily life as a biometric authentication or identification mechanism. The popularity of SR brings in serious security concerns, as demonstrated by recent adversarial attacks. However, the impacts of such threats in the practical black-box setting are still open, since current attacks consider the white-box setting only. In this paper, we conduct the first comprehensive and systematic study of the adversarial attacks on SR systems (SRSs) to understand their security weakness in the practical blackbox setting. For this purpose, we propose an adversarial attack, named FAKEBOB, to craft adversarial samples. Specifically, we formulate the adversarial sample generation as an optimization problem, incorporated with the confidence of adversarial samples and maximal distortion to balance between the strength and imperceptibility of adversarial voices. One key contribution is to propose a novel algorithm to estimate the score threshold, a feature in SRSs, and use it in the optimization problem to solve the optimization problem. We demonstrate that FAKEBOB achieves 99% targeted attack success rate on both open-source and commercial systems. We further demonstrate that FAKEBOB is also effective on both open-source and commercial systems when playing over the air in the physical world. Moreover, we have conducted a human study which reveals that it is hard for human to differentiate the speakers of the original and adversarial voices. Last but not least, we show that four promising defense methods for adversarial attack from the speech recognition domain become ineffective on SRSs against FAKEBOB, which calls for more effective defense methods. We highlight that our study peeks into the security implications of adversarial attacks on SRSs, and realistically fosters to improve the security robustness of SRSs.

Comments:	IEEE Symposium on Security and Privacy 2021
Subjects:	Audio and Speech Processing (eess.AS); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Multimedia (cs.MM); Sound (cs.SD)
Cite as:	arXiv:1911.01840 [eess.AS]
	(or arXiv:1911.01840v2 [eess.AS] for this version)

Submission history

From: Fu Song [view email]
[v1] Sun, 3 Nov 2019 16:50:13 GMT (1289kb,D)
[v2] Fri, 24 Apr 2020 02:10:01 GMT (297kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> eess > arXiv:1911.01840

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: Who is Real Bob? Adversarial Attacks on Speaker Recognition Systems

Submission history