Real-time, Universal, and Robust Adversarial Attacks Against Speaker Recognition Systems

Xie, Yi; Shi, Cong; Li, Zhuohang; Liu, Jian; Chen, Yingying; Yuan, Bo

Full-text links:

Download:

Current browse context:

eess.AS

< prev | next >

new | recent | 2003

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: Real-time, Universal, and Robust Adversarial Attacks Against Speaker Recognition Systems

Authors: Yi Xie, Cong Shi, Zhuohang Li, Jian Liu, Yingying Chen, Bo Yuan

(Submitted on 4 Mar 2020 (v1), last revised 1 May 2020 (this version, v2))

Abstract: As the popularity of voice user interface (VUI) exploded in recent years, speaker recognition system has emerged as an important medium of identifying a speaker in many security-required applications and services. In this paper, we propose the first real-time, universal, and robust adversarial attack against the state-of-the-art deep neural network (DNN) based speaker recognition system. Through adding an audio-agnostic universal perturbation on arbitrary enrolled speaker's voice input, the DNN-based speaker recognition system would identify the speaker as any target (i.e., adversary-desired) speaker label. In addition, we improve the robustness of our attack by modeling the sound distortions caused by the physical over-the-air propagation through estimating room impulse response (RIR). Experiment using a public dataset of 109 English speakers demonstrates the effectiveness and robustness of our proposed attack with a high attack success rate of over 90%. The attack launching time also achieves a 100X speedup over contemporary non-universal attacks.

Comments:	Published as a conference paper at ICASSP 2020
Subjects:	Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
Cite as:	arXiv:2003.02301 [eess.AS]
	(or arXiv:2003.02301v2 [eess.AS] for this version)

Submission history

From: Yi Xie [view email]
[v1] Wed, 4 Mar 2020 19:30:15 GMT (370kb,D)
[v2] Fri, 1 May 2020 02:33:22 GMT (370kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> eess > arXiv:2003.02301

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Electrical Engineering and Systems Science > Audio and Speech Processing

Title: Real-time, Universal, and Robust Adversarial Attacks Against Speaker Recognition Systems

Submission history