SlothSpeech: Denial-of-service Attack Against Speech Recognition Models

Haque, Mirazul; Shah, Rutvij; Chen, Simin; Şişman, Berrak; Liu, Cong; Yang, Wei

(Submitted on 1 Jun 2023)

Abstract: Deep learning (DL) models are now widely used for speech-related tasks, including automatic speech recognition (ASR). Because ASR is deployed in real-time scenarios, it is important that an ASR model remains efficient under minor perturbations to its input; evaluating the efficiency robustness of ASR models is therefore a pressing need. We show that popular ASR models such as Speech2Text and Whisper perform an amount of computation that depends on the input, so their efficiency varies from input to input. In this work, we propose SlothSpeech, a denial-of-service attack against ASR models that exploits this dynamic behaviour. SlothSpeech uses the probability distribution of the output text tokens to generate perturbations to the audio such that the efficiency of the ASR model decreases. We find that SlothSpeech-generated inputs can increase latency to as much as 40 times the latency induced by benign inputs.
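The input-dependent computation mentioned in the abstract can be illustrated with a toy model of autoregressive decoding, in which the decoder keeps emitting tokens until the end-of-sequence (EOS) token wins or a step cap is reached. The sketch below is not the authors' implementation: the function `decode_steps`, the two-token vocabulary (EOS versus everything else), and all probability values are hypothetical and chosen only to show why a lower EOS probability means more decoder steps.

```python
def decode_steps(eos_probs, max_steps):
    """Count decoder steps in a toy greedy decoder.

    eos_probs[i] is a made-up probability of the EOS token at step i.
    With a two-token vocabulary (EOS vs. non-EOS), greedy decoding
    stops at the first step where the EOS probability exceeds 0.5.
    """
    steps = 0
    for p_eos in eos_probs[:max_steps]:
        steps += 1
        if p_eos > 0.5:
            break  # EOS selected: decoding ends here
    return steps


MAX_STEPS = 50  # hypothetical cap on output length

# Hypothetical benign input: EOS becomes likely after a few tokens.
benign = [0.05, 0.10, 0.20, 0.40, 0.90] + [0.95] * 45

# Hypothetical input for which EOS stays unlikely at every step,
# so decoding only stops at the cap.
suppressed = [0.05] * MAX_STEPS

print(decode_steps(benign, MAX_STEPS))
print(decode_steps(suppressed, MAX_STEPS))
```

In this toy setting the work done per input is proportional to the number of decoder steps, which is why the step count serves here as a rough stand-in for latency; real models add encoder cost and other overheads that the sketch ignores.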

Subjects:	Sound (cs.SD); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2306.00794 [cs.SD]
	(or arXiv:2306.00794v1 [cs.SD] for this version)

Submission history

From: Mirazul Haque
[v1] Thu, 1 Jun 2023 15:25:14 GMT (408kb,D)
