Computer Science > Sound
Title: Implicit Acoustic Echo Cancellation for Keyword Spotting and Device-Directed Speech Detection
(Submitted on 20 Nov 2021 (v1), last revised 4 Oct 2022 (this version, v4))
Abstract: In many speech-enabled human-machine interaction scenarios, user speech can overlap with the device playback audio. In these instances, the performance of tasks such as keyword spotting (KWS) and device-directed speech detection (DDD) can degrade significantly. To address this problem, we propose an implicit acoustic echo cancellation (iAEC) framework where a neural network is trained to exploit the additional information from a reference microphone channel to learn to ignore the interfering signal and improve detection performance. We study this framework for the tasks of KWS and DDD on, respectively, an augmented version of Google Speech Commands v2 and a real-world Alexa device dataset. Notably, we show a 56% reduction in false-reject rate for the DDD task during device playback conditions. We also show comparable or superior performance over a strong end-to-end neural echo cancellation + KWS baseline for the KWS task, with an order of magnitude lower computational requirements.
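As a rough illustration of what "exploiting a reference channel" could look like at the input of a detector, the sketch below stacks microphone features and reference features into a two-channel array. The abstract does not say how the channels are combined, so the stacking scheme, the function name `stack_mic_and_reference`, and the feature shapes are all assumptions made for illustration, not the authors' method.

```python
import numpy as np


def stack_mic_and_reference(mic_feats: np.ndarray, ref_feats: np.ndarray) -> np.ndarray:
    """Stack two (frames, bins) feature matrices into a (2, frames, bins) array.

    Hypothetical helper: channel 0 holds the microphone features and
    channel 1 holds the reference features, so a downstream network
    sees both signals for every time-frequency cell.
    """
    if mic_feats.shape != ref_feats.shape:
        raise ValueError(
            f"shape mismatch: mic {mic_feats.shape} vs reference {ref_feats.shape}"
        )
    # New leading axis indexes the channel (0 = microphone, 1 = reference).
    return np.stack([mic_feats, ref_feats], axis=0)
```

A detector trained on such an input can, in principle, learn to discount energy that also appears in the reference channel, without a separate echo-cancellation stage in front of it.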
Submission history
From: Thomas Balestri
[v1] Sat, 20 Nov 2021 17:21:16 GMT (447kb,D)
[v2] Wed, 9 Feb 2022 11:13:33 GMT (521kb,D)
[v3] Mon, 21 Mar 2022 14:01:09 GMT (120kb,D)
[v4] Tue, 4 Oct 2022 15:34:10 GMT (111kb,D)