We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.SD

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Sound

Title: Implicit Acoustic Echo Cancellation for Keyword Spotting and Device-Directed Speech Detection

Abstract: In many speech-enabled human-machine interaction scenarios, user speech can overlap with the device playback audio. In these instances, the performance of tasks such as keyword-spotting (KWS) and device-directed speech detection (DDD) can degrade significantly. To address this problem, we propose an implicit acoustic echo cancellation (iAEC) framework where a neural network is trained to exploit the additional information from a reference microphone channel to learn to ignore the interfering signal and improve detection performance. We study this framework for the tasks of KWS and DDD on, respectively, an augmented version of Google Speech Commands v2 and a real-world Alexa device dataset. Notably, we show a 56% reduction in false-reject rate for the DDD task during device playback conditions. We also show comparable or superior performance over a strong end-to-end neural echo cancellation + KWS baseline for the KWS task with an order of magnitude less computational requirements.
Comments: To be presented at SLT 2022
Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
Cite as: arXiv:2111.10639 [cs.SD]
  (or arXiv:2111.10639v4 [cs.SD] for this version)

Submission history

From: Thomas Balestri [view email]
[v1] Sat, 20 Nov 2021 17:21:16 GMT (447kb,D)
[v2] Wed, 9 Feb 2022 11:13:33 GMT (521kb,D)
[v3] Mon, 21 Mar 2022 14:01:09 GMT (120kb,D)
[v4] Tue, 4 Oct 2022 15:34:10 GMT (111kb,D)

Link back to: arXiv, form interface, contact.