Phonetic Feedback for Speech Enhancement With and Without Parallel Speech Data

Plantinga, Peter; Bagchi, Deblin; Fosler-Lussier, Eric

Electrical Engineering and Systems Science > Audio and Speech Processing

(Submitted on 3 Mar 2020)

Abstract: While deep learning systems have gained significant ground in speech enhancement research, they have yet to exploit the full potential of deep learning to provide high-level feedback. In particular, phonetic feedback is rare in speech enhancement research even though it carries valuable top-down information. We use the technique of mimic loss to provide phonetic feedback to an off-the-shelf enhancement system, and find gains in objective intelligibility scores on CHiME-4 data. This technique uses a frozen acoustic model trained on clean speech to provide feedback to the enhancement model, even when no parallel speech data is available. Our work is one of the first to show intelligibility improvement for neural enhancement systems without parallel speech data, and we show that phonetic feedback can improve a state-of-the-art neural enhancement system trained with parallel speech data.
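
As a rough illustration of the idea for the parallel-data case, the sketch below compares the outputs of a frozen acoustic model on enhanced and clean frames and uses the distance between them as a training signal. Everything here is an assumption made for illustration and is not taken from the paper: the toy linear-plus-softmax "acoustic model", the squared-error distance, and the names frozen_acoustic_model, mimic_loss, and PHONE_WEIGHTS are all hypothetical.

```python
import math

# Hypothetical frozen weights: 2 phone classes x 2 input features.
# In practice these would come from an acoustic model trained on clean
# speech and would not be updated while training the enhancer.
PHONE_WEIGHTS = [[1.0, 0.0], [0.0, 1.0]]


def frozen_acoustic_model(frame, weights=PHONE_WEIGHTS):
    """Toy stand-in for a frozen acoustic model: linear layer + softmax."""
    logits = [sum(w * x for w, x in zip(row, frame)) for row in weights]
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(value - peak) for value in logits]
    total = sum(exps)
    return [e / total for e in exps]


def mimic_loss(enhanced_frames, clean_frames, weights=PHONE_WEIGHTS):
    """Mean squared difference between the frozen model's outputs on
    enhanced frames and on the matching (parallel) clean frames."""
    total, count = 0.0, 0
    for enhanced, clean in zip(enhanced_frames, clean_frames):
        p_enhanced = frozen_acoustic_model(enhanced, weights)
        p_clean = frozen_acoustic_model(clean, weights)
        total += sum((a - b) ** 2 for a, b in zip(p_enhanced, p_clean))
        count += len(p_clean)
    return total / count


if __name__ == "__main__":
    clean = [[1.0, 0.0], [0.0, 1.0]]
    enhanced = [[0.8, 0.1], [0.2, 0.7]]
    print("mimic loss:", mimic_loss(enhanced, clean))
```

In a real system the enhancement model would be trained by backpropagating this loss through the frozen acoustic model, typically alongside a conventional signal-level loss; the plain-Python version above only shows how the feedback value itself could be computed.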

Comments:	4 pages + 1 page for references, accepted to ICASSP 2020
Subjects:	Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
Cite as:	arXiv:2003.01769 [eess.AS]
	(or arXiv:2003.01769v1 [eess.AS] for this version)

Submission history

From: Peter Plantinga
[v1] Tue, 3 Mar 2020 20:06:24 GMT (55kb,D)
