A neural network-supported two-stage algorithm for lightweight dereverberation on hearing devices

Lemercier, Jean-Marie; Thiemann, Joachim; Koning, Raphael; Gerkmann, Timo

doi:10.1186/s13636-023-00285-8

(Submitted on 6 Apr 2022 (v1), last revised 31 May 2023 (this version, v2))

Abstract: This paper presents a two-stage lightweight online dereverberation algorithm for hearing devices. The approach combines a multi-channel multi-frame linear filter with a single-channel single-frame post-filter. Both components rely on power spectral density (PSD) estimates provided by deep neural networks (DNNs). By deriving new metrics that analyze dereverberation performance in various time ranges, we confirm that directly optimizing a criterion at the output of the multi-channel linear filtering stage yields more efficient dereverberation than placing the criterion at the output of the DNN to optimize the PSD estimation. More concretely, we show that training this stage end-to-end helps further remove the reverberation in the range accessible to the filter, thus increasing the early-to-moderate reverberation ratio. We argue and demonstrate that it can then be well combined with a post-filtering stage to efficiently suppress the residual late reverberation, thereby increasing the early-to-final reverberation ratio. The proposed two-stage procedure is shown to be very effective in terms of both dereverberation performance and computational demands compared with, e.g., recent state-of-the-art DNN approaches. Furthermore, the proposed two-stage system can be adapted to the needs of different types of hearing-device users by controlling the amount of reduction of early reflections.

Comments:	Accepted for publication in EURASIP Journal on Audio, Speech and Music Processing
Subjects:	Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
DOI:	10.1186/s13636-023-00285-8
Cite as:	arXiv:2204.02978 [eess.AS]
	(or arXiv:2204.02978v2 [eess.AS] for this version)

Submission history

From: Jean-Marie Lemercier
[v1] Wed, 6 Apr 2022 11:08:28 GMT (583kb,D)
[v2] Wed, 31 May 2023 15:34:46 GMT (2727kb)
