An Initialization Scheme for Meeting Separation with Spatial Mixture Models

Boeddeker, Christoph; Cord-Landwehr, Tobias; von Neumann, Thilo; Haeb-Umbach, Reinhold

Full-text links:

Download:

Computer Science > Sound

Title: An Initialization Scheme for Meeting Separation with Spatial Mixture Models

Authors: Christoph Boeddeker, Tobias Cord-Landwehr, Thilo von Neumann, Reinhold Haeb-Umbach

(Submitted on 4 Apr 2022)

Abstract: Spatial mixture model (SMM) supported acoustic beamforming has been extensively used for the separation of simultaneously active speakers. However, it has hardly been considered for the separation of meeting data, that are characterized by long recordings and only partially overlapping speech. In this contribution, we show that the fact that often only a single speaker is active can be utilized for a clever initialization of an SMM that employs time-varying class priors. In experiments on LibriCSS we show that the proposed initialization scheme achieves a significantly lower Word Error Rate (WER) on a downstream speech recognition task than a random initialization of the class probabilities by drawing from a Dirichlet distribution. With the only requirement that the number of speakers has to be known, we obtain a WER of 5.9 %, which is comparable to the best reported WER on this data set. Furthermore, the estimated speaker activity from the mixture model serves as a diarization based on spatial information.

Comments:	Submitted to INTERSPEECH 2022
Subjects:	Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2204.01338 [cs.SD]
	(or arXiv:2204.01338v1 [cs.SD] for this version)

Submission history

From: Christoph Boeddeker [view email]
[v1] Mon, 4 Apr 2022 09:21:22 GMT (947kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2204.01338

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Sound

Title: An Initialization Scheme for Meeting Separation with Spatial Mixture Models

Submission history