Current browse context:
cs.LG
Change to browse by:
References & Citations
Computer Science > Machine Learning
Title: Remarks on Optimal Scores for Speaker Recognition
(Submitted on 10 Oct 2020 (v1), last revised 30 Oct 2020 (this version, v2))
Abstract: In this article, we first establish the theory of optimal scores for speaker recognition. Our analysis shows that the minimum Bayes risk (MBR) decisions for both the speaker identification and speaker verification tasks can be based on a normalized likelihood (NL). When the underlying generative model is a linear Gaussian, the NL score is mathematically equivalent to the PLDA likelihood ratio, and the empirical scores based on cosine distance and Euclidean distance can be seen as approximations of this linear Gaussian NL score under some conditions. We discuss a number of properties of the NL score and perform a simple simulation experiment to demonstrate the properties of the NL score.
Submission history
From: Dong Wang [view email][v1] Sat, 10 Oct 2020 01:28:24 GMT (3832kb,D)
[v2] Fri, 30 Oct 2020 03:33:49 GMT (4006kb,D)
Link back to: arXiv, form interface, contact.