We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.SD

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: Remarks on Optimal Scores for Speaker Recognition

Authors: Dong Wang
Abstract: In this article, we first establish the theory of optimal scores for speaker recognition. Our analysis shows that the minimum Bayes risk (MBR) decisions for both the speaker identification and speaker verification tasks can be based on a normalized likelihood (NL). When the underlying generative model is a linear Gaussian, the NL score is mathematically equivalent to the PLDA likelihood ratio, and the empirical scores based on cosine distance and Euclidean distance can be seen as approximations of this linear Gaussian NL score under some conditions. We discuss a number of properties of the NL score and perform a simple simulation experiment to demonstrate the properties of the NL score.
Comments: 17 pages, 8 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multimedia (cs.MM); Sound (cs.SD); Machine Learning (stat.ML)
Cite as: arXiv:2010.04862 [cs.LG]
  (or arXiv:2010.04862v2 [cs.LG] for this version)

Submission history

From: Dong Wang [view email]
[v1] Sat, 10 Oct 2020 01:28:24 GMT (3832kb,D)
[v2] Fri, 30 Oct 2020 03:33:49 GMT (4006kb,D)

Link back to: arXiv, form interface, contact.