Current browse context:
stat.CO
Change to browse by:
References & Citations
Statistics > Computation
Title: Investigation of Parameter Uncertainty in Clustering Using a Gaussian Mixture Model Via Jackknife, Bootstrap and Weighted Likelihood Bootstrap
(Submitted on 2 Oct 2015 (v1), last revised 22 Jul 2019 (this version, v5))
Abstract: Mixture models are a popular tool in model-based clustering. Such a model is often fitted by a procedure that maximizes the likelihood, such as the EM algorithm. At convergence, the maximum likelihood parameter estimates are typically reported, but in most cases little emphasis is placed on the variability associated with these estimates. In part this may be due to the fact that standard errors are not directly calculated in the model-fitting algorithm, either because they are not required to fit the model, or because they are difficult to compute. The examination of standard errors in model-based clustering is therefore typically neglected. The widely used R package mclust has recently introduced bootstrap and weighted likelihood bootstrap methods to facilitate standard error estimation. This paper provides an empirical comparison of these methods (along with the jackknife method) for producing standard errors and confidence intervals for mixture parameters. These methods are illustrated and contrasted in both a simulation study and in the traditional Old Faithful data set and Thyroid data set.
Submission history
From: Adrian O Hagan Dr [view email][v1] Fri, 2 Oct 2015 10:26:57 GMT (561kb)
[v2] Wed, 7 Feb 2018 11:18:50 GMT (1220kb)
[v3] Thu, 8 Feb 2018 16:21:19 GMT (1220kb)
[v4] Wed, 28 Feb 2018 15:53:16 GMT (1227kb)
[v5] Mon, 22 Jul 2019 13:23:43 GMT (1171kb)
Link back to: arXiv, form interface, contact.