Current browse context:
stat.ME
Change to browse by:
References & Citations
Statistics > Methodology
Title: Mixtures of Generalized Hyperbolic Distributions and Mixtures of Skew-t Distributions for Model-Based Clustering with Incomplete Data
(Submitted on 7 Mar 2017 (v1), last revised 19 Aug 2018 (this version, v5))
Abstract: Robust clustering from incomplete data is an important topic because, in many practical situations, real data sets are heavy-tailed, asymmetric, and/or have arbitrary patterns of missing observations. Flexible methods and algorithms for model-based clustering are presented via mixture of the generalized hyperbolic distributions and its limiting case, the mixture of multivariate skew-t distributions. An analytically feasible EM algorithm is formulated for parameter estimation and imputation of missing values for mixture models employing missing at random mechanisms. The proposed methodologies are investigated through a simulation study with varying proportions of synthetic missing values and illustrated using a real dataset. Comparisons are made with those obtained from the traditional mixture of generalized hyperbolic distribution counterparts by filling in the missing data using the mean imputation method.
Submission history
From: Paul McNicholas [view email][v1] Tue, 7 Mar 2017 02:14:38 GMT (104kb,D)
[v2] Sat, 25 Mar 2017 17:47:33 GMT (106kb,D)
[v3] Wed, 20 Dec 2017 18:37:31 GMT (165kb,D)
[v4] Fri, 27 Apr 2018 18:06:09 GMT (169kb,D)
[v5] Sun, 19 Aug 2018 21:50:15 GMT (193kb,D)
Link back to: arXiv, form interface, contact.