Statistics > Machine Learning
Title: MAP Clustering under the Gaussian Mixture Model via Mixed Integer Nonlinear Optimization
(Submitted on 8 Nov 2019 (v1), last revised 17 Mar 2020 (this version, v2))
Abstract: We present a global optimization approach for solving the maximum a posteriori (MAP) clustering problem under the Gaussian mixture model. Our approach can accommodate side constraints, and it preserves the combinatorial structure of the MAP clustering problem by formulating it as a mixed-integer nonlinear optimization problem (MINLP). We approximate the MINLP through a mixed-integer quadratic program (MIQP) transformation that improves computational aspects while guaranteeing $\epsilon$-global optimality. An important benefit of our approach is the explicit quantification of the degree of suboptimality, via the optimality gap, en route to finding the globally optimal MAP clustering. Numerical experiments comparing our method to other approaches show that our method finds a better solution than standard clustering methods. Finally, we cluster a real breast cancer gene expression data set incorporating intrinsic subtype information; the induced constraints substantially improve the computational performance and produce more coherent and biologically meaningful clusters.
Submission history
From: Patrick Flaherty
[v1] Fri, 8 Nov 2019 15:53:26 GMT (124kb,D)
[v2] Tue, 17 Mar 2020 02:51:11 GMT (969kb,D)