Mathematics > Optimization and Control
Title: Size Matters: Cardinality-Constrained Clustering and Outlier Detection via Conic Optimization
(Submitted on 22 May 2017 (v1), last revised 10 Jan 2019 (this version, v3))
Abstract: Plain vanilla K-means clustering has proven to be successful in practice, yet it suffers from outlier sensitivity and may produce highly unbalanced clusters. To mitigate both shortcomings, we formulate a joint outlier detection and clustering problem, which assigns a prescribed number of datapoints to an auxiliary outlier cluster and performs cardinality-constrained K-means clustering on the residual dataset, treating the cluster cardinalities as a given input. We cast this problem as a mixed-integer linear program (MILP) that admits tractable semidefinite and linear programming relaxations. We propose deterministic rounding schemes that transform the relaxed solutions to feasible solutions for the MILP. We also prove that these solutions are optimal in the MILP if a cluster separation condition holds.
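To make the problem statement concrete, the following is a minimal sketch of the joint outlier detection and cardinality-constrained clustering problem on a toy dataset. It is not the paper's method: it enumerates every feasible assignment exhaustively instead of solving the MILP or its semidefinite and linear programming relaxations, so it is usable only for a handful of points. The function names, the toy data, and the reading that outliers contribute nothing to the K-means objective are assumptions made for illustration.

```python
from itertools import combinations

def cluster_cost(cluster):
    """Sum of squared Euclidean distances from each point to the cluster mean."""
    if not cluster:
        return 0.0
    n, dim = len(cluster), len(cluster[0])
    mean = [sum(p[d] for p in cluster) / n for d in range(dim)]
    return sum((p[d] - mean[d]) ** 2 for p in cluster for d in range(dim))

def brute_force(points, sizes, n_outliers):
    """Try every assignment with the prescribed cluster sizes (hypothetical helper).

    sizes[k] is the required cardinality of cluster k; the n_outliers points
    left over form the auxiliary outlier cluster and are assumed to add no cost.
    """
    assert sum(sizes) + n_outliers == len(points)
    best = [float("inf"), None, None]  # cost, clusters, outliers

    def recurse(remaining, k, clusters, cost):
        if k == len(sizes):
            # Every regular cluster is filled; the rest are outliers.
            if cost < best[0]:
                best[:] = [cost, [list(c) for c in clusters], list(remaining)]
            return
        for chosen in combinations(remaining, sizes[k]):
            rest = [i for i in remaining if i not in chosen]
            extra = cluster_cost([points[i] for i in chosen])
            recurse(rest, k + 1, clusters + [chosen], cost + extra)

    recurse(list(range(len(points))), 0, [], 0.0)
    return tuple(best)

# Toy data: two tight groups of three points and one distant point.
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10), (50, 50)]
cost, clusters, outliers = brute_force(points, sizes=[3, 3], n_outliers=1)
print(cost, clusters, outliers)
```

On this toy input the search places the distant point in the outlier cluster and splits the remaining six points into the two tight groups. The number of feasible assignments grows combinatorially with the dataset size, which is why tractable relaxations with rounding schemes, as the abstract proposes, are of interest.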
Submission history
From: Napat Rujeerapaiboon
[v1] Mon, 22 May 2017 16:32:23 GMT (303kb,D)
[v2] Thu, 5 Oct 2017 11:08:49 GMT (325kb,D)
[v3] Thu, 10 Jan 2019 12:17:00 GMT (159kb,D)