Title: Size Matters: Cardinality-Constrained Clustering and Outlier Detection via Conic Optimization

Abstract: Plain vanilla K-means clustering has proven to be successful in practice, yet it suffers from outlier sensitivity and may produce highly unbalanced clusters. To mitigate both shortcomings, we formulate a joint outlier detection and clustering problem, which assigns a prescribed number of datapoints to an auxiliary outlier cluster and performs cardinality-constrained K-means clustering on the residual dataset, treating the cluster cardinalities as a given input. We cast this problem as a mixed-integer linear program (MILP) that admits tractable semidefinite and linear programming relaxations. We propose deterministic rounding schemes that transform the relaxed solutions to feasible solutions for the MILP. We also prove that these solutions are optimal in the MILP if a cluster separation condition holds.
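To make the problem statement concrete, here is a minimal sketch of the joint outlier detection and cardinality-constrained clustering problem, solved by exhaustive enumeration on a toy dataset. This is not the MILP, the conic relaxations, or the rounding schemes of the paper; it only illustrates the objective under stated assumptions. The function names (`within_cluster_cost`, `brute_force`), the toy data, and the convention that points in the outlier cluster contribute no cost are assumptions made for illustration.

```python
from itertools import combinations


def within_cluster_cost(cluster):
    """Sum of squared Euclidean distances from each point to the cluster mean."""
    if not cluster:
        return 0.0
    dim = len(cluster[0])
    mean = [sum(p[d] for p in cluster) / len(cluster) for d in range(dim)]
    return sum((p[d] - mean[d]) ** 2 for p in cluster for d in range(dim))


def brute_force(points, sizes, n_outliers):
    """Try every assignment with the prescribed cluster sizes and outlier count.

    Exponential in the number of points, so only usable on tiny examples.
    Assumption: points left in the outlier cluster contribute zero cost.
    """
    assert sum(sizes) + n_outliers == len(points)
    best = {"cost": float("inf"), "clusters": None, "outliers": None}

    def recurse(remaining, k, chosen, cost):
        if k == len(sizes):
            # Whatever is left over forms the auxiliary outlier cluster.
            if cost < best["cost"]:
                best.update(cost=cost, clusters=list(chosen),
                            outliers=sorted(remaining))
            return
        for members in combinations(sorted(remaining), sizes[k]):
            c = within_cluster_cost([points[i] for i in members])
            recurse(remaining - set(members), k + 1, chosen + [members], cost + c)

    recurse(set(range(len(points))), 0, [], 0.0)
    return best["cost"], best["clusters"], best["outliers"]


# Toy data: two tight groups plus one far-away point.
points = [(0, 0), (0, 1), (5, 5), (5, 6), (6, 5), (100, 100)]
cost, clusters, outliers = brute_force(points, sizes=(2, 3), n_outliers=1)
```

On this toy dataset the enumeration places the far-away point in the outlier cluster and recovers the two tight groups with the prescribed sizes 2 and 3. The exponential cost of this enumeration is what motivates tractable relaxations of the kind the abstract describes.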
Subjects: Optimization and Control (math.OC); Machine Learning (stat.ML)
MSC classes: 90C22, 90C05, 62H30
Cite as: arXiv:1705.07837 [math.OC]
  (or arXiv:1705.07837v3 [math.OC] for this version)

Submission history

From: Napat Rujeerapaiboon
[v1] Mon, 22 May 2017 16:32:23 GMT (303kb,D)
[v2] Thu, 5 Oct 2017 11:08:49 GMT (325kb,D)
[v3] Thu, 10 Jan 2019 12:17:00 GMT (159kb,D)
