On Robustness of Kernel Clustering

Yan, Bowei; Sarkar, Purnamrita

Full-text links:

Download:

Current browse context:

stat.ML

< prev | next >

new | recent | 1606

Statistics > Machine Learning

Title: On Robustness of Kernel Clustering

Authors: Bowei Yan, Purnamrita Sarkar

(Submitted on 6 Jun 2016 (v1), last revised 2 Dec 2016 (this version, v3))

Abstract: Clustering is one of the most important unsupervised problems in machine learning and statistics. Among many existing algorithms, kernel k-means has drawn much research attention due to its ability to find non-linear cluster boundaries and its inherent simplicity. There are two main approaches for kernel k-means: SVD of the kernel matrix and convex relaxations. Despite the attention kernel clustering has received both from theoretical and applied quarters, not much is known about robustness of the methods. In this paper we first introduce a semidefinite programming relaxation for the kernel clustering problem, then prove that under a suitable model specification, both the K-SVD and SDP approaches are consistent in the limit, albeit SDP is strongly consistent, i.e. achieves exact recovery, whereas K-SVD is weakly consistent, i.e. the fraction of misclassified nodes vanish.

Comments:	20 pages, 3 figures
Subjects:	Machine Learning (stat.ML)
Cite as:	arXiv:1606.01869 [stat.ML]
	(or arXiv:1606.01869v3 [stat.ML] for this version)

Submission history

From: Bowei Yan [view email]
[v1] Mon, 6 Jun 2016 19:26:23 GMT (117kb,D)
[v2] Tue, 18 Oct 2016 17:15:38 GMT (106kb,D)
[v3] Fri, 2 Dec 2016 01:12:08 GMT (105kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> stat > arXiv:1606.01869

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Statistics > Machine Learning

Title: On Robustness of Kernel Clustering

Submission history