Title: Spectral Clustering using PCKID - A Probabilistic Cluster Kernel for Incomplete Data

Abstract: In this paper, we propose PCKID, a novel, robust kernel function for spectral clustering, specifically designed to handle incomplete data. By combining posterior distributions of Gaussian Mixture Models for incomplete data on different scales, we are able to learn a kernel for incomplete data that does not depend on any critical hyperparameters, unlike the commonly used RBF kernel. To evaluate our method, we perform experiments on two real datasets. PCKID outperforms the baseline methods for all fractions of missing values, in some cases by up to 25 percentage points.
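
The abstract does not say how the posterior distributions are combined. As a rough illustration only, the sketch below assumes one common choice for cluster kernels: sum, over all fitted mixture models, the inner product of the posterior vectors of each pair of points. The function name `cluster_kernel` and the toy posteriors are hypothetical. Fitting the Gaussian Mixture Models to incomplete data is not shown, so the posteriors are taken as given.

```python
def cluster_kernel(posteriors):
    """Combine posteriors from several mixture models into one kernel matrix.

    posteriors[m][i][k] is the posterior probability that point i belongs to
    component k of model m. Models may have different numbers of components
    (different "scales"). ASSUMPTION: the kernel is the sum of inner products
    of posterior vectors; the abstract does not state the exact rule.
    """
    n = len(posteriors[0])
    K = [[0.0] * n for _ in range(n)]
    for gamma in posteriors:
        for i in range(n):
            for j in range(i, n):
                # Similarity of points i and j under this one model.
                s = sum(a * b for a, b in zip(gamma[i], gamma[j]))
                K[i][j] += s
                if i != j:
                    K[j][i] += s  # keep the matrix symmetric
    return K


# Toy input: three points, one coarse model (2 components) and one fine
# model (3 components). These numbers are made up for illustration.
coarse = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
fine = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
K = cluster_kernel([coarse, fine])
```

In this toy case points 0 and 1 share a component only in the coarse model, so their kernel value is lower than each point's self-similarity, while point 2 shares no component with either. The resulting matrix could then be passed to a spectral clustering routine as a precomputed affinity.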
Subjects: Machine Learning (stat.ML)
Cite as: arXiv:1702.07190 [stat.ML]
  (or arXiv:1702.07190v1 [stat.ML] for this version)

Submission history

From: Sigurd Løkse
[v1] Thu, 23 Feb 2017 12:19:31 GMT (265kb)