Title: Clustering, factor discovery and optimal transport

Abstract: The clustering problem and, more generally, latent factor discovery (or latent space inference) are formulated in terms of the Wasserstein barycenter problem from optimal transport. The proposed objective is to maximize the variability attributable to class, which is further characterized as minimizing the variance of the Wasserstein barycenter. Existing theory, which constrains the transport maps to rigid translations, is extended to affine transformations. The resulting non-parametric clustering algorithms include k-means as a special case and exhibit more robust performance. A continuous version of these algorithms discovers continuous latent variables and generalizes principal curves. The strength of these algorithms is demonstrated by tests on both artificial and real-world data sets.
Comments: Improved clarity of presentation
Subjects: Optimization and Control (math.OC); Statistics Theory (math.ST)
MSC classes: 62H30, 62H25, 49K30
Cite as: arXiv:1902.10288 [math.OC]
  (or arXiv:1902.10288v3 [math.OC] for this version)
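
The abstract's statement that k-means arises as a special case can be illustrated numerically. The sketch below is not taken from the paper: it assumes that, when transport maps are restricted to rigid translations, the barycenter is obtained by shifting every class so that its mean coincides with the overall mean. Under that assumption, the variance of the pooled, shifted sample equals the k-means objective (the mean squared distance of each point to its own cluster mean). The function names and the synthetic data are hypothetical and chosen only for illustration.

```python
import numpy as np


def translate_to_common_mean(x, labels):
    """Shift each cluster rigidly so that its mean equals the overall mean.

    Assumption (not from the source): this is the barycenter sample when
    transport maps are restricted to translations.
    """
    x = np.asarray(x, dtype=float)
    labels = np.asarray(labels)
    overall_mean = x.mean(axis=0)
    y = x.copy()
    for k in np.unique(labels):
        mask = labels == k
        y[mask] += overall_mean - x[mask].mean(axis=0)
    return y


def total_variance(x):
    """Mean squared distance to the sample mean (trace of the covariance)."""
    x = np.asarray(x, dtype=float)
    return float(((x - x.mean(axis=0)) ** 2).sum(axis=1).mean())


def kmeans_objective(x, labels):
    """Mean squared distance from each point to its own cluster mean."""
    x = np.asarray(x, dtype=float)
    labels = np.asarray(labels)
    sq = 0.0
    for k in np.unique(labels):
        pts = x[labels == k]
        sq += ((pts - pts.mean(axis=0)) ** 2).sum()
    return float(sq / len(x))


# Hypothetical two-cluster data set in the plane.
rng = np.random.default_rng(0)
x = np.vstack([rng.normal(-2.0, 1.0, (50, 2)), rng.normal(3.0, 1.0, (70, 2))])
labels = np.array([0] * 50 + [1] * 70)

barycenter_var = total_variance(translate_to_common_mean(x, labels))
within_var = kmeans_objective(x, labels)
```

Here `barycenter_var` and `within_var` agree up to floating-point error, and both are smaller than `total_variance(x)`; the gap is the between-class variance, that is, the variability attributable to class. Minimizing the barycenter variance over label assignments is therefore, in this translation-only sketch, the same as minimizing the k-means objective.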

Submission history

From: Hongkang Yang
[v1] Wed, 27 Feb 2019 01:04:12 GMT (781kb,D)
[v2] Wed, 30 Oct 2019 20:14:42 GMT (1597kb,D)
[v3] Mon, 21 Sep 2020 13:13:40 GMT (1340kb,D)
