Current browse context:
stat.ML
Change to browse by:
References & Citations
Statistics > Machine Learning
Title: Linear Optimal Transport Embedding: Provable Wasserstein classification for certain rigid transformations and perturbations
(Submitted on 20 Aug 2020 (v1), last revised 26 May 2021 (this version, v3))
Abstract: Discriminating between distributions is an important problem in a number of scientific fields. This motivated the introduction of Linear Optimal Transportation (LOT), which embeds the space of distributions into an $L^2$-space. The transform is defined by computing the optimal transport of each distribution to a fixed reference distribution, and has a number of benefits when it comes to speed of computation and to determining classification boundaries. In this paper, we characterize a number of settings in which LOT embeds families of distributions into a space in which they are linearly separable. This is true in arbitrary dimension, and for families of distributions generated through perturbations of shifts and scalings of a fixed distribution.We also prove conditions under which the $L^2$ distance of the LOT embedding between two distributions in arbitrary dimension is nearly isometric to Wasserstein-2 distance between those distributions. This is of significant computational benefit, as one must only compute $N$ optimal transport maps to define the $N^2$ pairwise distances between $N$ distributions. We demonstrate the benefits of LOT on a number of distribution classification problems.
Submission history
From: Caroline Moosmüller [view email][v1] Thu, 20 Aug 2020 19:09:33 GMT (101kb,D)
[v2] Wed, 7 Oct 2020 03:17:30 GMT (104kb,D)
[v3] Wed, 26 May 2021 03:48:35 GMT (41kb,D)
Link back to: arXiv, form interface, contact.