We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.CO

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Computation

Title: Optimal Transport: Fast Probabilistic Approximation with Exact Solvers

Abstract: We propose a simple subsampling scheme for fast randomized approximate computation of optimal transport distances. This scheme operates on a random subset of the full data and can use any exact algorithm as a black-box back-end, including state-of-the-art solvers and entropically penalized versions. It is based on averaging the exact distances between empirical measures generated from independent samples from the original measures and can easily be tuned towards higher accuracy or shorter computation times. To this end, we give non-asymptotic deviation bounds for its accuracy in the case of discrete optimal transport problems. In particular, we show that in many important instances, including images (2D-histograms), the approximation error is independent of the size of the full problem. We present numerical experiments that demonstrate that a very good approximation in typical applications can be obtained in a computation time that is several orders of magnitude smaller than what is required for exact computation of the full problem.
Comments: to appear in Journal of Machine Learning Research
Subjects: Computation (stat.CO); Methodology (stat.ME)
Journal reference: Journal of Machine Learning Research 20(105):1-23, 2019
Cite as: arXiv:1802.05570 [stat.CO]
  (or arXiv:1802.05570v4 [stat.CO] for this version)

Submission history

From: Yoav Zemel [view email]
[v1] Wed, 14 Feb 2018 14:59:32 GMT (225kb,D)
[v2] Tue, 4 Sep 2018 10:11:37 GMT (229kb,D)
[v3] Thu, 14 Mar 2019 23:45:54 GMT (694kb,D)
[v4] Fri, 5 Jul 2019 13:11:07 GMT (694kb,D)

Link back to: arXiv, form interface, contact.