We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

math.OC

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Mathematics > Optimization and Control

Title: Faster Wasserstein Distance Estimation with the Sinkhorn Divergence

Authors: Lenaic Chizat (LMO), Pierre Roussillon (DMA), Flavien Léger (DMA), François-Xavier Vialard (Univ Gustave Eiffel), Gabriel Peyré (DMA)
Abstract: The squared Wasserstein distance is a natural quantity to compare probability distributions in a non-parametric setting. This quantity is usually estimated with the plug-in estimator, defined via a discrete optimal transport problem which can be solved to $\epsilon$-accuracy by adding an entropic regularization of order $\epsilon$ and using for instance Sinkhorn's algorithm. In this work, we propose instead to estimate it with the Sinkhorn divergence, which is also built on entropic regularization but includes debiasing terms. We show that, for smooth densities, this estimator has a comparable sample complexity but allows higher regularization levels, of order $\epsilon^{1/2}$, which leads to improved computational complexity bounds and a strong speedup in practice. Our theoretical analysis covers the case of both randomly sampled densities and deterministic discretizations on uniform grids. We also propose and analyze an estimator based on Richardson extrapolation of the Sinkhorn divergence which enjoys improved statistical and computational efficiency guarantees, under a condition on the regularity of the approximation error, which is in particular satisfied for Gaussian densities. We finally demonstrate the efficiency of the proposed estimators with numerical experiments.
Subjects: Optimization and Control (math.OC); Statistics Theory (math.ST); Machine Learning (stat.ML)
Journal reference: Neural Information Processing Systems, Dec 2020, Vancouver, Canada
Cite as: arXiv:2006.08172 [math.OC]
  (or arXiv:2006.08172v2 [math.OC] for this version)

Submission history

From: Lenaic Chizat [view email]
[v1] Mon, 15 Jun 2020 06:58:16 GMT (978kb,D)
[v2] Thu, 29 Oct 2020 15:15:37 GMT (985kb,D)

Link back to: arXiv, form interface, contact.