Title: When OT meets MoM: Robust estimation of Wasserstein Distance

Abstract: Rooted in Optimal Transport, the Wasserstein distance has gained importance in Machine Learning due to its appealing geometrical properties and the increasing availability of efficient approximations. In this work, we consider the problem of estimating the Wasserstein distance between two probability distributions when observations are polluted by outliers. To that end, we investigate how to leverage Medians of Means (MoM) estimators to robustify the estimation of the Wasserstein distance. Exploiting the dual Kantorovich formulation of the Wasserstein distance, we introduce and discuss novel MoM-based robust estimators, study their consistency under a data contamination model, and provide convergence rates. These MoM estimators make it possible to render Wasserstein Generative Adversarial Networks (WGANs) robust to outliers, as shown by an empirical study on two benchmarks, CIFAR10 and Fashion MNIST. Finally, we discuss how to combine MoM with the entropy-regularized approximation of the Wasserstein distance and propose a simple MoM-based re-weighting scheme that could be used in conjunction with the Sinkhorn algorithm.
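
For readers unfamiliar with the MoM principle mentioned in the abstract, the following is a minimal sketch of the basic idea applied to estimating a mean: split the sample into blocks, average within each block, and take the median of the block averages. It is not the paper's Wasserstein estimator, which applies the principle to the dual Kantorovich formulation. The function name `median_of_means`, its parameters, and the toy contamination setup are assumptions made here for illustration only.

```python
import numpy as np


def median_of_means(x, n_blocks, seed=None):
    """Illustrative median-of-means estimate of the mean of a 1-D sample.

    Hypothetical helper, not taken from the paper: shuffle the sample,
    split it into `n_blocks` nearly equal blocks, average each block,
    and return the median of those block averages.
    """
    x = np.asarray(x, dtype=float).ravel()
    if not 1 <= n_blocks <= x.size:
        raise ValueError("n_blocks must be between 1 and the sample size")
    rng = np.random.default_rng(seed)
    shuffled = x[rng.permutation(x.size)]
    blocks = np.array_split(shuffled, n_blocks)
    block_means = [block.mean() for block in blocks]
    return float(np.median(block_means))


if __name__ == "__main__":
    # Toy contamination setup (an assumption for illustration):
    # 1000 standard normal inliers plus 20 gross outliers at 1000.
    rng = np.random.default_rng(0)
    inliers = rng.normal(loc=0.0, scale=1.0, size=1000)
    outliers = np.full(20, 1000.0)
    sample = np.concatenate([inliers, outliers])

    print("empirical mean :", sample.mean())
    print("median of means:", median_of_means(sample, n_blocks=101, seed=0))
```

In this sketch, 20 outliers can contaminate at most 20 of the 101 blocks, so the median is taken over a majority of clean block averages. Choosing the number of blocks to exceed twice the number of suspected outliers is the usual rule of thumb behind this design.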
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
Journal reference: Proceedings of The 24th International Conference on Artificial Intelligence and Statistics, AISTATS 2021
Cite as: arXiv:2006.10325 [stat.ML]
  (or arXiv:2006.10325v3 [stat.ML] for this version)

Submission history

From: Guillaume Staerman
[v1] Thu, 18 Jun 2020 07:31:39 GMT (1108kb,D)
[v2] Thu, 22 Oct 2020 09:06:21 GMT (888kb,D)
[v3] Fri, 18 Feb 2022 17:46:46 GMT (889kb,D)
