Wasserstein Distributional Robustness and Regularization in Statistical Learning

Gao, Rui; Chen, Xi; Kleywegt, Anton J.

(Submitted on 17 Dec 2017 (this version), latest version 30 Oct 2020 (v3))

Abstract: A central question in statistical learning is how to design algorithms that not only perform well on training data but also generalize to new and unseen data. In this paper, we tackle this question by formulating a distributionally robust stochastic optimization (DRSO) problem, which seeks a solution that minimizes the worst-case expected loss over a family of distributions that are close to the empirical distribution in Wasserstein distance. We establish a connection between such Wasserstein DRSO and regularization. More precisely, we identify a broad class of loss functions for which the Wasserstein DRSO is asymptotically equivalent to a regularization problem with a gradient-norm penalty. This relation provides new interpretations for problems involving regularization, including a great number of statistical learning problems and discrete choice models (e.g. multinomial logit). The connection suggests a principled way to regularize high-dimensional, non-convex problems. This is demonstrated through two applications: the training of Wasserstein generative adversarial networks (WGANs) in deep learning, and learning heterogeneous consumer preferences with a mixed logit choice model.
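To make the gradient-norm penalty mentioned in the abstract concrete, here is a minimal Python sketch of one possible regularized objective: the empirical mean loss plus a radius `rho` times the empirical L^q norm of the loss gradient taken with respect to the data point. This is an illustration under stated assumptions, not the paper's formulation or code: the choice of logistic loss, the Euclidean norm, the default exponent `q = 2`, and all function names are hypothetical.

```python
import math


def logistic_loss_and_grad_norm(theta, x, y):
    """Logistic loss log(1 + exp(-y * theta.x)) and the Euclidean norm of its
    gradient with respect to the feature vector x (label y is +1 or -1)."""
    margin = y * sum(t * xi for t, xi in zip(theta, x))
    # Numerically stable softplus(-margin).
    loss = max(-margin, 0.0) + math.log1p(math.exp(-abs(margin)))
    # sigmoid(-margin), computed without overflow.
    if margin >= 0:
        e = math.exp(-margin)
        weight = e / (1.0 + e)
    else:
        weight = 1.0 / (1.0 + math.exp(margin))
    theta_norm = math.sqrt(sum(t * t for t in theta))
    # The gradient in x is -y * weight * theta, so its norm is weight * ||theta||.
    return loss, weight * theta_norm


def gradient_norm_penalized_loss(losses, grad_norms, rho, q=2.0):
    """Empirical mean loss plus rho times the empirical L^q norm of the
    per-sample gradient norms (assumed form of the penalty)."""
    n = len(losses)
    mean_loss = sum(losses) / n
    penalty = (sum(g ** q for g in grad_norms) / n) ** (1.0 / q)
    return mean_loss + rho * penalty


def regularized_objective(theta, data, rho, q=2.0):
    """Evaluate the penalized objective on a list of (x, y) pairs."""
    pairs = [logistic_loss_and_grad_norm(theta, x, y) for x, y in data]
    losses = [p[0] for p in pairs]
    grad_norms = [p[1] for p in pairs]
    return gradient_norm_penalized_loss(losses, grad_norms, rho, q)
```

In this sketch `rho` plays the role of the Wasserstein ball radius: setting `rho = 0` recovers the plain empirical loss, while larger values penalize parameters whose loss is more sensitive to perturbations of the data.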

Subjects:	Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as:	arXiv:1712.06050 [cs.LG]
	(or arXiv:1712.06050v1 [cs.LG] for this version)

Submission history

From: Rui Gao
[v1] Sun, 17 Dec 2017 02:47:14 GMT (1743kb,D)
[v2] Tue, 26 Dec 2017 15:50:30 GMT (2779kb,D)
[v3] Fri, 30 Oct 2020 17:56:21 GMT (1289kb)
