f-GANs in an Information Geometric Nutshell

Nock, Richard; Cranko, Zac; Menon, Aditya Krishna; Qu, Lizhen; Williamson, Robert C.

Full-text links:

Download:

Current browse context:

stat

< prev | next >

new | recent | 1707

Computer Science > Machine Learning

Title: f-GANs in an Information Geometric Nutshell

Authors: Richard Nock, Zac Cranko, Aditya Krishna Menon, Lizhen Qu, Robert C. Williamson

(Submitted on 14 Jul 2017)

Abstract: Nowozin \textit{et al} showed last year how to extend the GAN \textit{principle} to all $f$-divergences. The approach is elegant but falls short of a full description of the supervised game, and says little about the key player, the generator: for example, what does the generator actually converge to if solving the GAN game means convergence in some space of parameters? How does that provide hints on the generator's design and compare to the flourishing but almost exclusively experimental literature on the subject?
In this paper, we unveil a broad class of distributions for which such convergence happens --- namely, deformed exponential families, a wide superset of exponential families --- and show tight connections with the three other key GAN parameters: loss, game and architecture. In particular, we show that current deep architectures are able to factorize a very large number of such densities using an especially compact design, hence displaying the power of deep architectures and their concinnity in the $f$-GAN game. This result holds given a sufficient condition on \textit{activation functions} --- which turns out to be satisfied by popular choices. The key to our results is a variational generalization of an old theorem that relates the KL divergence between regular exponential families and divergences between their natural parameters. We complete this picture with additional results and experimental insights on how these results may be used to ground further improvements of GAN architectures, via (i) a principled design of the activation functions in the generator and (ii) an explicit integration of proper composite losses' link function in the discriminator.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
ACM classes:	I.2.6; I.5.1
Cite as:	arXiv:1707.04385 [cs.LG]
	(or arXiv:1707.04385v1 [cs.LG] for this version)

Submission history

From: Richard Nock [view email]
[v1] Fri, 14 Jul 2017 05:07:52 GMT (5288kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1707.04385

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Computer Science > Machine Learning

Title: f-GANs in an Information Geometric Nutshell

Submission history