We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ML

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Machine Learning

Title: Categorical Reparameterization with Gumbel-Softmax

Abstract: Categorical variables are a natural choice for representing discrete structure in the world. However, stochastic neural networks rarely use categorical latent variables due to the inability to backpropagate through samples. In this work, we present an efficient gradient estimator that replaces the non-differentiable sample from a categorical distribution with a differentiable sample from a novel Gumbel-Softmax distribution. This distribution has the essential property that it can be smoothly annealed into a categorical distribution. We show that our Gumbel-Softmax estimator outperforms state-of-the-art gradient estimators on structured output prediction and unsupervised generative modeling tasks with categorical latent variables, and enables large speedups on semi-supervised classification.
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as: arXiv:1611.01144 [stat.ML]
  (or arXiv:1611.01144v5 [stat.ML] for this version)

Submission history

From: Eric Jang [view email]
[v1] Thu, 3 Nov 2016 19:48:08 GMT (996kb,D)
[v2] Tue, 22 Nov 2016 23:18:13 GMT (856kb,D)
[v3] Fri, 17 Mar 2017 05:16:36 GMT (857kb,D)
[v4] Sat, 1 Apr 2017 15:33:06 GMT (855kb,D)
[v5] Sat, 5 Aug 2017 22:45:19 GMT (1774kb,D)

Link back to: arXiv, form interface, contact.