We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ML

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Machine Learning

Title: Improved Gradient-Based Optimization Over Discrete Distributions

Abstract: In many applications we seek to maximize an expectation with respect to a distribution over discrete variables. Estimating gradients of such objectives with respect to the distribution parameters is a challenging problem. We analyze existing solutions including finite-difference (FD) estimators and continuous relaxation (CR) estimators in terms of bias and variance. We show that the commonly used Gumbel-Softmax estimator is biased and propose a simple method to reduce it. We also derive a simpler piece-wise linear continuous relaxation that also possesses reduced bias. We demonstrate empirically that reduced bias leads to a better performance in variational inference and on binary optimization tasks.
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as: arXiv:1810.00116 [stat.ML]
  (or arXiv:1810.00116v3 [stat.ML] for this version)

Submission history

From: Evgeny Andriyash [view email]
[v1] Sat, 29 Sep 2018 00:07:28 GMT (296kb,D)
[v2] Fri, 28 Dec 2018 00:19:52 GMT (335kb,D)
[v3] Sat, 15 Jun 2019 23:58:32 GMT (335kb,D)

Link back to: arXiv, form interface, contact.