Current browse context:
stat.ML
Change to browse by:
References & Citations
Statistics > Machine Learning
Title: Improved Gradient-Based Optimization Over Discrete Distributions
(Submitted on 29 Sep 2018 (v1), last revised 15 Jun 2019 (this version, v3))
Abstract: In many applications we seek to maximize an expectation with respect to a distribution over discrete variables. Estimating gradients of such objectives with respect to the distribution parameters is a challenging problem. We analyze existing solutions including finite-difference (FD) estimators and continuous relaxation (CR) estimators in terms of bias and variance. We show that the commonly used Gumbel-Softmax estimator is biased and propose a simple method to reduce it. We also derive a simpler piece-wise linear continuous relaxation that also possesses reduced bias. We demonstrate empirically that reduced bias leads to a better performance in variational inference and on binary optimization tasks.
Submission history
From: Evgeny Andriyash [view email][v1] Sat, 29 Sep 2018 00:07:28 GMT (296kb,D)
[v2] Fri, 28 Dec 2018 00:19:52 GMT (335kb,D)
[v3] Sat, 15 Jun 2019 23:58:32 GMT (335kb,D)
Link back to: arXiv, form interface, contact.