Current browse context:
cs.AI
Change to browse by:
References & Citations
Computer Science > Artificial Intelligence
Title: Meta-Learning MCMC Proposals
(Submitted on 21 Aug 2017 (v1), last revised 1 Jan 2019 (this version, v5))
Abstract: Effective implementations of sampling-based probabilistic inference often require manually constructed, model-specific proposals. Inspired by recent progresses in meta-learning for training learning agents that can generalize to unseen environments, we propose a meta-learning approach to building effective and generalizable MCMC proposals. We parametrize the proposal as a neural network to provide fast approximations to block Gibbs conditionals. The learned neural proposals generalize to occurrences of common structural motifs across different models, allowing for the construction of a library of learned inference primitives that can accelerate inference on unseen models with no model-specific training required. We explore several applications including open-universe Gaussian mixture models, in which our learned proposals outperform a hand-tuned sampler, and a real-world named entity recognition task, in which our sampler yields higher final F1 scores than classical single-site Gibbs sampling.
Submission history
From: Tongzhou Wang [view email][v1] Mon, 21 Aug 2017 00:44:32 GMT (1579kb,D)
[v2] Sun, 3 Dec 2017 18:47:50 GMT (1979kb,D)
[v3] Thu, 14 Dec 2017 04:32:39 GMT (1979kb,D)
[v4] Tue, 27 Nov 2018 12:09:11 GMT (2977kb,D)
[v5] Tue, 1 Jan 2019 06:47:06 GMT (2978kb,D)
Link back to: arXiv, form interface, contact.