Current browse context:
stat.ML
Change to browse by:
References & Citations
Statistics > Machine Learning
Title: Distributed, partially collapsed MCMC for Bayesian Nonparametrics
(Submitted on 15 Jan 2020 (v1), last revised 4 Mar 2020 (this version, v3))
Abstract: Bayesian nonparametric (BNP) models provide elegant methods for discovering underlying latent features within a data set, but inference in such models can be slow. We exploit the fact that completely random measures, which commonly used models like the Dirichlet process and the beta-Bernoulli process can be expressed as, are decomposable into independent sub-measures. We use this decomposition to partition the latent measure into a finite measure containing only instantiated components, and an infinite measure containing all other components. We then select different inference algorithms for the two components: uncollapsed samplers mix well on the finite measure, while collapsed samplers mix well on the infinite, sparsely occupied tail. The resulting hybrid algorithm can be applied to a wide class of models, and can be easily distributed to allow scalable inference without sacrificing asymptotic convergence guarantees.
Submission history
From: Michael Minyi Zhang [view email][v1] Wed, 15 Jan 2020 23:10:13 GMT (2672kb,D)
[v2] Tue, 3 Mar 2020 09:29:19 GMT (5732kb,D)
[v3] Wed, 4 Mar 2020 13:57:15 GMT (5732kb,D)
Link back to: arXiv, form interface, contact.