We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ML

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Machine Learning

Title: Distributed, partially collapsed MCMC for Bayesian Nonparametrics

Abstract: Bayesian nonparametric (BNP) models provide elegant methods for discovering underlying latent features within a data set, but inference in such models can be slow. We exploit the fact that completely random measures, which commonly used models like the Dirichlet process and the beta-Bernoulli process can be expressed as, are decomposable into independent sub-measures. We use this decomposition to partition the latent measure into a finite measure containing only instantiated components, and an infinite measure containing all other components. We then select different inference algorithms for the two components: uncollapsed samplers mix well on the finite measure, while collapsed samplers mix well on the infinite, sparsely occupied tail. The resulting hybrid algorithm can be applied to a wide class of models, and can be easily distributed to allow scalable inference without sacrificing asymptotic convergence guarantees.
Comments: To appear in the 23rd International Conference on Artificial Intelligence and Statistics
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
Journal reference: Artificial Intelligence and Statistics, 108:3685-3695, 2020
Cite as: arXiv:2001.05591 [stat.ML]
  (or arXiv:2001.05591v3 [stat.ML] for this version)

Submission history

From: Michael Minyi Zhang [view email]
[v1] Wed, 15 Jan 2020 23:10:13 GMT (2672kb,D)
[v2] Tue, 3 Mar 2020 09:29:19 GMT (5732kb,D)
[v3] Wed, 4 Mar 2020 13:57:15 GMT (5732kb,D)

Link back to: arXiv, form interface, contact.