We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.CO

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Computation

Title: Parallelising MCMC via Random Forests

Abstract: For Bayesian computation in big data contexts, the divide-and-conquer MCMC concept splits the whole data set into batches, runs MCMC algorithms separately over each batch to produce samples of parameters, and combines them to produce an approximation of the target distribution. In this article, we embed random forests into this framework and use each subposterior/partial-posterior as a proposal distribution to implement importance sampling. Unlike the existing divide-and-conquer MCMC, our methods are based on scaled subposteriors, whose scale factors are not necessarily restricted to being equal to one or to the number of subsets. Through several experiments, we show that our methods work well with models ranging from Gaussian cases to strongly non-Gaussian cases, and include model misspecification.
Comments: 12 pages
Subjects: Computation (stat.CO); Machine Learning (stat.ML)
Cite as: arXiv:1911.09698 [stat.CO]
  (or arXiv:1911.09698v1 [stat.CO] for this version)

Submission history

From: Christian P. Robert [view email]
[v1] Thu, 21 Nov 2019 19:02:13 GMT (583kb,D)

Link back to: arXiv, form interface, contact.