
Title: Likelihood Inflating Sampling Algorithm

Abstract: Markov Chain Monte Carlo (MCMC) sampling from a posterior distribution corresponding to a massive data set can be computationally prohibitive, since producing one sample requires a number of operations that is linear in the data size. In this paper, we introduce a new communication-free parallel method, the Likelihood Inflating Sampling Algorithm (LISA), that significantly reduces computational costs by randomly splitting the dataset into smaller subsets and running MCMC methods independently and in parallel on each subset using different processors. Each processor draws sub-samples from a sub-posterior distribution that is defined by "inflating" the likelihood function, and the sub-samples are then combined using importance re-sampling to produce approximate full-data posterior samples. We test our method on several examples, including the important case of Bayesian Additive Regression Trees (BART), using both simulated and real datasets. The proposed method shows significant efficiency gains over the existing Consensus Monte Carlo of Scott et al. (2013).
Comments: 46 pages, 15 figures, submitted
Subjects: Machine Learning (stat.ML); Computation (stat.CO)
Cite as: arXiv:1605.02113 [stat.ML]
  (or arXiv:1605.02113v1 [stat.ML] for this version)
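
The split, inflate, and re-sample scheme outlined in the abstract can be illustrated on a toy problem. The sketch below is not the authors' implementation; it rests on several assumptions that the abstract does not state: "inflating" is taken to mean raising each subset likelihood to the power K (the number of subsets), the model is a normal mean with known variance and a normal prior so that every sub-posterior is Gaussian, direct Gaussian draws stand in for the per-processor MCMC runs, and all draws are pooled before a single importance re-sampling step. The function name `lisa_normal_mean` and its parameters are hypothetical.

```python
import numpy as np


def lisa_normal_mean(y, num_subsets, draws_per_subset, sigma=1.0, tau=10.0, seed=0):
    """Toy sketch for y_i ~ N(theta, sigma^2) with prior theta ~ N(0, tau^2)."""
    rng = np.random.default_rng(seed)
    y = np.asarray(y, dtype=float)
    n, sum_y = y.size, y.sum()

    # Randomly split the data into num_subsets roughly equal parts.
    subsets = np.array_split(rng.permutation(y), num_subsets)

    thetas, log_weights = [], []
    for y_j in subsets:
        # Assumed sub-posterior: prior times the subset likelihood raised to
        # the power num_subsets. For this conjugate model it is Gaussian.
        prec_j = 1.0 / tau**2 + num_subsets * y_j.size / sigma**2
        mean_j = (num_subsets * y_j.sum() / sigma**2) / prec_j
        sd_j = prec_j**-0.5

        # Direct draws stand in for the independent MCMC run on one processor.
        theta = rng.normal(mean_j, sd_j, size=draws_per_subset)

        # Log density of the normalized sub-posterior at each draw.
        log_q = -0.5 * np.log(2.0 * np.pi * sd_j**2) - 0.5 * ((theta - mean_j) / sd_j) ** 2
        # Log full-data posterior, up to a constant shared by all subsets.
        log_p = -0.5 * theta**2 / tau**2 - 0.5 * (n * theta**2 - 2.0 * theta * sum_y) / sigma**2

        thetas.append(theta)
        log_weights.append(log_p - log_q)

    # Pool the sub-samples and re-sample them in proportion to their weights.
    theta_all = np.concatenate(thetas)
    log_w = np.concatenate(log_weights)
    w = np.exp(log_w - log_w.max())  # subtract the max for numerical stability
    w /= w.sum()
    idx = rng.choice(theta_all.size, size=theta_all.size, replace=True, p=w)
    return theta_all[idx]
```

Because the toy model is conjugate, the output can be checked against the exact full-data posterior, whose mean is `(sum(y) / sigma**2) / (1 / tau**2 + len(y) / sigma**2)`. In a realistic setting the sub-posterior density is known only up to a constant, so the weighting step would need a different treatment than the normalized Gaussian density used here.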

Submission history

From: Radu V. Craiu
[v1] Fri, 6 May 2016 22:43:15 GMT (802kb,D)
[v2] Fri, 24 Feb 2017 06:19:11 GMT (354kb,D)
[v3] Fri, 30 Jun 2017 17:57:32 GMT (163kb,D)
