Statistics > Machine Learning
Title: Likelihood Inflating Sampling Algorithm
(Submitted on 6 May 2016 (this version), latest version 30 Jun 2017 (v3))
Abstract: Markov Chain Monte Carlo (MCMC) sampling from a posterior distribution corresponding to a massive data set can be computationally prohibitive, since producing one sample requires a number of operations that is linear in the data size. In this paper, we introduce a new communication-free parallel method, the Likelihood Inflating Sampling Algorithm (LISA), that significantly reduces computational costs by randomly splitting the dataset into smaller subsets and running MCMC methods independently and in parallel on each subset using different processors. Each processor draws sub-samples from a sub-posterior distribution that is defined by "inflating" the likelihood function. The sub-samples are then combined using the importance re-sampling method to produce approximate samples from the full-data posterior. We test our method on several examples, including the important case of Bayesian Additive Regression Trees (BART), using both simulated and real datasets. The method we propose shows significant efficiency gains over the existing Consensus Monte Carlo of Scott et al. (2013).
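The abstract gives no formulas, so the following is only a minimal sketch of the split / inflate / re-sample pipeline on a toy problem, not the authors' implementation. It assumes that "inflating" means raising each subset likelihood to the power K (the number of subsets) while keeping the full prior, that the model is a normal mean with known variance (so each sub-posterior is Gaussian and can be sampled in closed form instead of by MCMC), and that the importance weights are taken against an equal-weight mixture of the sub-posteriors. The function name `lisa_normal_mean` and all of its parameters are hypothetical, and the weighting used in the paper may differ.

```python
import numpy as np


def lisa_normal_mean(y, num_subsets=4, draws_per_subset=2000,
                     prior_var=100.0, noise_var=1.0, seed=0):
    """Toy split / inflate / re-sample sketch for y_i ~ N(theta, noise_var),
    with prior theta ~ N(0, prior_var). All modelling choices are assumptions."""
    rng = np.random.default_rng(seed)
    n = y.size

    # Step 1: randomly split the data into K subsets.
    subsets = np.array_split(rng.permutation(y), num_subsets)

    # Step 2 (assumed form of inflation): sub-posterior j is proportional to
    # prior(theta) * [likelihood of subset j]^K, which is Gaussian here.
    sub_prec = np.array([1.0 / prior_var + num_subsets * s.size / noise_var
                         for s in subsets])
    sub_mean = np.array([num_subsets * s.sum() / noise_var
                         for s in subsets]) / sub_prec

    # Each "processor" draws independently; exact Gaussian draws stand in
    # for the per-subset MCMC runs described in the abstract.
    draws = rng.normal(sub_mean[:, None], 1.0 / np.sqrt(sub_prec)[:, None],
                       size=(num_subsets, draws_per_subset)).ravel()

    # Full-data posterior, available in closed form for this toy model only.
    full_prec = 1.0 / prior_var + n / noise_var
    full_mean = (y.sum() / noise_var) / full_prec

    def log_normal_pdf(x, mean, prec):
        return 0.5 * np.log(prec / (2.0 * np.pi)) - 0.5 * prec * (x - mean) ** 2

    # Step 3: importance re-sampling of the pooled draws. The proposal is
    # the equal-weight mixture of the K sub-posteriors (an assumption).
    log_target = log_normal_pdf(draws, full_mean, full_prec)
    log_components = log_normal_pdf(draws[:, None], sub_mean[None, :],
                                    sub_prec[None, :])
    log_proposal = (np.logaddexp.reduce(log_components, axis=1)
                    - np.log(num_subsets))
    log_w = log_target - log_proposal
    weights = np.exp(log_w - log_w.max())  # subtract max for stability
    weights /= weights.sum()

    resampled = rng.choice(draws, size=draws.size, replace=True, p=weights)
    return resampled, full_mean, 1.0 / full_prec
```

Calling `lisa_normal_mean(y)` on a one-dimensional NumPy array returns the re-sampled draws together with the exact posterior mean and variance, so the approximation can be checked directly in this toy setting. With power K, each sub-posterior has roughly the same spread as the full posterior, which is why the pooled draws make a usable importance-sampling proposal here.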
Submission history
From: Radu V. Craiu
[v1] Fri, 6 May 2016 22:43:15 GMT (802kb,D)
[v2] Fri, 24 Feb 2017 06:19:11 GMT (354kb,D)
[v3] Fri, 30 Jun 2017 17:57:32 GMT (163kb,D)