We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.CO

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Computation

Title: Scalable MCMC for Bayes Shrinkage Priors

Abstract: Gaussian scale mixture priors are frequently employed in Bayesian analysis of high-dimensional models, and several members of this family have optimal risk properties when the truth is sparse. While optimization-based algorithms for the extremely popular Lasso and elastic net procedures can scale to dimension in the hundreds of thousands, corresponding Bayesian methods that use Markov chain Monte Carlo (MCMC) for computation are limited to problems at least an order of magnitude smaller. This is due to high computational cost per step of the associated Markov kernel and growth of the variance of time-averaging estimators as a function of dimension. We propose an MCMC algorithm for computation in these models that combines block updating and approximations of the Markov kernel to directly combat both of these factors. Our algorithm gives orders of magnitude speedup over the best existing alternatives in high-dimensional applications. We give theoretical guarantees for the accuracy of the kernel approximation. The scalability of the algorithm is illustrated in simulations with problem size as large as $N=5,000$ observations and $p=50,000$ predictors, and an application to a genome wide association study with $N=2,267$ and $p=98,385$. The empirical results also show that the new algorithm yields estimates with lower mean squared error, intervals with better coverage, and elucidates features of the posterior that were often missed by previous algorithms in high dimensions, including bimodality of posterior marginals indicating uncertainty about which covariates belong in the model. This latter feature is an important motivation for a Bayesian approach to testing and selection in high dimensions.
Subjects: Computation (stat.CO)
Cite as: arXiv:1705.00841 [stat.CO]
  (or arXiv:1705.00841v2 [stat.CO] for this version)

Submission history

From: James Johndrow [view email]
[v1] Tue, 2 May 2017 08:03:29 GMT (474kb,D)
[v2] Sun, 1 Apr 2018 14:49:16 GMT (673kb,D)
[v3] Mon, 15 Oct 2018 05:48:14 GMT (759kb,D)

Link back to: arXiv, form interface, contact.