Current browse context:
stat.ME
Change to browse by:
References & Citations
Statistics > Methodology
Title: Bayesian Variable Selection in a Million Dimensions
(Submitted on 2 Aug 2022 (v1), last revised 31 Aug 2022 (this version, v2))
Abstract: Bayesian variable selection is a powerful tool for data analysis, as it offers a principled method for variable selection that accounts for prior information and uncertainty. However, wider adoption of Bayesian variable selection has been hampered by computational challenges, especially in difficult regimes with a large number of covariates P or non-conjugate likelihoods. To scale to the large P regime we introduce an efficient MCMC scheme whose cost per iteration is sublinear in P. In addition we show how this scheme can be extended to generalized linear models for count data, which are prevalent in biology, ecology, economics, and beyond. In particular we design efficient algorithms for variable selection in binomial and negative binomial regression, which includes logistic regression as a special case. In experiments we demonstrate the effectiveness of our methods, including on cancer and maize genomic data.
Submission history
From: Martin Jankowiak [view email][v1] Tue, 2 Aug 2022 00:11:15 GMT (1101kb,D)
[v2] Wed, 31 Aug 2022 14:13:13 GMT (1105kb,D)
Link back to: arXiv, form interface, contact.