Current browse context:
stat.CO
Change to browse by:
References & Citations
Statistics > Methodology
Title: Distributed Bayesian Varying Coefficient Modeling Using a Gaussian Process Prior
(Submitted on 1 Jun 2020 (v1), last revised 26 Feb 2022 (this version, v2))
Abstract: Varying coefficient models (VCMs) are widely used for estimating nonlinear regression functions for functional data. Their Bayesian variants using Gaussian process priors on the functional coefficients, however, have received limited attention in massive data applications, mainly due to the prohibitively slow posterior computations using Markov chain Monte Carlo (MCMC) algorithms. We address this problem using a divide-and-conquer Bayesian approach. We first create a large number of data subsamples with much smaller sizes. Then, we formulate the VCM as a linear mixed-effects model and develop a data augmentation algorithm for obtaining MCMC draws on all the subsets in parallel. Finally, we aggregate the MCMC-based estimates of subset posteriors into a single Aggregated Monte Carlo (AMC) posterior, which is used as a computationally efficient alternative to the true posterior distribution. Theoretically, we derive minimax optimal posterior convergence rates for the AMC posteriors of both the varying coefficients and the mean regression function. We provide quantification on the orders of subset sample sizes and the number of subsets. The empirical results show that the combination schemes that satisfy our theoretical assumptions, including the AMC posterior, have better estimation performance than their main competitors across diverse simulations and in a real data analysis.
Submission history
From: Cheng Li [view email][v1] Mon, 1 Jun 2020 08:16:45 GMT (2165kb,D)
[v2] Sat, 26 Feb 2022 01:43:59 GMT (2164kb,D)
Link back to: arXiv, form interface, contact.