Current browse context:
cs.LG
Change to browse by:
References & Citations
Computer Science > Machine Learning
Title: Liberty or Depth: Deep Bayesian Neural Nets Do Not Need Complex Weight Posterior Approximations
(Submitted on 10 Feb 2020 (v1), last revised 10 Mar 2021 (this version, v4))
Abstract: We challenge the longstanding assumption that the mean-field approximation for variational inference in Bayesian neural networks is severely restrictive, and show this is not the case in deep networks. We prove several results indicating that deep mean-field variational weight posteriors can induce similar distributions in function-space to those induced by shallower networks with complex weight posteriors. We validate our theoretical contributions empirically, both through examination of the weight posterior using Hamiltonian Monte Carlo in small models and by comparing diagonal- to structured-covariance in large settings. Since complex variational posteriors are often expensive and cumbersome to implement, our results suggest that using mean-field variational inference in a deeper model is both a practical and theoretically justified alternative to structured approximations.
Submission history
From: Sebastian Farquhar [view email][v1] Mon, 10 Feb 2020 13:11:45 GMT (1244kb,D)
[v2] Wed, 8 Jul 2020 10:39:50 GMT (4208kb,D)
[v3] Mon, 2 Nov 2020 11:55:29 GMT (3824kb,D)
[v4] Wed, 10 Mar 2021 09:19:13 GMT (3802kb,D)
Link back to: arXiv, form interface, contact.