Adaptive Variational Bayesian Inference for Sparse Deep Neural Network

Bai, Jincheng; Song, Qifan; Cheng, Guang

Full-text links:

Download:

Current browse context:

math.ST

< prev | next >

new | recent | 1910

Mathematics > Statistics Theory

Title: Adaptive Variational Bayesian Inference for Sparse Deep Neural Network

Authors: Jincheng Bai, Qifan Song, Guang Cheng

(Submitted on 10 Oct 2019 (v1), last revised 3 Aug 2020 (this version, v3))

Abstract: In this work, we focus on variational Bayesian inference on the sparse Deep Neural Network (DNN) modeled under a class of spike-and-slab priors. Given a pre-specified sparse DNN structure, the corresponding variational posterior contraction rate is characterized that reveals a trade-off between the variational error and the approximation error, which are both determined by the network structural complexity (i.e., depth, width and sparsity). However, the optimal network structure, which strikes the balance of the aforementioned trade-off and yields the best rate, is generally unknown in reality. Therefore, our work further develops an {\em adaptive} variational inference procedure that can automatically select a reasonably good (data-dependent) network structure that achieves the best contraction rate, without knowing the optimal network structure. In particular, when the true function is H{\"o}lder smooth, the adaptive variational inference is capable to attain (near-)optimal rate without the knowledge of smoothness level. The above rate still suffers from the curse of dimensionality, and thus motivates the teacher-student setup, i.e., the true function is a sparse DNN model, under which the rate only logarithmically depends on the input dimension.

Subjects:	Statistics Theory (math.ST)
Cite as:	arXiv:1910.04355 [math.ST]
	(or arXiv:1910.04355v3 [math.ST] for this version)

Submission history

From: Guang Cheng [view email]
[v1] Thu, 10 Oct 2019 03:44:09 GMT (189kb)
[v2] Sun, 2 Feb 2020 21:26:17 GMT (257kb)
[v3] Mon, 3 Aug 2020 02:44:59 GMT (643kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> math > arXiv:1910.04355

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Mathematics > Statistics Theory

Title: Adaptive Variational Bayesian Inference for Sparse Deep Neural Network

Submission history