We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

math.ST

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Mathematics > Statistics Theory

Title: Adaptive Variational Bayesian Inference for Sparse Deep Neural Network

Abstract: In this work, we focus on variational Bayesian inference on the sparse Deep Neural Network (DNN) modeled under a class of spike-and-slab priors. Given a pre-specified sparse DNN structure, the corresponding variational posterior contraction rate is characterized that reveals a trade-off between the variational error and the approximation error, which are both determined by the network structural complexity (i.e., depth, width and sparsity). However, the optimal network structure, which strikes the balance of the aforementioned trade-off and yields the best rate, is generally unknown in reality. Therefore, our work further develops an {\em adaptive} variational inference procedure that can automatically select a reasonably good (data-dependent) network structure that achieves the best contraction rate, without knowing the optimal network structure. In particular, when the true function is H{\"o}lder smooth, the adaptive variational inference is capable to attain (near-)optimal rate without the knowledge of smoothness level. The above rate still suffers from the curse of dimensionality, and thus motivates the teacher-student setup, i.e., the true function is a sparse DNN model, under which the rate only logarithmically depends on the input dimension.
Subjects: Statistics Theory (math.ST)
Cite as: arXiv:1910.04355 [math.ST]
  (or arXiv:1910.04355v3 [math.ST] for this version)

Submission history

From: Guang Cheng [view email]
[v1] Thu, 10 Oct 2019 03:44:09 GMT (189kb)
[v2] Sun, 2 Feb 2020 21:26:17 GMT (257kb)
[v3] Mon, 3 Aug 2020 02:44:59 GMT (643kb,D)

Link back to: arXiv, form interface, contact.