Characteristics of Monte Carlo Dropout in Wide Neural Networks

Sicking, Joachim; Akila, Maram; Wirtz, Tim; Houben, Sebastian; Fischer, Asja

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2007

Computer Science > Machine Learning

Title: Characteristics of Monte Carlo Dropout in Wide Neural Networks

Authors: Joachim Sicking, Maram Akila, Tim Wirtz, Sebastian Houben, Asja Fischer

(Submitted on 10 Jul 2020)

Abstract: Monte Carlo (MC) dropout is one of the state-of-the-art approaches for uncertainty estimation in neural networks (NNs). It has been interpreted as approximately performing Bayesian inference. Based on previous work on the approximation of Gaussian processes by wide and deep neural networks with random weights, we study the limiting distribution of wide untrained NNs under dropout more rigorously and prove that they as well converge to Gaussian processes for fixed sets of weights and biases. We sketch an argument that this property might also hold for infinitely wide feed-forward networks that are trained with (full-batch) gradient descent. The theory is contrasted by an empirical analysis in which we find correlations and non-Gaussian behaviour for the pre-activations of finite width NNs. We therefore investigate how (strongly) correlated pre-activations can induce non-Gaussian behavior in NNs with strongly correlated weights.

Comments:	Accepted at the ICML 2020 workshop for Uncertainty and Robustness in Deep Learning
Subjects:	Machine Learning (cs.LG); Statistics Theory (math.ST); Machine Learning (stat.ML)
Cite as:	arXiv:2007.05434 [cs.LG]
	(or arXiv:2007.05434v1 [cs.LG] for this version)

Submission history

From: Maram Akila [view email]
[v1] Fri, 10 Jul 2020 15:14:43 GMT (2259kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2007.05434

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Characteristics of Monte Carlo Dropout in Wide Neural Networks

Submission history