Cold Posteriors and Aleatoric Uncertainty

Adlam, Ben; Snoek, Jasper; Smith, Samuel L.

(Submitted on 31 Jul 2020)

Abstract: Recent work has observed that one can outperform exact inference in Bayesian neural networks by tuning the "temperature" of the posterior on a validation set (the "cold posterior" effect). To help interpret this phenomenon, we argue that commonly used priors in Bayesian neural networks can significantly overestimate the aleatoric uncertainty in the labels on many classification datasets. This problem is particularly pronounced in academic benchmarks like MNIST or CIFAR, for which the quality of the labels is high. For the special case of Gaussian process regression, any positive temperature corresponds to a valid posterior under a modified prior, and tuning this temperature is directly analogous to empirical Bayes. On classification tasks, there is no direct equivalence between modifying the prior and tuning the temperature; however, reducing the temperature can lead to models that better reflect our belief that one gains little information by relabeling existing examples in the training set. Therefore, although cold posteriors do not always correspond to an exact inference procedure, we believe they may often better reflect our true prior beliefs.
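
The Gaussian process claim in the abstract can be checked by hand in the simplest possible setting. The sketch below is an illustration only, not the authors' code: it assumes a single training input, a zero-mean Gaussian prior on the function value, Gaussian observation noise, and a tempered posterior defined as the exact posterior density raised to the power 1/T and renormalized (the abstract does not spell out which definition the paper uses). The function names `gp_posterior_1d` and `temper_gaussian` and all numeric values are hypothetical. Under these assumptions, tempering gives the same distribution as exact inference with both the prior variance and the noise variance multiplied by T.

```python
import math


def gp_posterior_1d(k, noise_var, y):
    """Exact posterior N(mean, var) over the function value f at one training input,
    for prior f ~ N(0, k) and likelihood y | f ~ N(f, noise_var)."""
    precision = 1.0 / k + 1.0 / noise_var  # prior precision + likelihood precision
    var = 1.0 / precision
    mean = var * y / noise_var
    return mean, var


def temper_gaussian(mean, var, temperature):
    """Raise a Gaussian density to the power 1/T and renormalize.
    The mean is unchanged and the variance is multiplied by T."""
    return mean, temperature * var


# Illustrative values (assumptions, not taken from the paper).
k, noise_var, y, T = 2.0, 0.5, 1.3, 0.25

# Route 1: exact posterior under the original model, then tempered with T < 1.
tempered = temper_gaussian(*gp_posterior_1d(k, noise_var, y), T)

# Route 2: exact posterior under a modified model with prior variance T*k
# and noise variance T*noise_var.
modified = gp_posterior_1d(T * k, T * noise_var, y)

print("tempered posterior (mean, var):", tempered)
print("modified-model posterior (mean, var):", modified)
print("match:", all(math.isclose(a, b) for a, b in zip(tempered, modified)))
```

In this setting a cold temperature (T < 1) leaves the posterior mean unchanged and shrinks the posterior variance, which is the same as assuming both a narrower prior and less label noise. This scalar Gaussian argument does not carry over to classification likelihoods, which is consistent with the abstract's statement that no direct equivalence holds there.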

Comments:	5 pages, 3 figures
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Journal reference:	ICML workshop on Uncertainty and Robustness in Deep Learning (2020)
Cite as:	arXiv:2008.00029 [stat.ML]
	(or arXiv:2008.00029v1 [stat.ML] for this version)

Submission history

From: Ben Adlam
[v1] Fri, 31 Jul 2020 18:37:31 GMT (522kb,D)
