Title: Deep Discriminative to Kernel Density Graph for In- and Out-of-distribution Calibrated Inference

Abstract: Deep discriminative approaches like random forests and deep neural networks have recently found applications in many important real-world scenarios. However, deploying these learning algorithms in safety-critical applications raises concerns, particularly when it comes to ensuring confidence calibration for both in-distribution (ID) and out-of-distribution (OOD) data points. Many popular methods for ID calibration, such as isotonic regression and Platt's sigmoidal regression, exhibit excellent ID calibration performance. However, these methods are not calibrated for the entire feature space, leading to overconfidence on OOD samples. On the other end of the spectrum, existing OOD calibration methods generally exhibit poor ID calibration. In this paper, we address the ID and OOD calibration problems jointly. We leverage the fact that deep models, including both random forests and deep-nets, learn internal representations which are unions of polytopes with affine activation functions, and conceptualize both as partitioning rules of the feature space. We replace the affine function in each polytope populated by the training data with a Gaussian kernel. We propose sufficient conditions for our proposed methods to be consistent estimators of the corresponding class conditional densities. Moreover, our experiments on both tabular and vision benchmarks show that the proposed approaches obtain well-calibrated posteriors while mostly preserving or improving the classification accuracy of the original algorithm in the in-distribution region, and extrapolate beyond the training data to handle out-of-distribution inputs appropriately.
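Illustrative sketch (not the paper's implementation): the core idea of placing a Gaussian kernel on each populated cell of a feature-space partition can be shown on a toy 1-D problem. Everything specific here is an assumption made for illustration and does not come from the abstract: the hand-picked `thresholds` stand in for a partition learned by a forest or network, the variance floor `min_var` avoids degenerate kernels, and the small constant `floor` added to every class density is one simple way to make posteriors fall back toward the class priors far from the training data.

```python
import math
from collections import defaultdict


def fit_cell_gaussians(xs, ys, thresholds, min_var=1e-2):
    """Fit one 1-D Gaussian per (cell, class) for each cell that holds training data."""
    cells = defaultdict(list)
    n_per_class = defaultdict(int)
    for x, y in zip(xs, ys):
        cell = sum(x > t for t in thresholds)  # index of the interval containing x
        cells[(cell, y)].append(x)
        n_per_class[y] += 1
    kernels = []
    for (cell, y), pts in cells.items():
        mean = sum(pts) / len(pts)
        var = max(sum((p - mean) ** 2 for p in pts) / len(pts), min_var)
        weight = len(pts) / n_per_class[y]  # share of class y that falls in this cell
        kernels.append((y, weight, mean, var))
    return kernels, dict(n_per_class)


def class_density(x, label, kernels):
    """Class-conditional density estimate: weighted sum of that class's cell Gaussians."""
    total = 0.0
    for y, weight, mean, var in kernels:
        if y == label:
            norm = math.sqrt(2.0 * math.pi * var)
            total += weight * math.exp(-((x - mean) ** 2) / (2.0 * var)) / norm
    return total


def posterior(x, kernels, n_per_class, floor=1e-6):
    """Bayes rule over the class densities; `floor` (an assumption) dominates far from data."""
    n = sum(n_per_class.values())
    scores = {
        y: (class_density(x, y, kernels) + floor) * count / n
        for y, count in n_per_class.items()
    }
    z = sum(scores.values())
    return {y: s / z for y, s in scores.items()}


# Toy data: class 0 clustered near -2, class 1 clustered near +2.
xs = [-2.2, -2.0, -1.8, -1.5, 1.5, 1.8, 2.0, 2.2]
ys = [0, 0, 0, 0, 1, 1, 1, 1]
kernels, counts = fit_cell_gaussians(xs, ys, thresholds=[-1.0, 0.0, 1.0])
print(posterior(-2.0, kernels, counts))   # near the class-0 training points
print(posterior(100.0, kernels, counts))  # far from all training points
```

In this toy setup, a query close to the class-0 cluster receives a confident class-0 posterior, while a query far from all training data receives a posterior equal to the class priors, because every Gaussian term vanishes and only the constant `floor` remains.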
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Data Structures and Algorithms (cs.DS); Neurons and Cognition (q-bio.NC); Machine Learning (stat.ML)
Cite as: arXiv:2201.13001 [cs.LG]
  (or arXiv:2201.13001v7 [cs.LG] for this version)

Submission history

From: Jayanta Dey
[v1] Mon, 31 Jan 2022 05:07:16 GMT (16278kb,D)
[v2] Sun, 6 Feb 2022 14:38:23 GMT (16262kb,D)
[v3] Mon, 14 Feb 2022 14:14:47 GMT (16263kb,D)
[v4] Fri, 25 Mar 2022 03:41:23 GMT (16263kb,D)
[v5] Fri, 19 May 2023 21:15:28 GMT (18650kb,D)
[v6] Thu, 19 Oct 2023 02:27:07 GMT (33996kb,D)
[v7] Tue, 12 Mar 2024 12:57:20 GMT (33151kb,D)
