Learning to Predict Trustworthiness with Steep Slope Loss

Luo, Yan; Wong, Yongkang; Kankanhalli, Mohan S.; Zhao, Qi

Full-text links:

Download:

Current browse context:

cs.CV

< prev | next >

new | recent | 2110

Change to browse by:

Computer Science > Computer Vision and Pattern Recognition

Title: Learning to Predict Trustworthiness with Steep Slope Loss

Authors: Yan Luo, Yongkang Wong, Mohan S. Kankanhalli, Qi Zhao

(Submitted on 30 Sep 2021 (v1), last revised 27 Oct 2021 (this version, v2))

Abstract: Understanding the trustworthiness of a prediction yielded by a classifier is critical for the safe and effective use of AI models. Prior efforts have been proven to be reliable on small-scale datasets. In this work, we study the problem of predicting trustworthiness on real-world large-scale datasets, where the task is more challenging due to high-dimensional features, diverse visual concepts, and large-scale samples. In such a setting, we observe that the trustworthiness predictors trained with prior-art loss functions, i.e., the cross entropy loss, focal loss, and true class probability confidence loss, are prone to view both correct predictions and incorrect predictions to be trustworthy. The reasons are two-fold. Firstly, correct predictions are generally dominant over incorrect predictions. Secondly, due to the data complexity, it is challenging to differentiate the incorrect predictions from the correct ones on real-world large-scale datasets. To improve the generalizability of trustworthiness predictors, we propose a novel steep slope loss to separate the features w.r.t. correct predictions from the ones w.r.t. incorrect predictions by two slide-like curves that oppose each other. The proposed loss is evaluated with two representative deep learning models, i.e., Vision Transformer and ResNet, as trustworthiness predictors. We conduct comprehensive experiments and analyses on ImageNet, which show that the proposed loss effectively improves the generalizability of trustworthiness predictors. The code and pre-trained trustworthiness predictors for reproducibility are available at this https URL

Comments:	NeurIPS 2021
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2110.00054 [cs.CV]
	(or arXiv:2110.00054v2 [cs.CV] for this version)

Submission history

From: Yan Luo [view email]
[v1] Thu, 30 Sep 2021 19:19:09 GMT (26569kb,D)
[v2] Wed, 27 Oct 2021 21:28:16 GMT (26647kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2110.00054v2

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: Learning to Predict Trustworthiness with Steep Slope Loss

Submission history