Unbiased Risk Estimators Can Mislead: A Case Study of Learning with Complementary Labels

Chou, Yu-Ting; Niu, Gang; Lin, Hsuan-Tien; Sugiyama, Masashi

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2007

Computer Science > Machine Learning

Title: Unbiased Risk Estimators Can Mislead: A Case Study of Learning with Complementary Labels

Authors: Yu-Ting Chou, Gang Niu, Hsuan-Tien Lin, Masashi Sugiyama

(Submitted on 5 Jul 2020 (v1), last revised 21 Aug 2020 (this version, v3))

Abstract: In weakly supervised learning, unbiased risk estimator(URE) is a powerful tool for training classifiers when training and test data are drawn from different distributions. Nevertheless, UREs lead to overfitting in many problem settings when the models are complex like deep networks. In this paper, we investigate reasons for such overfitting by studying a weakly supervised problem called learning with complementary labels. We argue the quality of gradient estimation matters more in risk minimization. Theoretically, we show that a URE gives an unbiased gradient estimator(UGE). Practically, however, UGEs may suffer from huge variance, which causes empirical gradients to be usually far away from true gradients during minimization. To this end, we propose a novel surrogate complementary loss(SCL) framework that trades zero bias with reduced variance and makes empirical gradients more aligned with true gradients in the direction. Thanks to this characteristic, SCL successfully mitigates the overfitting issue and improves URE-based methods.

Comments:	Accepted at ICML 2020
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2007.02235 [cs.LG]
	(or arXiv:2007.02235v3 [cs.LG] for this version)

Submission history

From: Yu-Ting Chou [view email]
[v1] Sun, 5 Jul 2020 04:19:37 GMT (416kb,D)
[v2] Tue, 7 Jul 2020 08:28:25 GMT (416kb,D)
[v3] Fri, 21 Aug 2020 18:11:55 GMT (1243kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2007.02235

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Unbiased Risk Estimators Can Mislead: A Case Study of Learning with Complementary Labels

Submission history