We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: Unbiased Risk Estimators Can Mislead: A Case Study of Learning with Complementary Labels

Abstract: In weakly supervised learning, unbiased risk estimator(URE) is a powerful tool for training classifiers when training and test data are drawn from different distributions. Nevertheless, UREs lead to overfitting in many problem settings when the models are complex like deep networks. In this paper, we investigate reasons for such overfitting by studying a weakly supervised problem called learning with complementary labels. We argue the quality of gradient estimation matters more in risk minimization. Theoretically, we show that a URE gives an unbiased gradient estimator(UGE). Practically, however, UGEs may suffer from huge variance, which causes empirical gradients to be usually far away from true gradients during minimization. To this end, we propose a novel surrogate complementary loss(SCL) framework that trades zero bias with reduced variance and makes empirical gradients more aligned with true gradients in the direction. Thanks to this characteristic, SCL successfully mitigates the overfitting issue and improves URE-based methods.
Comments: Accepted at ICML 2020
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:2007.02235 [cs.LG]
  (or arXiv:2007.02235v3 [cs.LG] for this version)

Submission history

From: Yu-Ting Chou [view email]
[v1] Sun, 5 Jul 2020 04:19:37 GMT (416kb,D)
[v2] Tue, 7 Jul 2020 08:28:25 GMT (416kb,D)
[v3] Fri, 21 Aug 2020 18:11:55 GMT (1243kb,D)

Link back to: arXiv, form interface, contact.