We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ML

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Machine Learning

Title: Semi-Supervised AUC Optimization based on Positive-Unlabeled Learning

Abstract: Maximizing the area under the receiver operating characteristic curve (AUC) is a standard approach to imbalanced classification. So far, various supervised AUC optimization methods have been developed and they are also extended to semi-supervised scenarios to cope with small sample problems. However, existing semi-supervised AUC optimization methods rely on strong distributional assumptions, which are rarely satisfied in real-world problems. In this paper, we propose a novel semi-supervised AUC optimization method that does not require such restrictive assumptions. We first develop an AUC optimization method based only on positive and unlabeled data (PU-AUC) and then extend it to semi-supervised learning by combining it with a supervised AUC optimization method. We theoretically prove that, without the restrictive distributional assumptions, unlabeled data contribute to improving the generalization performance in PU and semi-supervised AUC optimization methods. Finally, we demonstrate the practical usefulness of the proposed methods through experiments.
Comments: Fixed typos in Appendix
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
DOI: 10.1007/s10994-017-5678-9
Cite as: arXiv:1705.01708 [stat.ML]
  (or arXiv:1705.01708v3 [stat.ML] for this version)

Submission history

From: Tomoya Sakai [view email]
[v1] Thu, 4 May 2017 05:46:32 GMT (248kb,D)
[v2] Mon, 16 Oct 2017 15:25:32 GMT (223kb,D)
[v3] Mon, 11 Apr 2022 14:36:15 GMT (152kb,D)

Link back to: arXiv, form interface, contact.