Title: Theoretical Comparisons of Positive-Unlabeled Learning against Positive-Negative Learning

Abstract: In PU learning, a binary classifier is trained from positive (P) and unlabeled (U) data without negative (N) data. Although N data is missing, PU learning sometimes outperforms PN learning (i.e., ordinary supervised learning). Hitherto, neither theoretical nor experimental analysis has been given to explain this phenomenon. In this paper, we theoretically compare PU (and NU) learning against PN learning based on the upper bounds on estimation errors. We find simple conditions under which PU and NU learning are likely to outperform PN learning, and we prove that, in terms of the upper bounds, either PU or NU learning (depending on the class-prior probability and the sizes of P and N data) given infinite U data will improve on PN learning. Our theoretical findings agree well with the experimental results on artificial and benchmark data, even when the experimental setup does not match the theoretical assumptions exactly.
Comments: NIPS 2016 camera-ready version
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:1603.03130 [cs.LG]
  (or arXiv:1603.03130v3 [cs.LG] for this version)

Submission history

From: Gang Niu
[v1] Thu, 10 Mar 2016 02:53:52 GMT (47kb)
[v2] Mon, 23 May 2016 04:35:47 GMT (169kb,D)
[v3] Fri, 28 Oct 2016 13:37:46 GMT (169kb,D)