We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

math.ST

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Mathematics > Statistics Theory

Title: Risk bounds for PU learning under Selected At Random assumption

Authors: Olivier Coudray (CELESTE), Christine Keribin (CELESTE), Pascal Massart (CELESTE), Patrick Pamphile (CELESTE)
Abstract: Positive-unlabeled learning (PU learning) is known as a special case of semi-supervised binary classification where only a fraction of positive examples are labeled. The challenge is then to find the correct classifier despite this lack of information. Recently, new methodologies have been introduced to address the case where the probability of being labeled may depend on the covariates. In this paper, we are interested in establishing risk bounds for PU learning under this general assumption. In addition, we quantify the impact of label noise on PU learning compared to standard classification setting. Finally, we provide a lower bound on minimax risk proving that the upper bound is almost optimal.
Subjects: Statistics Theory (math.ST); Machine Learning (stat.ML)
Cite as: arXiv:2201.06277 [math.ST]
  (or arXiv:2201.06277v1 [math.ST] for this version)

Submission history

From: Olivier Coudray [view email]
[v1] Mon, 17 Jan 2022 08:45:39 GMT (39kb)

Link back to: arXiv, form interface, contact.