References & Citations
Computer Science > Machine Learning
Title: Semi-Supervised Classification Based on Classification from Positive and Unlabeled Data
(Submitted on 23 May 2016 (v1), revised 14 Oct 2016 (this version, v2), latest version 16 Jun 2017 (v4))
Abstract: Most of the semi-supervised learning methods developed so far use unlabeled data for regularization purposes under particular distributional assumptions such as the manifold assumption. On the other hand, recently developed methods of learning from positive and unlabeled data (PU learning) use unlabeled data for loss evaluation, i.e., label information is directly extracted from unlabeled data. In this paper, we extend PU learning to also incorporate negative data and propose a novel semi-supervised learning approach. We establish a generalization error bound for our novel method and show that the bound decreases with respect to the number of unlabeled data without the distributional assumptions that are required in existing semi-supervised learning methods. Through experiments, we demonstrate the usefulness of the proposed method.
Submission history
From: Tomoya Sakai [view email][v1] Mon, 23 May 2016 09:37:48 GMT (153kb,D)
[v2] Fri, 14 Oct 2016 14:04:24 GMT (261kb,D)
[v3] Wed, 1 Mar 2017 11:39:31 GMT (406kb,D)
[v4] Fri, 16 Jun 2017 11:14:36 GMT (1750kb,D)
Link back to: arXiv, form interface, contact.