Title: On Consistency of Graph-based Semi-supervised Learning

Abstract: Graph-based semi-supervised learning is one of the most popular methods in machine learning. Some of its theoretical properties, such as bounds for the generalization error and the convergence of the graph Laplacian regularizer, have been studied in the computer science and statistics literature. However, a fundamental statistical property, the consistency of the estimator produced by this method, has not been proved. In this article, we study the consistency problem under a non-parametric framework. We prove the consistency of graph-based learning when the estimated scores are constrained to equal the observed responses for the labeled data. The sample sizes of both labeled and unlabeled data are allowed to grow in this result. When the estimated scores are not required to equal the observed responses, a tuning parameter is used to balance the loss function and the graph Laplacian regularizer. We give a counterexample demonstrating that the estimator in this case can be inconsistent. The theoretical findings are supported by numerical studies.
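
The two estimators contrasted in the abstract can be illustrated with a minimal NumPy sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the Gaussian kernel weights, the unnormalized Laplacian L = D - W, the squared-error loss, the exact scaling of the tuning parameter `lam`, and the function names. The "hard" version fixes the scores on labeled points to the observed responses and solves for the rest; the "soft" version minimizes the labeled squared error plus `lam` times the graph Laplacian penalty.

```python
import numpy as np

def gaussian_weights(X, bandwidth):
    """Dense similarity matrix from a Gaussian kernel (an assumed choice of graph)."""
    sq_dists = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=-1)
    W = np.exp(-sq_dists / (2.0 * bandwidth ** 2))
    np.fill_diagonal(W, 0.0)  # no self-loops
    return W

def graph_laplacian(W):
    """Unnormalized graph Laplacian L = D - W."""
    return np.diag(W.sum(axis=1)) - W

def hard_criterion(W, y_labeled):
    """Minimize f^T L f subject to f_i = y_i on the labeled points.

    The first len(y_labeled) rows of W are assumed to be the labeled points.
    """
    n_l = len(y_labeled)
    L = graph_laplacian(W)
    L_uu = L[n_l:, n_l:]   # unlabeled-unlabeled block
    L_ul = L[n_l:, :n_l]   # unlabeled-labeled block
    f_u = np.linalg.solve(L_uu, -L_ul @ y_labeled)
    return np.concatenate([y_labeled, f_u])

def soft_criterion(W, y_labeled, lam):
    """Minimize sum over labeled i of (f_i - y_i)^2 + lam * f^T L f.

    Setting the gradient to zero gives the linear system (J + lam * L) f = J y,
    where J is diagonal with ones on the labeled indices.
    """
    n = W.shape[0]
    n_l = len(y_labeled)
    L = graph_laplacian(W)
    J = np.zeros((n, n))
    J[np.arange(n_l), np.arange(n_l)] = 1.0
    y_full = np.zeros(n)
    y_full[:n_l] = y_labeled
    return np.linalg.solve(J + lam * L, y_full)
```

In this sketch the hard-criterion scores match the labels exactly on the labeled points, while the soft-criterion scores generally do not; as `lam` shrinks toward zero the soft solution approaches the hard one.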
Comments: Accepted at the 2019 IEEE 39th International Conference on Distributed Computing Systems (ICDCS)
Subjects: Machine Learning (stat.ML)
Journal reference: IEEE 39th International Conference on Distributed Computing Systems (ICDCS) 2019
Cite as: arXiv:1703.06177 [stat.ML]
  (or arXiv:1703.06177v2 [stat.ML] for this version)

Submission history

From: Chengan Du
[v1] Fri, 17 Mar 2017 19:24:09 GMT (31kb,D)
[v2] Wed, 10 Apr 2019 20:10:48 GMT (223kb)
