Title: Network cross-validation by edge sampling

Abstract: While many statistical models and methods are now available for network analysis, resampling network data remains a challenging problem. Cross-validation is a useful general tool for model selection and parameter tuning, but it is not directly applicable to networks, since splitting network nodes into groups requires deleting edges and destroys some of the network structure. Here we propose a new network resampling strategy, based on splitting node pairs rather than nodes, that is applicable to cross-validation for a wide range of network model selection tasks. We provide a theoretical justification for our method in a general setting and examples of how our method can be used in specific network model selection and parameter tuning tasks. Numerical results on simulated networks and on a citation network of statisticians show that this cross-validation approach works well for model selection.
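
The idea of splitting node pairs rather than nodes can be illustrated with a minimal sketch. This is not the authors' implementation. The function names `split_node_pairs` and `mask_adjacency`, the `holdout_fraction` parameter, and the use of `None` to mark held-out pairs are all assumptions made for illustration. How a model is fitted to the remaining pairs is left out, because the abstract does not specify it.

```python
import itertools
import random


def split_node_pairs(n_nodes, holdout_fraction=0.1, seed=0):
    """Randomly split all unordered node pairs (i, j), i < j,
    into a training set and a held-out set (hypothetical helper)."""
    pairs = list(itertools.combinations(range(n_nodes), 2))
    rng = random.Random(seed)
    rng.shuffle(pairs)
    n_holdout = int(round(holdout_fraction * len(pairs)))
    holdout = set(pairs[:n_holdout])
    train = set(pairs[n_holdout:])
    return train, holdout


def mask_adjacency(adj, holdout):
    """Return a copy of the adjacency matrix in which held-out
    pairs are marked as unknown (None). Every node is kept."""
    masked = [row[:] for row in adj]
    for i, j in holdout:
        masked[i][j] = None
        masked[j][i] = None
    return masked


# Toy undirected network: a path 0-1-2-3-4 on five nodes.
n = 5
adj = [[0] * n for _ in range(n)]
for i in range(n - 1):
    adj[i][i + 1] = adj[i + 1][i] = 1

train, holdout = split_node_pairs(n, holdout_fraction=0.2, seed=1)
masked = mask_adjacency(adj, holdout)
```

Because only entries of the adjacency matrix are hidden, all five nodes remain in `masked`. A node-based split would instead drop whole rows and columns, along with every edge attached to the removed nodes.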
Subjects: Methodology (stat.ME); Machine Learning (stat.ML)
Cite as: arXiv:1612.04717 [stat.ME]
  (or arXiv:1612.04717v7 [stat.ME] for this version)

Submission history

From: Tianxi Li
[v1] Wed, 14 Dec 2016 16:36:30 GMT (792kb,D)
[v2] Thu, 15 Dec 2016 19:34:19 GMT (800kb,D)
[v3] Sun, 1 Jan 2017 21:03:42 GMT (1268kb)
[v4] Wed, 13 Sep 2017 20:05:34 GMT (2009kb,D)
[v5] Wed, 9 May 2018 18:49:21 GMT (2018kb,D)
[v6] Thu, 14 Mar 2019 02:00:51 GMT (1915kb,D)
[v7] Fri, 1 May 2020 07:38:59 GMT (3505kb,D)
