We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

math.ST

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Mathematics > Statistics Theory

Title: Cross-validation

Authors: Sylvain Arlot (SELECT, LMO)
Abstract: This text is a survey on cross-validation. We define all classical cross-validation procedures, and we study their properties for two different goals: estimating the risk of a given estimator, and selecting the best estimator among a given family. For the risk estimation problem, we compute the bias (which can also be corrected) and the variance of cross-validation methods. For estimator selection, we first provide a first-order analysis (based on expectations). Then, we explain how to take into account second-order terms (from variance computations, and by taking into account the usefulness of overpenalization). This allows, in the end, to provide some guidelines for choosing the best cross-validation method for a given learning problem.
Comments: in French
Subjects: Statistics Theory (math.ST); Machine Learning (stat.ML)
Cite as: arXiv:1703.03167 [math.ST]
  (or arXiv:1703.03167v1 [math.ST] for this version)

Submission history

From: Sylvain Arlot [view email]
[v1] Thu, 9 Mar 2017 07:40:53 GMT (40kb,D)

Link back to: arXiv, form interface, contact.