We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ME

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Methodology

Title: Cross-Validation for Correlated Data

Abstract: K-fold cross-validation (CV) with squared error loss is widely used for evaluating predictive models, especially when strong distributional data assumptions cannot be taken. However, CV with squared error loss is not free from distributional assumptions, in particular in cases involving non-i.i.d data. This paper analyzes CV for correlated data. We present a criterion for suitability of CV, and introduce a bias corrected cross-validation prediction error estimator, $CV_c$, which is suitable in many settings involving correlated data, where CV is invalid. Our theoretical results are also demonstrated numerically.
Subjects: Methodology (stat.ME); Machine Learning (stat.ML)
Cite as: arXiv:1904.02438 [stat.ME]
  (or arXiv:1904.02438v2 [stat.ME] for this version)

Submission history

From: Assaf Rabinowicz [view email]
[v1] Thu, 4 Apr 2019 09:58:35 GMT (99kb,D)
[v2] Sun, 12 May 2019 10:26:05 GMT (50kb,D)
[v3] Fri, 24 Apr 2020 15:38:35 GMT (263kb,D)

Link back to: arXiv, form interface, contact.