Statistics > Methodology
Title: Cross-Validation for Correlated Data
(Submitted on 4 Apr 2019 (v1), revised 12 May 2019 (this version, v2), latest version 24 Apr 2020 (v3))
Abstract: K-fold cross-validation (CV) with squared error loss is widely used for evaluating predictive models, especially when strong distributional assumptions about the data cannot be made. However, CV with squared error loss is not free from distributional assumptions, particularly in cases involving non-i.i.d. data. This paper analyzes CV for correlated data. We present a criterion for the suitability of CV and introduce a bias-corrected cross-validation prediction error estimator, $CV_c$, which is suitable in many settings involving correlated data where CV is invalid. Our theoretical results are also demonstrated numerically.
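For readers unfamiliar with the baseline procedure the abstract refers to, the following is a minimal sketch of plain K-fold CV with squared error loss, applied to simulated clustered (correlated) data. It does not implement the paper's $CV_c$ correction, whose definition is not given in the abstract. The function name `kfold_cv_mse`, the ordinary least squares model, and the random-intercept simulation are all assumptions chosen for illustration.

```python
import numpy as np

def kfold_cv_mse(X, y, k=5, seed=0):
    """Plain K-fold CV estimate of squared-error prediction loss for OLS.

    Illustrative helper (hypothetical name); not the paper's CV_c estimator.
    """
    rng = np.random.default_rng(seed)
    n = len(y)
    # Randomly partition the observation indices into k folds.
    folds = np.array_split(rng.permutation(n), k)
    sq_errors = np.empty(n)
    for test_idx in folds:
        train_mask = np.ones(n, dtype=bool)
        train_mask[test_idx] = False
        # Fit OLS on the training folds only.
        beta, *_ = np.linalg.lstsq(X[train_mask], y[train_mask], rcond=None)
        # Squared prediction error on the held-out fold.
        sq_errors[test_idx] = (y[test_idx] - X[test_idx] @ beta) ** 2
    return sq_errors.mean()

# Assumed toy data: 20 clusters of 10 observations, each cluster sharing
# a random intercept, so observations within a cluster are correlated.
rng = np.random.default_rng(1)
n_clusters, per_cluster = 20, 10
n = n_clusters * per_cluster
cluster = np.repeat(np.arange(n_clusters), per_cluster)
X = np.column_stack([np.ones(n), rng.normal(size=n)])
u = rng.normal(size=n_clusters)[cluster]  # shared within-cluster effect
y = X @ np.array([1.0, 2.0]) + u + rng.normal(size=n)

cv_estimate = kfold_cv_mse(X, y, k=5)
print(f"K-fold CV squared-error estimate: {cv_estimate:.3f}")
```

Because the folds here are drawn at random across clusters, held-out observations share a cluster effect with training observations. This is the kind of non-i.i.d. setting in which, according to the abstract, standard CV may not be valid.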
Submission history
From: Assaf Rabinowicz
[v1] Thu, 4 Apr 2019 09:58:35 GMT (99kb,D)
[v2] Sun, 12 May 2019 10:26:05 GMT (50kb,D)
[v3] Fri, 24 Apr 2020 15:38:35 GMT (263kb,D)