Title: Cross-validation Confidence Intervals for Test Error

Abstract: This work develops central limit theorems for cross-validation and consistent estimators of its asymptotic variance under weak stability conditions on the learning algorithm. Together, these results provide practical, asymptotically-exact confidence intervals for $k$-fold test error and valid, powerful hypothesis tests of whether one learning algorithm has smaller $k$-fold test error than another. These results are also the first of their kind for the popular choice of leave-one-out cross-validation. In our real-data experiments with diverse learning algorithms, the resulting intervals and tests outperform the most popular alternative methods from the literature.
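As a rough illustration of the kind of interval the abstract describes, the sketch below builds a normal-approximation confidence interval around a $k$-fold cross-validation error. It is a minimal sketch under stated assumptions, not the paper's procedure: the variance estimate used here is simply the plug-in variance of the per-point held-out losses, and the abstract does not give the exact estimators or the stability conditions under which such an interval is valid. The names `kfold_indices`, `cv_confidence_interval`, `fit_mean`, and `squared_error` are hypothetical, and the toy learner (a sample-mean predictor with squared-error loss) is chosen only to keep the example self-contained.

```python
import math
import random
from statistics import NormalDist


def kfold_indices(n, k, seed=0):
    """Shuffle 0..n-1 and split the indices into k nearly equal folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[j::k] for j in range(k)]


def cv_confidence_interval(data, fit, loss, k=10, alpha=0.05, seed=0):
    """Return (cv_error, lower, upper) for a normal-approximation interval.

    fit(train) returns a fitted model; loss(model, point) returns a float.
    Both callables are hypothetical and supplied by the caller.
    """
    n = len(data)
    losses = [0.0] * n
    for fold in kfold_indices(n, k, seed):
        held_out = set(fold)
        train = [data[i] for i in range(n) if i not in held_out]
        model = fit(train)
        # Each point is scored by a model that never saw it.
        for i in fold:
            losses[i] = loss(model, data[i])
    cv_error = sum(losses) / n
    # Assumption: plug-in variance of the per-point held-out losses.
    variance = sum((l - cv_error) ** 2 for l in losses) / n
    z = NormalDist().inv_cdf(1 - alpha / 2)
    half_width = z * math.sqrt(variance / n)
    return cv_error, cv_error - half_width, cv_error + half_width


def fit_mean(train):
    """Toy learner: predict the training mean."""
    return sum(train) / len(train)


def squared_error(model, y):
    """Squared-error loss of the constant prediction on one point."""
    return (y - model) ** 2


rng = random.Random(1)
sample = [rng.gauss(0.0, 1.0) for _ in range(200)]
err, lo, hi = cv_confidence_interval(sample, fit_mean, squared_error, k=10)
print(f"CV error {err:.3f}, 95% interval [{lo:.3f}, {hi:.3f}]")
```

Setting `k` equal to `len(data)` gives the leave-one-out case mentioned in the abstract. Whether this plug-in interval is appropriate there depends on the conditions established in the paper itself.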
Comments: 34th Conference on Neural Information Processing Systems (NeurIPS 2020); 40 pages, 15 figures
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST)
Cite as: arXiv:2007.12671 [stat.ML]
  (or arXiv:2007.12671v2 [stat.ML] for this version)

Submission history

From: Pierre Bayle
[v1] Fri, 24 Jul 2020 17:40:06 GMT (262kb,D)
[v2] Sat, 31 Oct 2020 17:24:26 GMT (278kb,D)
