We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: Learning Curves for Analysis of Deep Networks

Abstract: Learning curves model a classifier's test error as a function of the number of training samples. Prior works show that learning curves can be used to select model parameters and extrapolate performance. We investigate how to use learning curves to evaluate design choices, such as pretraining, architecture, and data augmentation. We propose a method to robustly estimate learning curves, abstract their parameters into error and data-reliance, and evaluate the effectiveness of different parameterizations. Our experiments exemplify use of learning curves for analysis and yield several interesting observations.
Comments: Improved text and figure organization, additional experiments on optimization
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
Cite as: arXiv:2010.11029 [cs.LG]
  (or arXiv:2010.11029v2 [cs.LG] for this version)

Submission history

From: Derek Hoiem [view email]
[v1] Wed, 21 Oct 2020 14:20:05 GMT (6043kb,D)
[v2] Mon, 5 Apr 2021 17:01:02 GMT (10589kb,D)

Link back to: arXiv, form interface, contact.