We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: Sample Complexity versus Depth: An Information Theoretic Analysis

Abstract: Deep learning has proven effective across a range of data sets. In light of this, a natural inquiry is: "for what data generating processes can deep learning succeed?" In this work, we study the sample complexity of learning multilayer data generating processes of a sort for which deep neural networks seem to be suited. We develop general and elegant information-theoretic tools that accommodate analysis of any data generating process -- shallow or deep, parametric or nonparametric, noiseless or noisy. We then use these tools to characterize the dependence of sample complexity on the depth of multilayer processes. Our results indicate roughly linear dependence on depth. This is in contrast to previous results that suggest exponential or high-order polynomial dependence.
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as: arXiv:2203.00246 [cs.LG]
  (or arXiv:2203.00246v3 [cs.LG] for this version)

Submission history

From: Hong Jun Jeon [view email]
[v1] Tue, 1 Mar 2022 05:58:28 GMT (141kb,D)
[v2] Fri, 4 Mar 2022 23:53:39 GMT (141kb,D)
[v3] Thu, 7 Apr 2022 22:33:12 GMT (141kb,D)
[v4] Sun, 22 May 2022 20:29:40 GMT (278kb,D)
[v5] Tue, 7 Jun 2022 21:13:39 GMT (932kb,D)
[v6] Fri, 24 Mar 2023 19:48:25 GMT (20342kb,D)

Link back to: arXiv, form interface, contact.