We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ML

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Machine Learning

Title: Spectral complexity of deep neural networks

Abstract: It is well-known that randomly initialized, push-forward, fully-connected neural networks weakly converge to isotropic Gaussian processes, in the limit where the width of all layers goes to infinity. In this paper, we propose to use the angular power spectrum of the limiting field to characterize the complexity of the network architecture. In particular, we define sequences of random variables associated with the angular power spectrum, and provide a full characterization of the network complexity in terms of the asymptotic distribution of these sequences as the depth diverges. On this basis, we classify neural networks as low-disorder, sparse, or high-disorder; we show how this classification highlights a number of distinct features for standard activation functions, and in particular, sparsity properties of ReLU networks. Our theoretical results are also validated by numerical simulations.
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Probability (math.PR)
MSC classes: 68T07, 60G60, 33C55, 62M15
Cite as: arXiv:2405.09541 [stat.ML]
  (or arXiv:2405.09541v1 [stat.ML] for this version)

Submission history

From: Stefano Vigogna [view email]
[v1] Wed, 15 May 2024 17:55:05 GMT (2217kb,D)

Link back to: arXiv, form interface, contact.