We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ML

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Machine Learning

Title: On the Expressive Power of Deep Neural Networks

Abstract: We propose a new approach to the problem of neural network expressivity, which seeks to characterize how structural properties of a neural network family affect the functions it is able to compute. Our approach is based on an interrelated set of measures of expressivity, unified by the novel notion of trajectory length, which measures how the output of a network changes as the input sweeps along a one-dimensional path. Our findings can be summarized as follows:
(1) The complexity of the computed function grows exponentially with depth.
(2) All weights are not equal: trained networks are more sensitive to their lower (initial) layer weights.
(3) Regularizing on trajectory length (trajectory regularization) is a simpler alternative to batch normalization, with the same performance.
Comments: Accepted to ICML 2017
Subjects: Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as: arXiv:1606.05336 [stat.ML]
  (or arXiv:1606.05336v6 [stat.ML] for this version)

Submission history

From: Maithra Raghu [view email]
[v1] Thu, 16 Jun 2016 19:55:29 GMT (1051kb,D)
[v2] Fri, 24 Jun 2016 20:26:47 GMT (1789kb,D)
[v3] Wed, 17 Aug 2016 22:21:25 GMT (2267kb,D)
[v4] Mon, 3 Oct 2016 15:44:39 GMT (951kb,D)
[v5] Wed, 1 Mar 2017 03:00:26 GMT (3798kb,D)
[v6] Sun, 18 Jun 2017 13:24:34 GMT (4061kb,D)

Link back to: arXiv, form interface, contact.