The empirical size of trained neural networks

Chen, Kevin K.; Gamst, Anthony; Walker, Alden

Full-text links:

Download:

Current browse context:

stat.ML

< prev | next >

new | recent | 1611

Statistics > Machine Learning

Title: The empirical size of trained neural networks

Authors: Kevin K. Chen, Anthony Gamst, Alden Walker

(Submitted on 29 Nov 2016)

Abstract: ReLU neural networks define piecewise linear functions of their inputs. However, initializing and training a neural network is very different from fitting a linear spline. In this paper, we expand empirically upon previous theoretical work to demonstrate features of trained neural networks. Standard network initialization and training produce networks vastly simpler than a naive parameter count would suggest and can impart odd features to the trained network. However, we also show the forced simplicity is beneficial and, indeed, critical for the wide success of these networks.

Comments:	6 pages, 5 figures
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:1611.09444 [stat.ML]
	(or arXiv:1611.09444v1 [stat.ML] for this version)

Submission history

From: Alden Walker [view email]
[v1] Tue, 29 Nov 2016 00:39:45 GMT (1366kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> stat > arXiv:1611.09444

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Statistics > Machine Learning

Title: The empirical size of trained neural networks

Submission history