Critical Points of Neural Networks: Analytical Forms and Landscape Properties

Zhou, Yi; Liang, Yingbin

Full-text links:

Download:

Current browse context:

stat.ML

< prev | next >

new | recent | 1710

Statistics > Machine Learning

Title: Critical Points of Neural Networks: Analytical Forms and Landscape Properties

Authors: Yi Zhou, Yingbin Liang

(Submitted on 30 Oct 2017)

Abstract: Due to the success of deep learning to solving a variety of challenging machine learning tasks, there is a rising interest in understanding loss functions for training neural networks from a theoretical aspect. Particularly, the properties of critical points and the landscape around them are of importance to determine the convergence performance of optimization algorithms. In this paper, we provide full (necessary and sufficient) characterization of the analytical forms for the critical points (as well as global minimizers) of the square loss functions for various neural networks. We show that the analytical forms of the critical points characterize the values of the corresponding loss functions as well as the necessary and sufficient conditions to achieve global minimum. Furthermore, we exploit the analytical forms of the critical points to characterize the landscape properties for the loss functions of these neural networks. One particular conclusion is that: The loss function of linear networks has no spurious local minimum, while the loss function of one-hidden-layer nonlinear networks with ReLU activation function does have local minimum that is not global minimum.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:1710.11205 [stat.ML]
	(or arXiv:1710.11205v1 [stat.ML] for this version)

Submission history

From: Yi Zhou [view email]
[v1] Mon, 30 Oct 2017 19:18:43 GMT (39kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> stat > arXiv:1710.11205

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Statistics > Machine Learning

Title: Critical Points of Neural Networks: Analytical Forms and Landscape Properties

Submission history