Can Shallow Neural Networks Beat the Curse of Dimensionality? A mean field training perspective

Wojtowytsch, Stephan; E, Weinan

Full-text links:

Download:

Current browse context:

math

< prev | next >

new | recent | 2005

Computer Science > Machine Learning

Title: Can Shallow Neural Networks Beat the Curse of Dimensionality? A mean field training perspective

Authors: Stephan Wojtowytsch, Weinan E

(Submitted on 21 May 2020)

Abstract: We prove that the gradient descent training of a two-layer neural network on empirical or population risk may not decrease population risk at an order faster than $t^{-4/(d-2)}$ under mean field scaling. Thus gradient descent training for fitting reasonably smooth, but truly high-dimensional data may be subject to the curse of dimensionality. We present numerical evidence that gradient descent training with general Lipschitz target functions becomes slower and slower as the dimension increases, but converges at approximately the same rate in all dimensions when the target function lies in the natural function space for two-layer ReLU networks.

Comments:	5 figures
Subjects:	Machine Learning (cs.LG); Analysis of PDEs (math.AP); Machine Learning (stat.ML)
MSC classes:	68T07, 49Q22, 68W25
Cite as:	arXiv:2005.10815 [cs.LG]
	(or arXiv:2005.10815v1 [cs.LG] for this version)

Submission history

From: Stephan Wojtowytsch [view email]
[v1] Thu, 21 May 2020 17:50:15 GMT (575kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2005.10815

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Computer Science > Machine Learning

Title: Can Shallow Neural Networks Beat the Curse of Dimensionality? A mean field training perspective

Submission history