Approximation and Gradient Descent Training with Neural Networks

Welper, G.

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2405

Change to browse by:

Computer Science > Machine Learning

Title: Approximation and Gradient Descent Training with Neural Networks

Authors: G. Welper

(Submitted on 19 May 2024)

Abstract: It is well understood that neural networks with carefully hand-picked weights provide powerful function approximation and that they can be successfully trained in over-parametrized regimes. Since over-parametrization ensures zero training error, these two theories are not immediately compatible. Recent work uses the smoothness that is required for approximation results to extend a neural tangent kernel (NTK) optimization argument to an under-parametrized regime and show direct approximation bounds for networks trained by gradient flow. Since gradient flow is only an idealization of a practical method, this paper establishes analogous results for networks trained by gradient descent.

Subjects:	Machine Learning (cs.LG)
MSC classes:	41A46, 65K10, 68T07
Cite as:	arXiv:2405.11696 [cs.LG]
	(or arXiv:2405.11696v1 [cs.LG] for this version)

Submission history

From: Gerrit Welper [view email]
[v1] Sun, 19 May 2024 23:04:09 GMT (19kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2405.11696

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Approximation and Gradient Descent Training with Neural Networks

Submission history