We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: Approximation and Gradient Descent Training with Neural Networks

Authors: G. Welper
Abstract: It is well understood that neural networks with carefully hand-picked weights provide powerful function approximation and that they can be successfully trained in over-parametrized regimes. Since over-parametrization ensures zero training error, these two theories are not immediately compatible. Recent work uses the smoothness that is required for approximation results to extend a neural tangent kernel (NTK) optimization argument to an under-parametrized regime and show direct approximation bounds for networks trained by gradient flow. Since gradient flow is only an idealization of a practical method, this paper establishes analogous results for networks trained by gradient descent.
Subjects: Machine Learning (cs.LG)
MSC classes: 41A46, 65K10, 68T07
Cite as: arXiv:2405.11696 [cs.LG]
  (or arXiv:2405.11696v1 [cs.LG] for this version)

Submission history

From: Gerrit Welper [view email]
[v1] Sun, 19 May 2024 23:04:09 GMT (19kb,D)

Link back to: arXiv, form interface, contact.