We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ML

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Machine Learning

Title: Convex Geometry and Duality of Over-parameterized Neural Networks

Abstract: We develop a convex analytic approach to analyze finite width two-layer ReLU networks. We first prove that an optimal solution to the regularized training problem can be characterized as extreme points of a convex set, where simple solutions are encouraged via its convex geometrical properties. We then leverage this characterization to show that an optimal set of parameters yield linear spline interpolation for regression problems involving one dimensional or rank-one data. We also characterize the classification decision regions in terms of a kernel matrix and minimum $\ell_1$-norm solutions. This is in contrast to Neural Tangent Kernel which is unable to explain predictions of finite width networks. Our convex geometric characterization also provides intuitive explanations of hidden neurons as auto-encoders. In higher dimensions, we show that the training problem can be cast as a finite dimensional convex problem with infinitely many constraints. Then, we apply certain convex relaxations and introduce a cutting-plane algorithm to globally optimize the network. We further analyze the exactness of the relaxations to provide conditions for the convergence to a global optimum. Our analysis also shows that optimal network parameters can be also characterized as interpretable closed-form formulas in some practically relevant special cases.
Comments: Accepted to the Journal of Machine Learning Research (JMLR)
Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:2002.11219 [cs.LG]
  (or arXiv:2002.11219v4 [cs.LG] for this version)

Submission history

From: Tolga Ergen [view email]
[v1] Tue, 25 Feb 2020 23:05:33 GMT (2338kb)
[v2] Sat, 11 Apr 2020 22:41:11 GMT (2447kb)
[v3] Thu, 24 Dec 2020 06:33:08 GMT (2664kb)
[v4] Tue, 31 Aug 2021 02:13:00 GMT (2093kb)

Link back to: arXiv, form interface, contact.