Title: Approximation by Combinations of ReLU and Squared ReLU Ridge Functions with $ \ell^1 $ and $ \ell^0 $ Controls

Abstract: We establish $ L^{\infty} $ and $ L^2 $ error bounds for functions of many variables that are approximated by linear combinations of ReLU (rectified linear unit) and squared ReLU ridge functions with $ \ell^1 $ and $ \ell^0 $ controls on their inner and outer parameters. With the squared ReLU ridge function, we show that the $ L^2 $ approximation error is inversely proportional to the inner layer $ \ell^0 $ sparsity and it need only be sublinear in the outer layer $ \ell^0 $ sparsity. Our constructions are obtained using a variant of the Jones-Barron probabilistic method, which can be interpreted as either stratified sampling with proportionate allocation or two-stage cluster sampling. We also provide companion error lower bounds that reveal near optimality of our constructions. Despite the sparsity assumptions, we showcase the richness and flexibility of these ridge combinations by defining a large family of functions, in terms of certain spectral conditions, that are particularly well approximated by them.
Subjects: Machine Learning (stat.ML); Statistics Theory (math.ST)
MSC classes: 62M45, 41A15
Cite as: arXiv:1607.07819 [stat.ML]
  (or arXiv:1607.07819v3 [stat.ML] for this version)

Submission history

From: Jason Klusowski M
[v1] Tue, 26 Jul 2016 17:52:00 GMT (11kb)
[v2] Mon, 8 Aug 2016 14:41:26 GMT (11kb)
[v3] Wed, 23 May 2018 22:02:46 GMT (18kb)