Locality defeats the curse of dimensionality in convolutional teacher-student scenarios

Favero, Alessandro; Cagnetta, Francesco; Wyart, Matthieu

doi:10.1088/1742-5468/ac98ab

Full-text links:

Download:

Current browse context:

stat.ML

< prev | next >

new | recent | 2106

Statistics > Machine Learning

Title: Locality defeats the curse of dimensionality in convolutional teacher-student scenarios

Authors: Alessandro Favero, Francesco Cagnetta, Matthieu Wyart

(Submitted on 16 Jun 2021 (v1), last revised 12 Nov 2021 (this version, v3))

Abstract: Convolutional neural networks perform a local and translationally-invariant treatment of the data: quantifying which of these two aspects is central to their success remains a challenge. We study this problem within a teacher-student framework for kernel regression, using `convolutional' kernels inspired by the neural tangent kernel of simple convolutional architectures of given filter size. Using heuristic methods from physics, we find in the ridgeless case that locality is key in determining the learning curve exponent $\beta$ (that relates the test error $\epsilon_t\sim P^{-\beta}$ to the size of the training set $P$), whereas translational invariance is not. In particular, if the filter size of the teacher $t$ is smaller than that of the student $s$, $\beta$ is a function of $s$ only and does not depend on the input dimension. We confirm our predictions on $\beta$ empirically. We conclude by proving, using a natural universality assumption, that performing kernel regression with a ridge that decreases with the size of the training set leads to similar learning curve exponents to those we obtain in the ridgeless case.

Comments:	32 pages, 7 figures
Subjects:	Machine Learning (stat.ML); Disordered Systems and Neural Networks (cond-mat.dis-nn); Machine Learning (cs.LG)
DOI:	10.1088/1742-5468/ac98ab
Cite as:	arXiv:2106.08619 [stat.ML]
	(or arXiv:2106.08619v3 [stat.ML] for this version)

Submission history

From: Francesco Cagnetta [view email]
[v1] Wed, 16 Jun 2021 08:27:31 GMT (101kb,D)
[v2] Mon, 1 Nov 2021 10:44:17 GMT (196kb,D)
[v3] Fri, 12 Nov 2021 13:32:27 GMT (196kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> stat > arXiv:2106.08619

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Statistics > Machine Learning

Title: Locality defeats the curse of dimensionality in convolutional teacher-student scenarios

Submission history