Guarantees for Tuning the Step Size using a Learning-to-Learn Approach

Wang, Xiang; Yuan, Shuai; Wu, Chenwei; Ge, Rong

Full-text links:

Download:

Current browse context:

stat.ML

< prev | next >

new | recent | 2006

Statistics > Machine Learning

Title: Guarantees for Tuning the Step Size using a Learning-to-Learn Approach

Authors: Xiang Wang, Shuai Yuan, Chenwei Wu, Rong Ge

(Submitted on 30 Jun 2020 (this version), latest version 11 Jun 2021 (v2))

Abstract: Learning-to-learn (using optimization algorithms to learn a new optimizer) has successfully trained efficient optimizers in practice. This approach relies on meta-gradient descent on a meta-objective based on the trajectory that the optimizer generates. However, there were few theoretical guarantees on how to avoid meta-gradient explosion/vanishing problems, or how to train an optimizer with good generalization performance. In this paper, we study the learning-to-learn approach on a simple problem of tuning the step size for quadratic loss. Our results show that although there is a way to design the meta-objective so that the meta-gradient remain polynomially bounded, computing the meta-gradient directly using backpropagation leads to numerical issues that look similar to gradient explosion/vanishing problems. We also characterize when it is necessary to compute the meta-objective on a separate validation set instead of the original training set. Finally, we verify our results empirically and show that a similar phenomenon appears even for more complicated learned optimizers parametrized by neural networks.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:2006.16495 [stat.ML]
	(or arXiv:2006.16495v1 [stat.ML] for this version)

Submission history

From: Xiang Wang [view email]
[v1] Tue, 30 Jun 2020 02:59:35 GMT (256kb)
[v2] Fri, 11 Jun 2021 04:21:42 GMT (1355kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> stat > arXiv:2006.16495v1

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Statistics > Machine Learning

Title: Guarantees for Tuning the Step Size using a Learning-to-Learn Approach

Submission history