Automatic, Dynamic, and Nearly Optimal Learning Rate Specification by Local Quadratic Approximation

Zhu, Yingqiu; Chen, Yu; Huang, Danyang; Zhang, Bo; Wang, Hansheng

(Submitted on 7 Apr 2020)

Abstract: In deep learning tasks, the learning rate determines the update step size in each iteration, which plays a critical role in gradient-based optimization. However, the determination of the appropriate learning rate in practice typically relies on subjective judgement. In this work, we propose a novel optimization method based on local quadratic approximation (LQA). In each update step, given the gradient direction, we locally approximate the loss function by a standard quadratic function of the learning rate. Then, we propose an approximation step to obtain a nearly optimal learning rate in a computationally efficient way. The proposed LQA method has three important features. First, the learning rate is automatically determined in each update step. Second, it is dynamically adjusted according to the current loss function value and the parameter estimates. Third, with the gradient direction fixed, the proposed method leads to nearly the greatest reduction in terms of the loss function. Extensive experiments have been conducted to demonstrate the strengths of the proposed LQA method.
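To make the idea of a quadratic in the learning rate concrete, here is a minimal sketch and not the authors' implementation. The abstract does not say how the quadratic is obtained, so this example assumes a central finite-difference fit: with the gradient g fixed, the loss along the update direction, f(d) = loss(theta - d * g), is probed at d = -probe, 0, and +probe, and the minimizer of the fitted quadratic is taken as the learning rate. The names lqa_step_size, probe, and fallback, and the toy quadratic loss, are hypothetical and chosen only for illustration.

```python
import numpy as np


def lqa_step_size(loss, theta, grad, probe=1e-2, fallback=1e-3):
    """Estimate a learning rate along -grad from a local quadratic fit.

    Assumption (not taken from the abstract): the first and second
    derivatives of f(d) = loss(theta - d * grad) at d = 0 are estimated
    by central finite differences with spacing `probe`.
    """
    f0 = loss(theta)
    f_plus = loss(theta - probe * grad)   # f(+probe)
    f_minus = loss(theta + probe * grad)  # f(-probe)
    d1 = (f_plus - f_minus) / (2.0 * probe)         # estimate of f'(0)
    d2 = (f_plus + f_minus - 2.0 * f0) / probe**2   # estimate of f''(0)
    if d2 <= 0.0:
        # The fitted quadratic has no minimum; use a small fixed step.
        return fallback
    return -d1 / d2  # minimizer of f(0) + d1 * d + 0.5 * d2 * d**2


# Toy problem: loss(theta) = 0.5 * theta^T A theta - b^T theta.
A = np.array([[3.0, 0.5], [0.5, 1.0]])
b = np.array([1.0, -2.0])


def quad_loss(theta):
    return 0.5 * theta @ A @ theta - b @ theta


def quad_grad(theta):
    return A @ theta - b


theta = np.zeros(2)
for _ in range(50):
    g = quad_grad(theta)
    if np.linalg.norm(g) < 1e-4:
        break
    theta = theta - lqa_step_size(quad_loss, theta, g) * g
```

On this toy loss the fit is exact, so each step coincides with an exact line search along the negative gradient. For a general loss the fit is only local, and each update costs two extra loss evaluations in this sketch.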

Comments:	10 pages, 5 figures
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
MSC classes:	62-08, 41A99
ACM classes:	G.0; I.0
Cite as:	arXiv:2004.03260 [stat.ML]
	(or arXiv:2004.03260v1 [stat.ML] for this version)

Submission history

From: Danyang Huang
[v1] Tue, 7 Apr 2020 10:55:12 GMT (795kb,D)
