On the Influence of Momentum Acceleration on Online Learning

Yuan, Kun; Ying, Bicheng; Sayed, Ali H.

Full-text links:

Download:

Current browse context:

math.OC

< prev | next >

new | recent | 1603

Mathematics > Optimization and Control

Title: On the Influence of Momentum Acceleration on Online Learning

Authors: Kun Yuan, Bicheng Ying, Ali H. Sayed

(Submitted on 14 Mar 2016 (this version), latest version 12 Oct 2016 (v4))

Abstract: The article examines in some detail the convergence rate and mean-square-error performance of momentum stochastic gradient methods in the constant step-size case and slow adaptation regime. The results establish that momentum methods are equivalent to the standard stochastic gradient method with a re-scaled (larger) step-size value. The size of the re-scaling is determined by the value of the momentum parameter. The equivalence result is established for all time instants and not only in steady-state. The analysis is carried out for general risk functions, and is not limited to quadratic risks. One notable conclusion is that the well-known benefits of momentum constructions for deterministic optimization problems do not necessarily carry over to the stochastic (online) setting when adaptation becomes necessary and when the true gradient vectors are not known beforehand. The analysis also suggests a method to retain some of the advantages of the momentum construction by employing a decaying momentum parameter, as opposed to a decaying step-size. In this way, the enhanced convergence rate during the initial stages of adaptation is preserved without the often-observed degradation in MSD performance.

Comments:	50 pages, 8 figures
Subjects:	Optimization and Control (math.OC); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1603.04136 [math.OC]
	(or arXiv:1603.04136v1 [math.OC] for this version)

Submission history

From: Kun Yuan [view email]
[v1] Mon, 14 Mar 2016 05:05:54 GMT (2598kb)
[v2] Tue, 29 Mar 2016 06:27:47 GMT (2598kb)
[v3] Mon, 1 Aug 2016 23:18:27 GMT (318kb)
[v4] Wed, 12 Oct 2016 05:19:07 GMT (551kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> math > arXiv:1603.04136v1

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Mathematics > Optimization and Control

Title: On the Influence of Momentum Acceleration on Online Learning

Submission history