True Asymptotic Natural Gradient Optimization

Ollivier, Yann

Full-text links:

Download:

Current browse context:

stat

< prev | next >

new | recent | 1712

Statistics > Machine Learning

Title: True Asymptotic Natural Gradient Optimization

Authors: Yann Ollivier

(Submitted on 22 Dec 2017)

Abstract: We introduce a simple algorithm, True Asymptotic Natural Gradient Optimization (TANGO), that converges to a true natural gradient descent in the limit of small learning rates, without explicit Fisher matrix estimation.
For quadratic models the algorithm is also an instance of averaged stochastic gradient, where the parameter is a moving average of a "fast", constant-rate gradient descent. TANGO appears as a particular de-linearization of averaged SGD, and is sometimes quite different on non-quadratic models. This further connects averaged SGD and natural gradient, both of which are arguably optimal asymptotically.
In large dimension, small learning rates will be required to approximate the natural gradient well. Still, this shows it is possible to get arbitrarily close to exact natural gradient descent with a lightweight algorithm.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC)
Cite as:	arXiv:1712.08449 [stat.ML]
	(or arXiv:1712.08449v1 [stat.ML] for this version)

Submission history

From: Yann Ollivier [view email]
[v1] Fri, 22 Dec 2017 14:04:04 GMT (75kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> stat > arXiv:1712.08449

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Statistics > Machine Learning

Title: True Asymptotic Natural Gradient Optimization

Submission history