Online Natural Gradient as a Kalman Filter

Ollivier, Yann

(Submitted on 1 Mar 2017 (this version), latest version 27 Aug 2018 (v3))

Abstract: We review the relationship between Kalman filtering and Amari's natural gradient in statistical learning. Namely, using online natural gradient descent on the data log-likelihood to estimate the parameter of a probabilistic model from a series of observations is exactly equivalent to using an extended Kalman filter to estimate that parameter (assumed to have constant dynamics). In the non-recurrent case, this relation is a consequence of the "information filter" phrasing of the Kalman filter.
In the recurrent case, we prove that the joint Kalman filter over states and parameters is a natural gradient on top of real-time recurrent learning (RTRL), a classical algorithm to train recurrent models.
This correspondence provides relevant settings for natural gradient hyperparameters such as learning rates or initialization and regularization of the Fisher information matrix.
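
The non-recurrent equivalence can be checked numerically in the simplest setting. The sketch below is not taken from the paper: it assumes a linear-Gaussian model y = x·θ + noise with known noise variance, where the extended Kalman filter reduces to the ordinary Kalman filter, and all function and variable names (`kalman_static`, `online_natural_gradient`, `noise_var`, `P0`, `J0`) are illustrative. The natural gradient side accumulates the Fisher information matrix starting from `J0`, the inverse of the Kalman prior covariance `P0`, which is one way to read the abstract's remark on initializing the Fisher matrix.

```python
import numpy as np

def kalman_static(xs, ys, theta0, P0, noise_var=1.0):
    """Kalman filter for y = x @ theta + noise, with theta held constant."""
    theta, P = theta0.copy(), P0.copy()
    for x, y in zip(xs, ys):
        S = x @ P @ x + noise_var            # innovation variance (scalar)
        K = P @ x / S                        # Kalman gain
        theta = theta + K * (y - x @ theta)  # correct with the prediction error
        P = P - np.outer(K, x @ P)           # posterior covariance
    return theta

def online_natural_gradient(xs, ys, theta0, J0, noise_var=1.0):
    """Online natural gradient ascent on the Gaussian log-likelihood."""
    theta, J = theta0.copy(), J0.copy()
    for x, y in zip(xs, ys):
        J = J + np.outer(x, x) / noise_var        # accumulated Fisher information
        grad = x * (y - x @ theta) / noise_var    # gradient of log p(y | x, theta)
        theta = theta + np.linalg.solve(J, grad)  # precondition by inverse Fisher
    return theta

# Synthetic data (assumed values, for illustration only).
rng = np.random.default_rng(0)
true_theta = np.array([1.0, -2.0, 0.5])
xs = rng.normal(size=(50, 3))
ys = xs @ true_theta + rng.normal(size=50)

theta0 = np.zeros(3)
P0 = np.eye(3)                               # Kalman prior covariance
theta_kf = kalman_static(xs, ys, theta0, P0)
theta_ng = online_natural_gradient(xs, ys, theta0, np.linalg.inv(P0))

# The two estimates should agree up to floating-point error.
print(np.allclose(theta_kf, theta_ng))
```

In this sketch the agreement follows from the Sherman-Morrison identity: the Kalman gain equals the inverse of the accumulated Fisher matrix applied to x, divided by the noise variance. This is the "information filter" form of the update in the linear case.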

Subjects:	Machine Learning (stat.ML); Optimization and Control (math.OC)
Cite as:	arXiv:1703.00209 [stat.ML]
	(or arXiv:1703.00209v1 [stat.ML] for this version)

Submission history

From: Yann Ollivier
[v1] Wed, 1 Mar 2017 10:13:52 GMT (26kb)
[v2] Thu, 27 Apr 2017 16:45:48 GMT (27kb)
[v3] Mon, 27 Aug 2018 18:45:10 GMT (33kb)
