We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ML

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Statistics > Machine Learning

Title: Online Natural Gradient as a Kalman Filter

Authors: Yann Ollivier
Abstract: We review the relationship between Kalman filtering and Amari's natural gradient in statistical learning. Namely, using an online natural gradient descent on data log-likelihood to evaluate the parameter of a probabilistic model given a series of observations, is exactly equivalent to using an extended Kalman filter to estimate the parameter (assumed to have constant dynamics). In the non-recurrent case, this relation is a consequence of the "information filter" phrasing of the Kalman filter.
In the recurrent case, we prove that the joint Kalman filter over states and parameters is a natural gradient on top of real-time recurrent learning (RTRL), a classical algorithm to train recurrent models.
This correspondence provides relevant settings for natural gradient hyperparameters such as learning rates or initialization and regularization of the Fisher information matrix.
Subjects: Machine Learning (stat.ML); Optimization and Control (math.OC)
Cite as: arXiv:1703.00209 [stat.ML]
  (or arXiv:1703.00209v1 [stat.ML] for this version)

Submission history

From: Yann Ollivier [view email]
[v1] Wed, 1 Mar 2017 10:13:52 GMT (26kb)
[v2] Thu, 27 Apr 2017 16:45:48 GMT (27kb)
[v3] Mon, 27 Aug 2018 18:45:10 GMT (33kb)

Link back to: arXiv, form interface, contact.