Optimal Reinforcement Learning for Gaussian Systems

Hennig, Philipp

Full-text links:

Download:

Current browse context:

stat.ML

< prev | next >

new | recent | 1106

Statistics > Machine Learning

Title: Optimal Reinforcement Learning for Gaussian Systems

Authors: Philipp Hennig

(Submitted on 4 Jun 2011 (v1), last revised 14 Oct 2011 (this version, v3))

Abstract: The exploration-exploitation trade-off is among the central challenges of reinforcement learning. The optimal Bayesian solution is intractable in general. This paper studies to what extent analytic statements about optimal learning are possible if all beliefs are Gaussian processes. A first order approximation of learning of both loss and dynamics, for nonlinear, time-varying systems in continuous time and space, subject to a relatively weak restriction on the dynamics, is described by an infinite-dimensional partial differential equation. An approximate finite-dimensional projection gives an impression for how this result may be helpful.

Comments:	final pre-conference version of this NIPS 2011 paper. Once again, please note some nontrivial changes to exposition and interpretation of the results, in particular in Equation (9) and Eqs. 11-14. The algorithm and results have remained the same, but their theoretical interpretation has changed
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:1106.0800 [stat.ML]
	(or arXiv:1106.0800v3 [stat.ML] for this version)

Submission history

From: Philipp Hennig PhD [view email]
[v1] Sat, 4 Jun 2011 08:14:59 GMT (2456kb,AD)
[v2] Wed, 7 Sep 2011 16:11:15 GMT (37kb,D)
[v3] Fri, 14 Oct 2011 15:01:11 GMT (39kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> stat > arXiv:1106.0800

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Statistics > Machine Learning

Title: Optimal Reinforcement Learning for Gaussian Systems

Submission history