Kinematic State Abstraction and Provably Efficient Rich-Observation Reinforcement Learning

Misra, Dipendra; Henaff, Mikael; Krishnamurthy, Akshay; Langford, John

Full-text links:

Download:

Current browse context:

stat.ML

< prev | next >

new | recent | 1911

Computer Science > Machine Learning

Title: Kinematic State Abstraction and Provably Efficient Rich-Observation Reinforcement Learning

Authors: Dipendra Misra, Mikael Henaff, Akshay Krishnamurthy, John Langford

(Submitted on 13 Nov 2019)

Abstract: We present an algorithm, HOMER, for exploration and reinforcement learning in rich observation environments that are summarizable by an unknown latent state space. The algorithm interleaves representation learning to identify a new notion of kinematic state abstraction with strategic exploration to reach new states using the learned abstraction. The algorithm provably explores the environment with sample complexity scaling polynomially in the number of latent states and the time horizon, and, crucially, with no dependence on the size of the observation space, which could be infinitely large. This exploration guarantee further enables sample-efficient global policy optimization for any reward function. On the computational side, we show that the algorithm can be implemented efficiently whenever certain supervised learning problems are tractable. Empirically, we evaluate HOMER on a challenging exploration problem, where we show that the algorithm is exponentially more sample efficient than standard reinforcement learning baselines.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1911.05815 [cs.LG]
	(or arXiv:1911.05815v1 [cs.LG] for this version)

Submission history

From: Dipendra Misra [view email]
[v1] Wed, 13 Nov 2019 21:07:44 GMT (3178kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1911.05815

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Computer Science > Machine Learning

Title: Kinematic State Abstraction and Provably Efficient Rich-Observation Reinforcement Learning

Submission history