Learning Markov State Abstractions for Deep Reinforcement Learning

Allen, Cameron; Parikh, Neev; Gottesman, Omer; Konidaris, George

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2106

Computer Science > Machine Learning

Title: Learning Markov State Abstractions for Deep Reinforcement Learning

Authors: Cameron Allen, Neev Parikh, Omer Gottesman, George Konidaris

(Submitted on 8 Jun 2021 (v1), last revised 15 Mar 2024 (this version, v4))

Abstract: A fundamental assumption of reinforcement learning in Markov decision processes (MDPs) is that the relevant decision process is, in fact, Markov. However, when MDPs have rich observations, agents typically learn by way of an abstract state representation, and such representations are not guaranteed to preserve the Markov property. We introduce a novel set of conditions and prove that they are sufficient for learning a Markov abstract state representation. We then describe a practical training procedure that combines inverse model estimation and temporal contrastive learning to learn an abstraction that approximately satisfies these conditions. Our novel training objective is compatible with both online and offline training: it does not require a reward signal, but agents can capitalize on reward information when available. We empirically evaluate our approach on a visual gridworld domain and a set of continuous control benchmarks. Our approach learns representations that capture the underlying structure of the domain and lead to improved sample efficiency over state-of-the-art deep reinforcement learning with visual features -- often matching or exceeding the performance achieved with hand-designed compact state information.

Comments:	Fixed typo (see Errata). Code available at this https URL
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:2106.04379 [cs.LG]
	(or arXiv:2106.04379v4 [cs.LG] for this version)

Submission history

From: Cameron Allen [view email]
[v1] Tue, 8 Jun 2021 14:12:36 GMT (7108kb,D)
[v2] Tue, 26 Oct 2021 20:50:59 GMT (6258kb,D)
[v3] Thu, 28 Oct 2021 01:37:34 GMT (6258kb,D)
[v4] Fri, 15 Mar 2024 00:13:09 GMT (6291kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2106.04379

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Learning Markov State Abstractions for Deep Reinforcement Learning

Submission history