Contrastive Variational Reinforcement Learning for Complex Observations

Ma, Xiao; Chen, Siwei; Hsu, David; Lee, Wee Sun

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2008

Computer Science > Machine Learning

Title: Contrastive Variational Reinforcement Learning for Complex Observations

Authors: Xiao Ma, Siwei Chen, David Hsu, Wee Sun Lee

(Submitted on 6 Aug 2020 (v1), last revised 9 Nov 2020 (this version, v2))

Abstract: Deep reinforcement learning (DRL) has achieved significant success in various robot tasks: manipulation, navigation, etc. However, complex visual observations in natural environments remains a major challenge. This paper presents Contrastive Variational Reinforcement Learning (CVRL), a model-based method that tackles complex visual observations in DRL. CVRL learns a contrastive variational model by maximizing the mutual information between latent states and observations discriminatively, through contrastive learning. It avoids modeling the complex observation space unnecessarily, as the commonly used generative observation model often does, and is significantly more robust. CVRL achieves comparable performance with state-of-the-art model-based DRL methods on standard Mujoco tasks. It significantly outperforms them on Natural Mujoco tasks and a robot box-pushing task with complex observations, e.g., dynamic shadows. The CVRL code is available publicly at this https URL

Comments:	CoRL 2020 camera ready
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2008.02430 [cs.LG]
	(or arXiv:2008.02430v2 [cs.LG] for this version)

Submission history

From: Xiao Ma [view email]
[v1] Thu, 6 Aug 2020 02:25:51 GMT (6445kb,D)
[v2] Mon, 9 Nov 2020 07:35:00 GMT (2231kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2008.02430

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Contrastive Variational Reinforcement Learning for Complex Observations

Submission history