Walk the Random Walk: Learning to Discover and Reach Goals Without Supervision

Mezghani, Lina; Sukhbaatar, Sainbayar; Bojanowski, Piotr; Alahari, Karteek

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2206

Computer Science > Machine Learning

Title: Walk the Random Walk: Learning to Discover and Reach Goals Without Supervision

Authors: Lina Mezghani, Sainbayar Sukhbaatar, Piotr Bojanowski, Karteek Alahari

(Submitted on 23 Jun 2022)

Abstract: Learning a diverse set of skills by interacting with an environment without any external supervision is an important challenge. In particular, obtaining a goal-conditioned agent that can reach any given state is useful in many applications. We propose a novel method for training such a goal-conditioned agent without any external rewards or any domain knowledge. We use random walk to train a reachability network that predicts the similarity between two states. This reachability network is then used in building goal memory containing past observations that are diverse and well-balanced. Finally, we train a goal-conditioned policy network with goals sampled from the goal memory and reward it by the reachability network and the goal memory. All the components are kept updated throughout training as the agent discovers and learns new goals. We apply our method to a continuous control navigation and robotic manipulation tasks.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)
Cite as:	arXiv:2206.11733 [cs.LG]
	(or arXiv:2206.11733v1 [cs.LG] for this version)

Submission history

From: Lina Mezghani [view email]
[v1] Thu, 23 Jun 2022 14:29:36 GMT (3804kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2206.11733

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Walk the Random Walk: Learning to Discover and Reach Goals Without Supervision

Submission history