Learning Efficient Multi-Agent Cooperative Visual Exploration

Yu, Chao; Yang, Xinyi; Gao, Jiaxuan; Yang, Huazhong; Wang, Yu; Wu, Yi

Full-text links:

Download:

Current browse context:

cs.CV

< prev | next >

new | recent | 2110

Computer Science > Computer Vision and Pattern Recognition

Title: Learning Efficient Multi-Agent Cooperative Visual Exploration

Authors: Chao Yu, Xinyi Yang, Jiaxuan Gao, Huazhong Yang, Yu Wang, Yi Wu

(Submitted on 12 Oct 2021 (v1), last revised 22 Nov 2022 (this version, v3))

Abstract: We tackle the problem of cooperative visual exploration where multiple agents need to jointly explore unseen regions as fast as possible based on visual signals. Classical planning-based methods often suffer from expensive computation overhead at each step and a limited expressiveness of complex cooperation strategy. By contrast, reinforcement learning (RL) has recently become a popular paradigm for tackling this challenge due to its modeling capability of arbitrarily complex strategies and minimal inference overhead. In this paper, we extend the state-of-the-art single-agent visual navigation method, Active Neural SLAM (ANS), to the multi-agent setting by introducing a novel RL-based planning module, Multi-agent Spatial Planner (MSP).MSP leverages a transformer-based architecture, Spatial-TeamFormer, which effectively captures spatial relations and intra-agent interactions via hierarchical spatial self-attentions. In addition, we also implement a few multi-agent enhancements to process local information from each agent for an aligned spatial representation and more precise planning. Finally, we perform policy distillation to extract a meta policy to significantly improve the generalization capability of final policy. We call this overall solution, Multi-Agent Active Neural SLAM (MAANS). MAANS substantially outperforms classical planning-based baselines for the first time in a photo-realistic 3D simulator, Habitat. Code and videos can be found at this https URL

Comments:	First three authors share equal contribution. This paper has been accepted by ECCV (this https URL)
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
Cite as:	arXiv:2110.05734 [cs.CV]
	(or arXiv:2110.05734v3 [cs.CV] for this version)

Submission history

From: Chao Yu [view email]
[v1] Tue, 12 Oct 2021 04:48:10 GMT (4862kb,D)
[v2] Tue, 22 Mar 2022 15:09:52 GMT (4704kb,D)
[v3] Tue, 22 Nov 2022 14:26:46 GMT (4704kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2110.05734

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: Learning Efficient Multi-Agent Cooperative Visual Exploration

Submission history