Soft Hierarchical Graph Recurrent Networks for Many-Agent Partially Observable Environments

Ye, Zhenhui; Jiang, Xiaohong; Song, Guanghua; Yang, Bowei

(Submitted on 5 Sep 2021)

Abstract: Recent progress in multi-agent deep reinforcement learning (MADRL) has made it more practical for real-world tasks, but its relatively poor scalability and the constraints of partial observability pose challenges to its performance and deployment. Based on the intuitive observation that human society can be regarded as a large-scale partially observable environment, in which each individual can communicate with neighbors and remember its own experience, we propose a novel network structure called the hierarchical graph recurrent network (HGRN) for multi-agent cooperation under partial observability. Specifically, we model the multi-agent system as a graph, use a hierarchical graph attention network (HGAT) for communication between neighboring agents, and use a gated recurrent unit (GRU) to let agents record historical information. To encourage exploration and improve robustness, we design a maximum-entropy learning method that learns stochastic policies with a configurable target action entropy. Building on these techniques, we propose a value-based MADRL algorithm called Soft-HGRN and its actor-critic variant, SAC-HGRN. Experimental results on three homogeneous tasks and one heterogeneous environment not only show that our approach achieves clear improvements over four baselines, but also demonstrate the interpretability, scalability, and transferability of the proposed model. Ablation studies demonstrate the function and necessity of each component.
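
The abstract does not say how the target action entropy is enforced. As a rough illustration only, and not the authors' method, the sketch below assumes a Boltzmann (softmax) policy over Q-values whose temperature is adjusted until the policy entropy matches a chosen target, in the spirit of the automatic temperature tuning used in soft actor-critic methods. The function names, the Q-values, the learning rate, and the step count are all hypothetical.

```python
import math


def softmax_policy(q_values, alpha):
    """Boltzmann policy: pi(a) proportional to exp(Q(a) / alpha)."""
    m = max(q_values)  # subtract the max for numerical stability
    exps = [math.exp((q - m) / alpha) for q in q_values]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(probs):
    """Shannon entropy in nats."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def tune_temperature(q_values, target_entropy, alpha=1.0, lr=0.05, steps=2000):
    """Adjust the temperature so the policy entropy approaches the target.

    Assumption: a simple fixed-point update on log(alpha), chosen for
    illustration; it is not taken from the paper.
    """
    log_alpha = math.log(alpha)
    for _ in range(steps):
        h = entropy(softmax_policy(q_values, math.exp(log_alpha)))
        # Entropy below target: raise the temperature; above target: lower it.
        log_alpha += lr * (target_entropy - h)
    return math.exp(log_alpha)


# Hypothetical Q-values for one agent with four discrete actions.
q = [1.0, 2.0, 0.5, 0.0]
alpha = tune_temperature(q, target_entropy=1.0)
policy = softmax_policy(q, alpha)
```

The target must lie between 0 and the log of the number of actions (about 1.386 nats for four actions). A higher target pushes the policy toward uniform exploration, while a lower one makes it closer to greedy.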

Comments:	9 pages, 6 figures, 1 table. Under review
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
Cite as:	arXiv:2109.02032 [cs.LG]
	(or arXiv:2109.02032v1 [cs.LG] for this version)

Submission history

From: Zhenhui Ye
[v1] Sun, 5 Sep 2021 09:51:25 GMT (876kb,D)
