Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning

Iqbal, Shariq; de Witt, Christian A. Schroeder; Peng, Bei; Böhmer, Wendelin; Whiteson, Shimon; Sha, Fei

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2006

Computer Science > Machine Learning

Title: Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning

Authors: Shariq Iqbal, Christian A. Schroeder de Witt, Bei Peng, Wendelin Böhmer, Shimon Whiteson, Fei Sha

(Submitted on 7 Jun 2020 (v1), last revised 11 Jun 2021 (this version, v3))

Abstract: Multi-agent settings in the real world often involve tasks with varying types and quantities of agents and non-agent entities; however, common patterns of behavior often emerge among these agents/entities. Our method aims to leverage these commonalities by asking the question: ``What is the expected utility of each agent when only considering a randomly selected sub-group of its observed entities?'' By posing this counterfactual question, we can recognize state-action trajectories within sub-groups of entities that we may have encountered in another task and use what we learned in that task to inform our prediction in the current one. We then reconstruct a prediction of the full returns as a combination of factors considering these disjoint groups of entities and train this ``randomly factorized" value function as an auxiliary objective for value-based multi-agent reinforcement learning. By doing so, our model can recognize and leverage similarities across tasks to improve learning efficiency in a multi-task setting. Our approach, Randomized Entity-wise Factorization for Imagined Learning (REFIL), outperforms all strong baselines by a significant margin in challenging multi-task StarCraft micromanagement settings.

Comments:	ICML 2021 Camera Ready
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA); Machine Learning (stat.ML)
Cite as:	arXiv:2006.04222 [cs.LG]
	(or arXiv:2006.04222v3 [cs.LG] for this version)

Submission history

From: Shariq Iqbal [view email]
[v1] Sun, 7 Jun 2020 18:28:41 GMT (3111kb,D)
[v2] Wed, 21 Oct 2020 16:39:47 GMT (4103kb,D)
[v3] Fri, 11 Jun 2021 18:53:47 GMT (3443kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2006.04222

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning

Submission history