Title: Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments

Abstract: We explore deep reinforcement learning methods for multi-agent domains. We begin by analyzing the difficulty of traditional algorithms in the multi-agent case: Q-learning is challenged by an inherent non-stationarity of the environment, while policy gradient suffers from a variance that increases as the number of agents grows. We then present an adaptation of actor-critic methods that considers action policies of other agents and is able to successfully learn policies that require complex multi-agent coordination. Additionally, we introduce a training regimen utilizing an ensemble of policies for each agent that leads to more robust multi-agent policies. We show the strength of our approach compared to existing methods in cooperative as well as competitive scenarios, where agent populations are able to discover various physical and informational coordination strategies.
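
The abstract names two mechanisms without giving details: an actor-critic variant that takes other agents' action policies into account, and a per-agent ensemble of policies. The following is a minimal sketch of how these might fit together, under stated assumptions: the critic is assumed to receive every agent's observation and action during training, each policy is assumed to act on its own observation only, and one sub-policy per agent is assumed to be drawn uniformly at random (for example, once per episode). All function names are hypothetical and are not taken from the authors' code.

```python
import random
from typing import Callable, List, Sequence

# Hypothetical type: a policy maps one agent's observation to a scalar action.
Policy = Callable[[Sequence[float]], float]


def critic_input(observations: Sequence[Sequence[float]],
                 actions: Sequence[float]) -> List[float]:
    """Flatten all agents' observations and actions into one critic input.

    Assumption: the critic is centralized, so it sees joint information.
    """
    joint: List[float] = []
    for obs in observations:
        joint.extend(obs)
    joint.extend(actions)
    return joint


def sample_ensemble(ensembles: Sequence[Sequence[Policy]],
                    rng: random.Random) -> List[Policy]:
    """Pick one sub-policy per agent, uniformly at random (assumed scheme)."""
    return [rng.choice(list(agent_policies)) for agent_policies in ensembles]


def act(policies: Sequence[Policy],
        observations: Sequence[Sequence[float]]) -> List[float]:
    """Decentralized execution: each policy sees only its own observation."""
    return [pi(obs) for pi, obs in zip(policies, observations)]


if __name__ == "__main__":
    rng = random.Random(0)
    # Two agents, each with a toy ensemble of two hand-written sub-policies.
    ensembles = [
        [lambda o: sum(o), lambda o: 0.0],
        [lambda o: -o[0], lambda o: o[0]],
    ]
    observations = [[1.0, 2.0], [3.0]]
    chosen = sample_ensemble(ensembles, rng)
    actions = act(chosen, observations)
    print(critic_input(observations, actions))
```

In this sketch the joint vector returned by `critic_input` is what a learned value function would consume during training, while `act` shows that execution needs no access to other agents' observations.
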
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
Cite as: arXiv:1706.02275 [cs.LG]
  (or arXiv:1706.02275v4 [cs.LG] for this version)

Submission history

From: Ryan Lowe T.
[v1] Wed, 7 Jun 2017 17:35:00 GMT (3408kb,D)
[v2] Wed, 21 Jun 2017 22:18:54 GMT (3408kb,D)
[v3] Tue, 16 Jan 2018 23:37:25 GMT (3409kb,D)
[v4] Sat, 14 Mar 2020 20:33:00 GMT (3419kb,D)