Computer Science > Artificial Intelligence
Title: Scalable Planning and Learning for Multiagent POMDPs
(Submitted on 4 Apr 2014 (this version), latest version 20 Dec 2014 (v2))
Abstract: Bayesian methods for reinforcement learning (BRL) allow model uncertainty to be considered explicitly and offer a principled way of dealing with the exploration/exploitation tradeoff. However, for multiagent systems there have been few such approaches, and none of them apply to problems with state uncertainty. In this paper, we fill this gap by proposing a BRL framework for multiagent partially observable Markov decision processes. It considers a team of agents that operates in a centralized fashion, but has uncertainty about both the state and the model of the environment, essentially transforming the learning problem to a planning problem. To deal with the complexity of this planning problem as well as other planning problems with a large number of actions and observations, we propose a novel scalable approach based on sample-based planning and factored value functions that exploits structure present in many multiagent settings. Experimental results show that we are able to provide high quality solutions to large problems even with a large amount of initial model uncertainty. We also show that our approach applies in the (traditional) planning setting, demonstrating significantly more efficient planning in factored settings.
Submission history
From: Christopher Amato
[v1] Fri, 4 Apr 2014 03:02:44 GMT (1118kb,D)
[v2] Sat, 20 Dec 2014 03:28:34 GMT (1428kb,D)