Scalable Planning and Learning for Multiagent POMDPs

Amato, Christopher; Oliehoek, Frans A.

(Submitted on 4 Apr 2014 (this version), latest version 20 Dec 2014 (v2))

Abstract: Bayesian methods for reinforcement learning (BRL) allow model uncertainty to be considered explicitly and offer a principled way of dealing with the exploration/exploitation tradeoff. However, for multiagent systems there have been few such approaches, and none of them applies to problems with state uncertainty. In this paper, we fill this gap by proposing a BRL framework for multiagent partially observable Markov decision processes. The framework considers a team of agents that operates in a centralized fashion but is uncertain about both the state and the model of the environment, essentially transforming the learning problem into a planning problem. To deal with the complexity of this planning problem, as well as other planning problems with a large number of actions and observations, we propose a novel scalable approach based on sample-based planning and factored value functions that exploits structure present in many multiagent settings. Experimental results show that we are able to provide high-quality solutions to large problems even with a large amount of initial model uncertainty. We also show that our approach applies in the (traditional) planning setting, demonstrating significantly more efficient planning in factored settings.
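
The idea of a factored value function mentioned in the abstract can be illustrated with a toy sketch. This is not the authors' algorithm: the three-agent chain, the factor scopes, the numeric values, and the names `FACTORS`, `factored_q`, and `best_joint_action` are all assumptions made up for illustration. The sketch only shows the general pattern of approximating a joint-action value as a sum of local terms, each depending on a small subset of agents.

```python
from itertools import product

# Hypothetical toy problem: 3 agents in a chain, 2 actions each.
N_AGENTS = 3
ACTIONS = (0, 1)

# Each factor covers a small subset of agents (its scope) and stores a
# local value for every joint action of that subset. Values are made up.
FACTORS = {
    (0, 1): {(0, 0): 1.0, (0, 1): 0.0, (1, 0): 0.5, (1, 1): 2.0},
    (1, 2): {(0, 0): 0.0, (0, 1): 1.5, (1, 0): 1.0, (1, 1): 0.5},
}


def factored_q(joint_action):
    """Approximate the joint value as a sum of local factor values."""
    return sum(
        table[tuple(joint_action[i] for i in scope)]
        for scope, table in FACTORS.items()
    )


def best_joint_action():
    """Brute-force maximization over joint actions (toy-sized only)."""
    return max(product(ACTIONS, repeat=N_AGENTS), key=factored_q)


if __name__ == "__main__":
    best = best_joint_action()
    print(best, factored_q(best))
```

The brute-force maximization here enumerates every joint action, which grows exponentially with the number of agents; it is used only to keep the sketch short.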

Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:1404.1140 [cs.AI]
	(or arXiv:1404.1140v1 [cs.AI] for this version)

Submission history

From: Christopher Amato
[v1] Fri, 4 Apr 2014 03:02:44 GMT (1118kb,D)
[v2] Sat, 20 Dec 2014 03:28:34 GMT (1428kb,D)
