Multiagent Systems

New submissions

New submissions for Fri, 9 Jun 23

[1]  arXiv:2306.04781 [pdf, other]
Title: Learning to Navigate in Turbulent Flows with Aerial Robot Swarms: A Cooperative Deep Reinforcement Learning Approach
Subjects: Robotics (cs.RO); Machine Learning (cs.LG); Multiagent Systems (cs.MA)

Aerial operation in turbulent environments is a challenging problem due to the chaotic behavior of the flow. The problem becomes even more complex when a team of aerial robots tries to achieve coordinated motion in turbulent wind conditions. In this paper, we present a novel multi-robot controller for navigating in turbulent flows, which decouples trajectory-tracking control from turbulence compensation via a nested control architecture. Unlike previous works, our method does not learn to compensate for the air flow at a specific time and location. Instead, it learns to compensate for the flow based on its effect on the team. This is made possible by a deep reinforcement learning approach, implemented with a Graph Convolutional Neural Network (GCNN)-based architecture, which enables robots to achieve better wind compensation by processing the spatio-temporal correlation of wind flows across the team. Our approach scales well to large robot teams, since each robot uses information only from its nearest neighbors, and it generalizes well to robot teams larger than those seen in training. Simulated experiments demonstrate how information sharing improves turbulence compensation in a team of aerial robots and show the flexibility of our method across different team configurations.

[2]  arXiv:2306.05028 [pdf, other]
Title: Condorcet Markets
Subjects: Computer Science and Game Theory (cs.GT); Multiagent Systems (cs.MA)

The paper studies information markets about single events from an epistemic social choice perspective. Within the classical Condorcet error model for collective binary decisions, we establish equivalence results between elections and markets, showing that the alternative that would be selected by weighted majority voting (under specific weighting schemes) corresponds to the alternative with the highest price in the equilibrium of the market (under specific assumptions on the market type). This makes it possible to implement specific weighted majority elections, which are known to have superior truth-tracking performance, through information markets and, crucially, without needing to elicit voters' competences.
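
As a rough illustration of the weighted majority voting the abstract refers to, the sketch below uses the textbook log-odds weighting for the Condorcet error model: a voter who is correct with probability p gets weight log(p / (1 - p)). This is an assumption made for illustration; the abstract does not state which weighting schemes the paper uses, and the sketch takes competences as known inputs, whereas the paper's market construction is said to avoid eliciting them. The function names are hypothetical.

```python
import math


def log_odds_weight(competence):
    """Weight for a voter who is correct with probability `competence` (0 < p < 1)."""
    return math.log(competence / (1.0 - competence))


def weighted_majority(votes, competences):
    """Return +1 or -1, the alternative with the larger total weight.

    votes: sequence of +1 / -1 ballots.
    competences: per-voter probability of voting for the correct alternative.
    Ties are broken in favor of +1 (an arbitrary choice for this sketch).
    """
    score = sum(log_odds_weight(p) * v for v, p in zip(votes, competences))
    return 1 if score >= 0 else -1


# One highly competent voter against two weakly competent ones.
votes = [1, -1, -1]
expert_case = weighted_majority(votes, [0.9, 0.6, 0.6])   # expert outweighs the pair
equal_case = weighted_majority(votes, [0.6, 0.6, 0.6])    # reduces to simple majority
print(expert_case, equal_case)
```

With equal competences the rule coincides with simple majority; with unequal competences a single well-informed voter can outweigh a less-informed majority, which is the truth-tracking advantage the abstract alludes to.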

[3]  arXiv:2306.05353 [pdf, other]
Title: Negotiated Reasoning: On Provably Addressing Relative Over-Generalization
Comments: 21 pages
Subjects: Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)

Over-generalization is a thorny issue in cognitive science, where people may become overly cautious due to past experiences. Agents in multi-agent reinforcement learning (MARL) have also been found to suffer from relative over-generalization (RO), as people do, and to get stuck in sub-optimal cooperation. Recent methods have shown that giving agents reasoning ability can mitigate RO algorithmically and empirically, but a theoretical understanding of RO has been lacking, let alone a way to design provably RO-free methods. This paper first proves that RO can be avoided when the MARL method satisfies a consistent reasoning requirement under certain conditions. We then introduce a novel reasoning framework, called negotiated reasoning, that first builds the connection between reasoning and RO with theoretical justifications. After that, we propose an instantiated algorithm, Stein variational negotiated reasoning (SVNR), which uses Stein variational gradient descent to derive a negotiation policy that provably avoids RO in MARL under maximum entropy policy iteration. The method is further parameterized with neural networks for amortized learning, making computation efficient. Numerical experiments on many RO-challenged environments demonstrate the superiority and efficiency of SVNR compared to state-of-the-art methods in addressing RO.
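
For readers unfamiliar with Stein variational gradient descent (SVGD), the building block named in the abstract, the following is a minimal one-dimensional sketch of the generic SVGD particle update with an RBF kernel. It is not SVNR and contains nothing specific to MARL or negotiation; the Gaussian target, the fixed bandwidth `h`, the step size, and all function names are assumptions chosen purely for illustration.

```python
import math
import random


def svgd_step(xs, score, step, h):
    """One generic SVGD update for 1-D particles `xs`.

    score(x) is the gradient of the log target density at x.
    The kernel is k(a, b) = exp(-(a - b)**2 / h) with a fixed bandwidth h.
    """
    n = len(xs)
    updated = []
    for xi in xs:
        phi = 0.0
        for xj in xs:
            k = math.exp(-((xj - xi) ** 2) / h)
            grad_k = -2.0 / h * (xj - xi) * k   # derivative of k(xj, xi) w.r.t. xj
            # Attraction toward high density plus repulsion between particles.
            phi += k * score(xj) + grad_k
        updated.append(xi + step * phi / n)
    return updated


def gaussian_score(x, mean=2.0):
    """Score of a unit-variance Gaussian target centered at `mean` (toy target)."""
    return -(x - mean)


rng = random.Random(0)                       # fixed seed for reproducibility
particles = [rng.gauss(-3.0, 1.0) for _ in range(20)]
for _ in range(500):
    particles = svgd_step(particles, gaussian_score, step=0.1, h=1.0)

print(sum(particles) / len(particles))       # should end up near the target mean
```

The first term in `phi` drives particles toward regions of high target density, while the kernel-gradient term pushes them apart so that they approximate the distribution rather than collapsing onto its mode.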

Replacements for Fri, 9 Jun 23

[4]  arXiv:2201.01247 (replaced) [pdf, other]
Title: Value Functions Factorization with Latent State Information Sharing in Decentralized Multi-Agent Policy Gradients
Comments: Accepted to IEEE Transactions on Emerging Topics in Computational Intelligence (TETCI)
Subjects: Multiagent Systems (cs.MA); Machine Learning (cs.LG)
[5]  arXiv:2210.17101 (replaced) [pdf, other]
Title: Unrolled Graph Learning for Multi-Agent Collaboration
Comments: This work was accepted to be presented at the Graph Signal Processing Workshop 2023
Subjects: Machine Learning (cs.LG); Multiagent Systems (cs.MA)