We gratefully acknowledge support from
the Simons Foundation and member institutions.

Multiagent Systems

Authors and titles for cs.MA in Oct 2021, skipping first 25

[ total of 91 entries: 1-25 | 26-50 | 51-75 | 76-91 ]
[ showing 25 entries per page: fewer | more | all ]
[26]  arXiv:2110.00247 (cross-list from cs.AI) [pdf]
Title: Learner to learner fuzzy profiles similarity using a hybrid interaction analysis grid
Authors: Chabane Khentout, Khadidja Harbouche, Mahieddine Djoudi (TECHNÉ - EA 6316)
Journal-ref: Revue des Sciences et Technologies de l'Information - S{\'e}rie ISI : Ing{\'e}nierie des Syst{\`e}mes d'Information, Lavoisier, 2021, 26 (4), pp.375-386
Subjects: Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Multiagent Systems (cs.MA)
[27]  arXiv:2110.00304 (cross-list from cs.LG) [pdf, other]
Title: Divergence-Regularized Multi-Agent Actor-Critic
Authors: Kefan Su, Zongqing Lu
Comments: ICML 2022, 24 pages, 10 figures
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
[28]  arXiv:2110.00751 (cross-list from cs.LG) [pdf, other]
Title: Partner-Aware Algorithms in Decentralized Cooperative Bandit Teams
Comments: 14 pages, 13 figures. To be presented at "Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI) 2022". Also presented at "Artificial Intelligence for Human-Robot Interaction (AI-HRI) at AAAI Fall Symposium Series 2021"
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA); Robotics (cs.RO); Machine Learning (stat.ML)
[29]  arXiv:2110.00760 (cross-list from cs.RO) [pdf, other]
Title: AB-Mapper: Attention and BicNet Based Multi-agent Path Finding for Dynamic Crowded Environment
Subjects: Robotics (cs.RO); Multiagent Systems (cs.MA)
[30]  arXiv:2110.01266 (cross-list from cs.LG) [pdf, ps, other]
Title: Behaviour-conditioned policies for cooperative reinforcement learning tasks
Authors: Antti Keurulainen (1 and 3), Isak Westerlund (3), Ariel Kwiatkowski (3), Samuel Kaski (1 and 2), Alexander Ilin (1) ((1) Helsinki Institute for Information Technology HIIT, Department of Computer Science, Aalto University, (2) Department of Computer Science, University of Manchester, (3) Bitville Oy, Espoo, Finland)
Subjects: Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[31]  arXiv:2110.01307 (cross-list from cs.AI) [pdf, other]
Title: Collective eXplainable AI: Explaining Cooperative Strategies and Agent Contribution in Multiagent Reinforcement Learning with Shapley Values
Comments: Submitted to IEEE Computational Intelligence Magazine
Subjects: Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[32]  arXiv:2110.02134 (cross-list from cs.GT) [pdf, other]
Title: Stochastic Multiplicative Weights Updates in Zero-Sum Games
Subjects: Computer Science and Game Theory (cs.GT); Multiagent Systems (cs.MA)
[33]  arXiv:2110.02355 (cross-list from cs.GT) [pdf, other]
Title: Robustness and sample complexity of model-based MARL for general-sum Markov games
Subjects: Computer Science and Game Theory (cs.GT); Multiagent Systems (cs.MA); Systems and Control (eess.SY); Optimization and Control (math.OC)
[34]  arXiv:2110.02482 (cross-list from cs.GT) [pdf, other]
Title: $O\left(1/T\right)$ Time-Average Convergence in a Generalization of Multiagent Zero-Sum Games
Authors: James P. Bailey
Subjects: Computer Science and Game Theory (cs.GT); Multiagent Systems (cs.MA)
[35]  arXiv:2110.02793 (cross-list from cs.AI) [pdf, other]
Title: Multi-Agent Constrained Policy Optimisation
Subjects: Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
[36]  arXiv:2110.02924 (cross-list from cs.LG) [pdf, other]
Title: No-Press Diplomacy from Scratch
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT); Multiagent Systems (cs.MA)
[37]  arXiv:2110.03017 (cross-list from cs.LG) [pdf, other]
Title: Two-Bit Aggregation for Communication Efficient and Differentially Private Federated Learning
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Distributed, Parallel, and Cluster Computing (cs.DC); Multiagent Systems (cs.MA)
[38]  arXiv:2110.03604 (cross-list from cs.LG) [pdf, ps, other]
Title: Online Markov Decision Processes with Non-oblivious Strategic Adversary
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT); Multiagent Systems (cs.MA)
[39]  arXiv:2110.03906 (cross-list from cs.GT) [pdf, other]
Title: Nash Convergence of Mean-Based Learning Algorithms in First Price Auctions
Comments: 38 pages, 5 figures
Subjects: Computer Science and Game Theory (cs.GT); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multiagent Systems (cs.MA); Theoretical Economics (econ.TH)
[40]  arXiv:2110.04321 (cross-list from cs.GT) [pdf, other]
Title: Computing an Optimal Pitching Strategy in a Baseball At-Bat
Subjects: Computer Science and Game Theory (cs.GT); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[41]  arXiv:2110.04495 (cross-list from cs.LG) [pdf, other]
Title: Multi-Agent MDP Homomorphic Networks
Comments: Camera ready version
Subjects: Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[42]  arXiv:2110.04753 (cross-list from cs.GT) [pdf, other]
Title: Transaction Fees on a Honeymoon: Ethereum's EIP-1559 One Month Later
Comments: IEEE Blockchain-2021, The 4th IEEE International Conference on Blockchain, Melbourne, Australia | 06-08 December 2021
Subjects: Computer Science and Game Theory (cs.GT); Multiagent Systems (cs.MA); Social and Information Networks (cs.SI); Dynamical Systems (math.DS)
[43]  arXiv:2110.05422 (cross-list from cs.CL) [pdf, other]
Title: Calibrate your listeners! Robust communication-based training for pragmatic speakers
Comments: Findings of EMNLP 2021 Code: this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[44]  arXiv:2110.05597 (cross-list from cs.LG) [pdf, other]
Title: Learning to Coordinate in Multi-Agent Systems: A Coordinated Actor-Critic Algorithm and Finite-Time Guarantees
Subjects: Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[45]  arXiv:2110.05682 (cross-list from cs.LG) [pdf, other]
Title: Provably Efficient Reinforcement Learning in Decentralized General-Sum Markov Games
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
[46]  arXiv:2110.05707 (cross-list from cs.LG) [pdf, other]
Title: On Improving Model-Free Algorithms for Decentralized Multi-Agent Reinforcement Learning
Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Multiagent Systems (cs.MA)
[47]  arXiv:2110.05734 (cross-list from cs.CV) [pdf, other]
Title: Learning Efficient Multi-Agent Cooperative Visual Exploration
Comments: First three authors share equal contribution
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[48]  arXiv:2110.05769 (cross-list from cs.CV) [pdf, other]
Title: Interpretation of Emergent Communication in Heterogeneous Collaborative Embodied Agents
Comments: Project page: this https URL ; the first three authors contributed equally
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multiagent Systems (cs.MA)
[49]  arXiv:2110.06342 (cross-list from cs.RO) [pdf, other]
Title: Decentralized Connectivity Maintenance for Multi-robot Systems Under Motion and Sensing Uncertainties
Comments: Submitted to NAVIGATION: Journal of The Institute of Navigation
Subjects: Robotics (cs.RO); Multiagent Systems (cs.MA)
[50]  arXiv:2110.06407 (cross-list from cs.DC) [pdf, ps, other]
Title: Efficient Linearizability Checking for Actor-based Systems
Comments: This article was submitted around Feb 2021 to the journal of "Software Practice and Experience" and not yet finished the review process. So they allow us to submit it to one more personal archival service e.g. this one. That is, this is unpublished work yet
Subjects: Distributed, Parallel, and Cluster Computing (cs.DC); Multiagent Systems (cs.MA); Software Engineering (cs.SE)
[ total of 91 entries: 1-25 | 26-50 | 51-75 | 76-91 ]
[ showing 25 entries per page: fewer | more | all ]

Disable MathJax (What is MathJax?)

Links to: arXiv, form interface, find, cs, 2206, contact, help  (Access key information)