Title: Improved Reinforcement Learning in Cooperative Multi-agent Environments Using Knowledge Transfer

Abstract: Cooperative multi-agent systems are increasingly used to learn how to achieve goals in large-scale dynamic environments. Learning in these environments is challenging: the size of the search space lengthens learning time, cooperation among agents can be inefficient, and reinforcement learning algorithms may converge slowly. In this paper, we introduce a communication framework in which agents learn to cooperate effectively and in which a new state calculation method considerably reduces the size of the state space. We also present a knowledge-transferring algorithm that shares gained experience among agents, together with a knowledge-fusing mechanism that combines the knowledge an agent has learnt from its own experience with the knowledge received from other team members. We evaluate the approach in simulation on the shepherding problem, a complex learning task. The results show that the knowledge-transferring mechanism accelerates the learning process and that generating similar states, based on the concept of state abstraction, reduces the size of the state space.
Comments: Accepted for publication by The Journal of Supercomputing
Subjects: Multiagent Systems (cs.MA); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as: arXiv:2107.09807 [cs.MA]
  (or arXiv:2107.09807v5 [cs.MA] for this version)
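
The abstract describes fusing knowledge learnt from an agent's own experience with knowledge received from teammates, but this listing does not state the fusion rule. The following is only a minimal sketch of one plausible scheme, assuming tabular Q-values and a visit-count-weighted average; the function name `fuse_q_tables`, the visit-count dictionaries, and the weighting rule are illustrative assumptions, not the paper's algorithm.

```python
def fuse_q_tables(own_q, own_visits, peer_q, peer_visits):
    """Fuse two tabular Q-functions keyed by (state, action).

    ASSUMPTION: each entry is weighted by how often the agent that
    produced it visited that (state, action) pair. This weighting is a
    hypothetical stand-in, not the fusion rule used in the paper.
    """
    fused = {}
    for key in set(own_q) | set(peer_q):
        n_own = own_visits.get(key, 0)
        n_peer = peer_visits.get(key, 0)
        total = n_own + n_peer
        if total == 0:
            # No visit information: keep the agent's own estimate if it
            # has one, otherwise fall back to the peer's estimate.
            fused[key] = own_q.get(key, peer_q.get(key, 0.0))
            continue
        fused[key] = (
            n_own * own_q.get(key, 0.0) + n_peer * peer_q.get(key, 0.0)
        ) / total
    return fused


# Example: the peer has more experience, so its estimates dominate.
own_q = {("s0", "a0"): 1.0}
own_visits = {("s0", "a0"): 1}
peer_q = {("s0", "a0"): 3.0, ("s1", "a0"): 2.0}
peer_visits = {("s0", "a0"): 3, ("s1", "a0"): 4}

fused = fuse_q_tables(own_q, own_visits, peer_q, peer_visits)
```

In this sketch, entries the agent has never visited are taken entirely from the teammate, which is one simple way shared experience could shorten learning. If states were first mapped to abstract states, the same dictionaries could be keyed by the abstract state instead.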

Submission history

From: Amin Nikanjam
[v1] Tue, 20 Jul 2021 23:42:39 GMT (478kb,D)
[v2] Sun, 25 Jul 2021 14:17:50 GMT (477kb,D)
[v3] Sun, 3 Oct 2021 14:04:33 GMT (1749kb)
[v4] Sun, 28 Nov 2021 22:55:52 GMT (1345kb)
[v5] Mon, 17 Jan 2022 19:23:02 GMT (2257kb)