Title: TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations

Abstract: Deep reinforcement learning (DRL) has achieved super-human performance on complex video games (e.g., StarCraft II and Dota 2). However, current DRL systems still struggle with multi-agent coordination, sparse rewards, stochastic environments, etc. To address these challenges, we use a football video game, Google Research Football (GRF), as our testbed and develop an end-to-end learning-based AI system (denoted as TiKick) to complete this challenging task. In this work, we first generated a large replay dataset from the self-play of single-agent experts, which were obtained from league training. We then developed a distributed learning system and new offline algorithms to learn a powerful multi-agent AI from the fixed single-agent dataset. To the best of our knowledge, TiKick is the first learning-based AI system that can take over the multi-agent Google Research Football full game, whereas previous work could only control a single agent or experiment on toy academic scenarios. Extensive experiments further show that our pre-trained model can accelerate the training of a modern multi-agent algorithm and that our method achieves state-of-the-art performance on various academic scenarios.
Subjects: Artificial Intelligence (cs.AI)
Cite as: arXiv:2110.04507 [cs.AI]
  (or arXiv:2110.04507v5 [cs.AI] for this version)

Submission history

From: Shiyu Huang
[v1] Sat, 9 Oct 2021 08:34:58 GMT (7937kb,D)
[v2] Tue, 12 Oct 2021 05:25:00 GMT (7937kb,D)
[v3] Sat, 16 Oct 2021 07:47:25 GMT (7937kb,D)
[v4] Tue, 19 Oct 2021 08:41:27 GMT (7937kb,D)
[v5] Tue, 30 Nov 2021 10:06:39 GMT (7937kb,D)