Title: Emergent Discrete Communication in Semantic Spaces

Abstract: Neural agents trained in reinforcement learning settings can learn to communicate among themselves via discrete tokens, accomplishing as a team what agents would be unable to do alone. However, the current standard of using one-hot vectors as discrete communication tokens prevents agents from acquiring more desirable aspects of communication such as zero-shot understanding. Inspired by word embedding techniques from natural language processing, we propose neural agent architectures that enable agents to communicate via discrete tokens derived from a learned, continuous space. We show in a decision-theoretic framework that our technique optimizes communication over a wide range of scenarios, whereas one-hot tokens are only optimal under restrictive assumptions. In self-play experiments, we validate that our trained agents learn to cluster tokens in semantically meaningful ways, allowing them to communicate in noisy environments where other techniques fail. Lastly, we demonstrate both that agents using our method can effectively respond to novel human communication and that humans can understand unlabeled emergent agent communication, outperforming the use of one-hot communication.
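As a rough illustration of what "discrete tokens derived from a learned, continuous space" could mean, the sketch below snaps a continuous vector to the nearest entry in a small set of prototype vectors. This is an assumption made for illustration only, not the architecture from the paper: the names `nearest_token` and `discretize`, the 2-D space, and the prototype values are all hypothetical, and in a real system the prototypes would be learned rather than hard-coded.

```python
import numpy as np


def nearest_token(vec, prototypes):
    """Return the index of the prototype closest to vec (Euclidean distance)."""
    dists = np.linalg.norm(prototypes - vec, axis=1)
    return int(np.argmin(dists))


def discretize(continuous_output, prototypes):
    """Map a continuous vector to a discrete token: (index, prototype vector)."""
    idx = nearest_token(continuous_output, prototypes)
    return idx, prototypes[idx]


# Hypothetical token space: two clusters of two prototypes each.
# The values are made up; they stand in for learned embeddings.
prototypes = np.array([
    [0.0, 0.0],
    [0.1, 0.0],
    [5.0, 5.0],
    [5.1, 5.0],
])

# A speaker's continuous output is snapped to the nearest prototype.
token_idx, token_vec = discretize(np.array([4.9, 5.2]), prototypes)

# A listener receiving a noise-corrupted copy of token 2 decodes it to a
# prototype in the same cluster, so the error stays "close" in the space.
noisy_idx = nearest_token(prototypes[2] + np.array([0.08, 0.0]), prototypes)
```

Unlike one-hot tokens, where every pair of distinct tokens is equally far apart, tokens embedded in a shared space have a notion of nearness, so in this toy setup a corrupted message tends to decode to a neighboring token rather than an arbitrary one.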
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL); Multiagent Systems (cs.MA); Robotics (cs.RO)
Cite as: arXiv:2108.01828 [cs.LG]
  (or arXiv:2108.01828v3 [cs.LG] for this version)

Submission history

From: Mycal Tucker
[v1] Wed, 4 Aug 2021 03:32:48 GMT (2389kb,D)
[v2] Thu, 5 Aug 2021 14:57:47 GMT (2389kb,D)
[v3] Thu, 4 Nov 2021 18:55:33 GMT (2389kb,D)