Title: The Sensory Neuron as a Transformer: Permutation-Invariant Neural Networks for Reinforcement Learning

Abstract: In complex systems, we often observe complex global behavior emerge from a collection of agents interacting with each other in their environment, with each individual agent acting only on locally available information, without knowing the full picture. Such systems have inspired the development of artificial intelligence algorithms in areas such as swarm optimization and cellular automata. Motivated by the emergence of collective behavior from complex cellular systems, we build systems that feed each sensory input from the environment into distinct but identical neural networks, each with no fixed relationship with one another. We show that these sensory networks can be trained to integrate information received locally, and through communication via an attention mechanism, can collectively produce a globally coherent policy. Moreover, the system can still perform its task even if the ordering of its inputs is randomly permuted several times during an episode. These permutation-invariant systems also display useful robustness and generalization properties that are broadly applicable. Interactive demo and videos of our results: this https URL
Comments: To appear at the 35th Conference on Neural Information Processing Systems (NeurIPS 2021). Selected for a spotlight presentation.
Subjects: Neural and Evolutionary Computing (cs.NE)
Cite as: arXiv:2109.02869 [cs.NE]
  (or arXiv:2109.02869v2 [cs.NE] for this version)
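
The permutation invariance described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes a toy setup in which every scalar input passes through the same (shared) function to produce a key and a value, and a fixed set of queries that do not depend on the input pools them with softmax attention. All names (shared_sensory_unit, attention_pool, W_KEY, W_VAL, QUERIES) and the random weights are hypothetical, and the sketch omits training and any memory across time steps.

```python
import math
import random


def shared_sensory_unit(x, w_key, w_val):
    """Apply the same weights to one scalar input; return (key, value) vectors."""
    key = [math.tanh(w * x + b) for (w, b) in w_key]
    val = [math.tanh(w * x + b) for (w, b) in w_val]
    return key, val


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(observation, queries, w_key, w_val):
    """Pool per-input keys/values with input-independent queries.

    Permuting `observation` permutes keys and values together, so each
    attention-weighted sum is unchanged (up to floating-point rounding).
    """
    pairs = [shared_sensory_unit(x, w_key, w_val) for x in observation]
    keys = [p[0] for p in pairs]
    vals = [p[1] for p in pairs]
    d = len(keys[0])
    out = []
    for q in queries:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in keys]
        weights = softmax(scores)
        pooled = [sum(w * v[j] for w, v in zip(weights, vals))
                  for j in range(len(vals[0]))]
        out.append(pooled)
    return out


# Hypothetical random parameters (assumption: untrained, for illustration only).
rng = random.Random(0)
W_KEY = [(rng.uniform(-1, 1), rng.uniform(-1, 1)) for _ in range(4)]
W_VAL = [(rng.uniform(-1, 1), rng.uniform(-1, 1)) for _ in range(3)]
QUERIES = [[rng.uniform(-1, 1) for _ in range(4)] for _ in range(2)]

obs = [0.3, -1.2, 0.8, 0.05, 2.0]
shuffled = obs[:]
rng.shuffle(shuffled)

a = attention_pool(obs, QUERIES, W_KEY, W_VAL)
b = attention_pool(shuffled, QUERIES, W_KEY, W_VAL)

# Largest elementwise gap between the two outputs; only rounding error remains.
max_diff = max(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))
print(max_diff < 1e-9)
```

Because the queries are fixed and the keys and values are computed per input by the same function, the output has a fixed shape (here 2 by 3) regardless of how many inputs there are or in what order they arrive.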

Submission history

From: David Ha
[v1] Tue, 7 Sep 2021 05:12:50 GMT (4343kb,D)
[v2] Wed, 29 Sep 2021 00:59:49 GMT (4331kb,D)