Title: Exposing the Functionalities of Neurons for Gated Recurrent Unit Based Sequence-to-Sequence Model

Abstract: This paper reports findings about the internal behavior of a Seq2Seq model. Analyzing RNN-based models at the neuron level is considered more challenging than analyzing DNN or CNN models because of their recurrent mechanism. We provide a neuron-level analysis that explains why a vanilla GRU-based Seq2Seq model without attention can achieve token positioning. We identify four types of neurons (storing, counting, triggering, and outputting) and uncover the mechanism by which these neurons work together to produce the right token in the right position.
Comments: 9 pages (excluding references), 10 figures
Subjects: Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as: arXiv:2303.15072 [cs.NE]
  (or arXiv:2303.15072v1 [cs.NE] for this version)

Submission history

From: Yi-Ting Lee
[v1] Mon, 27 Mar 2023 10:25:46 GMT (1396kb,D)
