Computer Science > Neural and Evolutionary Computing
Title: LSTM with Working Memory
(Submitted on 6 May 2016 (v1), last revised 30 Mar 2017 (this version, v3))
Abstract: Previous RNN architectures have largely been superseded by LSTM, or "Long Short-Term Memory". Since its introduction, there have been many variations on this simple design. However, LSTM is still widely used, and we are not aware of a gated-RNN architecture that outperforms it in a broad sense while remaining as simple and efficient. In this paper we propose a modified LSTM-like architecture. Our architecture is still simple and achieves better performance on the tasks we tested. We also introduce a new RNN performance benchmark that uses handwritten digits and stresses several important network capabilities.
Submission history
From: Andrew Pulver
[v1] Fri, 6 May 2016 16:11:45 GMT (151kb,D)
[v2] Fri, 13 May 2016 16:47:41 GMT (150kb,D)
[v3] Thu, 30 Mar 2017 18:24:55 GMT (447kb,D)