Hierarchical Memory Networks

Chandar, Sarath; Ahn, Sungjin; Larochelle, Hugo; Vincent, Pascal; Tesauro, Gerald; Bengio, Yoshua

(Submitted on 24 May 2016)

Abstract: Memory networks are neural networks with an explicit memory component that the network can both read from and write to. The memory is often addressed in a soft way using a softmax function, making end-to-end training with backpropagation possible. However, this is not computationally scalable for applications that require the network to read from extremely large memories. On the other hand, it is well known that hard attention mechanisms based on reinforcement learning are challenging to train successfully. In this paper, we explore a form of hierarchical memory network, which can be considered a hybrid between hard and soft attention memory networks. The memory is organized in a hierarchical structure such that reading from it requires less computation than soft attention over a flat memory, while also being easier to train than hard attention over a flat memory. Specifically, we propose to incorporate Maximum Inner Product Search (MIPS) in the training and inference procedures for our hierarchical memory network. We explore the use of various state-of-the-art approximate MIPS techniques and report results on SimpleQuestions, a challenging large-scale factoid question answering task.
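
To make the contrast in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It compares a soft-attention read over a flat memory with a read that first selects the top-k memory rows by inner product and then applies the softmax only to those candidates, which is one plausible reading of "incorporating MIPS". The function names (`soft_read`, `top_k_mips`, `mips_read`), the toy memory, and the brute-force exact search standing in for an approximate MIPS index are all assumptions made for illustration.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def soft_read(query, memory):
    """Soft attention over every memory row: one score per row, per read."""
    weights = softmax([dot(query, m) for m in memory])
    dim = len(memory[0])
    return [sum(w * m[i] for w, m in zip(weights, memory)) for i in range(dim)]


def top_k_mips(query, memory, k):
    """Exact top-k maximum inner product search by brute force.

    Hypothetical stand-in for an approximate MIPS index: returns the
    indices of the k rows with the largest inner product with the query.
    """
    order = sorted(range(len(memory)),
                   key=lambda i: dot(query, memory[i]),
                   reverse=True)
    return order[:k]


def mips_read(query, memory, k):
    """Softmax restricted to the k candidates returned by the search."""
    candidates = [memory[i] for i in top_k_mips(query, memory, k)]
    return soft_read(query, candidates)


if __name__ == "__main__":
    rng = random.Random(0)
    memory = [[rng.gauss(0, 1) for _ in range(8)] for _ in range(1000)]
    query = [rng.gauss(0, 1) for _ in range(8)]
    print(soft_read(query, memory)[:3])
    print(mips_read(query, memory, k=10)[:3])
```

In this sketch the brute-force search still scores every row, so it saves nothing by itself; any saving would come from replacing `top_k_mips` with a sublinear approximate index, at the cost of occasionally missing a high-scoring row.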

Comments:	10 pages
Subjects:	Machine Learning (stat.ML); Computation and Language (cs.CL); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:1605.07427 [stat.ML]
	(or arXiv:1605.07427v1 [stat.ML] for this version)

Submission history

From: Sarath Chandar
[v1] Tue, 24 May 2016 12:48:19 GMT (29kb,D)
