Prioritized Sweeping Neural DynaQ with Multiple Predecessors, and Hippocampal Replays

Aubin, Lise; Khamassi, Mehdi; Girard, Benoît

doi:10.1007/978-3-319-95972-6_4

Authors: Lise Aubin, Mehdi Khamassi (ISIR), Benoît Girard (ISIR)

(Submitted on 15 Feb 2018 (v1), last revised 13 Aug 2018 (this version, v2))

Abstract: During sleep and awake rest, the hippocampus replays sequences of place cells that have been activated during prior experiences. These replays have been interpreted as a memory consolidation process, but recent results suggest a possible interpretation in terms of reinforcement learning. The Dyna reinforcement learning algorithms use off-line replays to improve learning. Under a limited replay budget, a prioritized sweeping approach, which requires a model of the transitions to the predecessors, can be used to improve performance. We investigate whether such algorithms can explain the experimentally observed replays. We propose a neural network version of prioritized sweeping Q-learning, for which we developed a growing multiple expert algorithm able to cope with multiple predecessors. The resulting architecture improves the learning of simulated agents confronted with a navigation task. We predict that, in animals, learning the world model should occur during rest periods, and that the corresponding replays should be shuffled.
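
To make the idea of prioritized sweeping under a replay budget concrete, here is a minimal tabular sketch in Python. It is not the neural network architecture the abstract describes: the growing multiple expert model is replaced by a plain lookup table of predecessors, and the corridor environment, the function names (`corridor_model`, `prioritized_sweeping`) and all parameter values are assumptions chosen for illustration only.

```python
import heapq
from collections import defaultdict


def corridor_model(n_states=5):
    """Hypothetical toy world: states 0..n-1, last state is a terminal goal.

    Action 0 moves right, action 1 stays in place, so every non-initial
    state has more than one predecessor (state, action) pair.
    """
    model = {}                       # (s, a) -> (reward, next_state)
    predecessors = defaultdict(set)  # s -> {(s_prev, a_prev), ...}
    goal = n_states - 1
    for s in range(goal):
        model[(s, 0)] = (1.0 if s + 1 == goal else 0.0, s + 1)
        model[(s, 1)] = (0.0, s)
        predecessors[s + 1].add((s, 0))
        predecessors[s].add((s, 1))
    return model, predecessors


def prioritized_sweeping(model, predecessors, q, start, n_actions=2,
                         alpha=1.0, gamma=0.9, theta=1e-3, budget=50):
    """Replay at most `budget` modelled transitions, largest |TD error| first."""

    def td_error(s, a):
        r, s2 = model[(s, a)]
        best_next = max(q[(s2, b)] for b in range(n_actions))
        return r + gamma * best_next - q[(s, a)]

    queue = []  # heapq is a min-heap, so priorities are stored negated
    s, a = start
    priority = abs(td_error(s, a))
    if priority > theta:
        heapq.heappush(queue, (-priority, s, a))

    replays = 0
    while queue and replays < budget:
        _, s, a = heapq.heappop(queue)
        q[(s, a)] += alpha * td_error(s, a)
        replays += 1
        # Propagate backwards: every predecessor of s may now be surprising.
        for sp, ap in predecessors.get(s, ()):
            priority = abs(td_error(sp, ap))
            if priority > theta:
                heapq.heappush(queue, (-priority, sp, ap))
    return replays


model, predecessors = corridor_model()
q = defaultdict(float)
# The agent has just experienced the rewarded transition (3, right) -> goal.
n_replays = prioritized_sweeping(model, predecessors, q, start=(3, 0))
```

Starting from the single rewarded transition, the queue spreads value backwards through all predecessors, so that after the replays `q[(0, 0)]` is close to `gamma ** 3`. With `budget=1`, only the rewarded transition itself is updated, which is the situation where the ordering of replays matters most.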

Comments:	Living Machines 2018 (Paris, France)
Subjects:	Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
DOI:	10.1007/978-3-319-95972-6_4
Cite as:	arXiv:1802.05594 [cs.AI]
	(or arXiv:1802.05594v2 [cs.AI] for this version)

Submission history

From: Benoît Girard
[v1] Thu, 15 Feb 2018 15:15:19 GMT (418kb,D)
[v2] Mon, 13 Aug 2018 12:27:55 GMT (636kb,D)
