Associative Memory Based Experience Replay for Deep Reinforcement Learning

Li, Mengyuan; Kazemi, Arman; Laguna, Ann Franchesca; Hu, X. Sharon

doi:10.1145/3508352.3549387

Full-text links:

Download:

Current browse context:

cs.AR

< prev | next >

new | recent | 2207

Computer Science > Hardware Architecture

Title: Associative Memory Based Experience Replay for Deep Reinforcement Learning

Authors: Mengyuan Li, Arman Kazemi, Ann Franchesca Laguna, X. Sharon Hu

(Submitted on 16 Jul 2022)

Abstract: Experience replay is an essential component in deep reinforcement learning (DRL), which stores the experiences and generates experiences for the agent to learn in real time. Recently, prioritized experience replay (PER) has been proven to be powerful and widely deployed in DRL agents. However, implementing PER on traditional CPU or GPU architectures incurs significant latency overhead due to its frequent and irregular memory accesses. This paper proposes a hardware-software co-design approach to design an associative memory (AM) based PER, AMPER, with an AM-friendly priority sampling operation. AMPER replaces the widely-used time-costly tree-traversal-based priority sampling in PER while preserving the learning performance. Further, we design an in-memory computing hardware architecture based on AM to support AMPER by leveraging parallel in-memory search operations. AMPER shows comparable learning performance while achieving 55x to 270x latency improvement when running on the proposed hardware compared to the state-of-the-art PER running on GPU.

Comments:	9 pages, 9 figures. The work was accepted by the 41st International Conference on Computer-Aided Design (ICCAD), 2022, San Diego
Subjects:	Hardware Architecture (cs.AR); Emerging Technologies (cs.ET); Machine Learning (cs.LG)
DOI:	10.1145/3508352.3549387
Cite as:	arXiv:2207.07791 [cs.AR]
	(or arXiv:2207.07791v1 [cs.AR] for this version)

Submission history

From: Mengyuan Li [view email]
[v1] Sat, 16 Jul 2022 00:12:12 GMT (6875kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2207.07791

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Hardware Architecture

Title: Associative Memory Based Experience Replay for Deep Reinforcement Learning

Submission history