Parallel Actors and Learners: A Framework for Generating Scalable RL Implementations

Zhang, Chi; Kuppannagari, Sanmukh Rao; Prasanna, Viktor K

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2110

Computer Science > Machine Learning

Title: Parallel Actors and Learners: A Framework for Generating Scalable RL Implementations

Authors: Chi Zhang, Sanmukh Rao Kuppannagari, Viktor K Prasanna

(Submitted on 3 Oct 2021 (v1), last revised 22 Dec 2021 (this version, v2))

Abstract: Reinforcement Learning (RL) has achieved significant success in application domains such as robotics, games and health care. However, training RL agents is very time consuming. Current implementations exhibit poor performance due to challenges such as irregular memory accesses and thread-level synchronization overheads on CPU. In this work, we propose a framework for generating scalable reinforcement learning implementations on multi-core systems. Replay Buffer is a key component of RL algorithms which facilitates storage of samples obtained from environmental interactions and data sampling for the learning process. We define a new data structure for Prioritized Replay Buffer based on $K$-ary sum tree that supports asynchronous parallel insertions, sampling, and priority updates. To address the challenge of irregular memory accesses, we propose a novel data layout to store the nodes of the sum tree that reduces the number of cache misses. Additionally, we propose $\textit{lazy writing}$ mechanism to reduce thread-level synchronization overheads of the Replay Buffer operations. Our framework employs parallel actors to concurrently collect data via environmental interactions, and parallel learners to perform stochastic gradient descent using the collected data. Our framework supports a wide range of reinforcement learning algorithms including DQN, DDPG, etc. We demonstrate the effectiveness of our framework in accelerating RL algorithms by performing experiments on CPU + GPU platform using OpenAI benchmarks.

Comments:	10 pages. HiPC21
Subjects:	Machine Learning (cs.LG); Systems and Control (eess.SY)
Cite as:	arXiv:2110.01101 [cs.LG]
	(or arXiv:2110.01101v2 [cs.LG] for this version)

Submission history

From: Chi Zhang [view email]
[v1] Sun, 3 Oct 2021 21:00:53 GMT (3205kb,D)
[v2] Wed, 22 Dec 2021 22:46:01 GMT (3513kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2110.01101

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Parallel Actors and Learners: A Framework for Generating Scalable RL Implementations

Submission history