Data Informed Residual Reinforcement Learning for High-Dimensional Robotic Tracking Control

Li, Cong; Liu, Fangzhou; Wang, Yongchao; Buss, Martin

(Submitted on 28 Oct 2021 (v1), last revised 29 Aug 2023 (this version, v4))

Abstract: The learning inefficiency of reinforcement learning (RL) from scratch hinders its practical application to continuous robotic tracking control, especially for high-dimensional robots. This work proposes a data informed residual reinforcement learning (DR-RL) based robotic tracking control scheme applicable to robots with high dimensionality. The proposed DR-RL methodology outperforms its standard RL from scratch counterpart in sample efficiency and scalability. Specifically, we first decouple the original robot into low-dimensional robotic subsystems, and then use one-step backward (OSBK) data to construct incremental subsystems that are equivalent model-free representations of the decoupled robotic subsystems. The formulated incremental subsystems allow for parallel learning, which relieves the computation load, and offer mathematical descriptions of robotic movements for theoretical analysis. Then, we apply DR-RL to learn the tracking control policy, a combination of an incremental base policy and an incremental residual policy, under a parallel learning architecture. The incremental residual policy uses the guidance from the incremental base policy as its learning initialization and further learns from interactions with the environment to endow the tracking control policy with adaptability to dynamically changing environments. The proposed DR-RL based tracking control scheme is developed with rigorous theoretical analysis of system stability and weight convergence, and is validated numerically in comparative simulations and experimentally on a 3-DoF robot manipulator, a setting in which counterpart RL methods would fail.
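The abstract gives no equations, so the following is only a rough illustration of the general idea of an incremental control law built from one-step backward data, with a base increment plus a residual increment. It is not the authors' algorithm. The scalar plant, the parameters `a`, `b`, `g_bar`, `kp`, the reference signal, and the function names are all hypothetical, and the residual policy is a zero placeholder where a learned RL policy would go.

```python
import math

def simulate(residual_policy=lambda e: 0.0, steps=2000, dt=0.001):
    """Track r(t) = sin(t) with an incremental control law (illustrative only)."""
    a, b = 1.5, 2.0   # true plant x_dot = -a*x + b*u (hypothetical, unknown to the controller)
    g_bar = 2.5       # rough guess of the input gain, the only "model" knowledge used
    kp = 20.0         # proportional gain on the tracking error (assumed)
    x, u_prev = 0.5, 0.0
    xdot_prev = -a * x + b * u_prev   # one-step backward derivative measurement
    errors = []
    for k in range(steps):
        t = k * dt
        r, r_dot = math.sin(t), math.cos(t)
        e = x - r
        errors.append(abs(e))
        v = r_dot - kp * e                   # desired state derivative
        # Incremental model: x_dot_k ~= x_dot_{k-1} + g_bar * (u_k - u_{k-1}),
        # so the base increment is computed from the previous step's data.
        du_base = (v - xdot_prev) / g_bar
        du_res = residual_policy(e)          # placeholder for a learned residual
        u = u_prev + du_base + du_res
        xdot = -a * x + b * u                # plant response (simulation only)
        x += dt * xdot                       # forward Euler integration
        u_prev, xdot_prev = u, xdot          # store one-step backward data
    return errors

if __name__ == "__main__":
    errs = simulate()
    print(f"initial |e| = {errs[0]:.3f}, final |e| = {errs[-1]:.6f}")
```

In this toy setting the base increment alone already drives the tracking error down, which mirrors the role the abstract assigns to the base policy as guidance; a learned residual would be passed in through `residual_policy`.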

Subjects:	Systems and Control (eess.SY)
Cite as:	arXiv:2110.15237 [eess.SY]
	(or arXiv:2110.15237v4 [eess.SY] for this version)

Submission history

From: Cong Li
[v1] Thu, 28 Oct 2021 15:51:36 GMT (7319kb)
[v2] Wed, 16 Mar 2022 20:00:19 GMT (4148kb,D)
[v3] Mon, 1 Aug 2022 14:33:59 GMT (4146kb,D)
[v4] Tue, 29 Aug 2023 14:21:18 GMT (1753kb,D)
