Configuration Path Control

Pankov, Sergey

doi:10.1007/s12555-021-0466-5

Computer Science > Robotics

Title: Configuration Path Control

Authors: Sergey Pankov

(Submitted on 5 Apr 2022)

Abstract: Reinforcement learning methods often produce brittle policies -- policies that perform well during training but generalize poorly beyond their direct training experience, and thus become unstable under small disturbances. To address this issue, we propose a method for stabilizing a control policy in the space of configuration paths. It is applied post-training and relies purely on the data produced during training, as well as on an instantaneous control-matrix estimation. The approach is evaluated empirically on a planar bipedal walker subjected to a variety of perturbations. The control policies obtained via reinforcement learning are compared against their stabilized counterparts. Across different experiments, we find a two- to four-fold increase in stability, measured in terms of the perturbation amplitudes. We also provide a zero-dynamics interpretation of our approach.

Comments:	12 pages, 3 figures, accepted for publication
Subjects:	Robotics (cs.RO); Machine Learning (cs.LG); Systems and Control (eess.SY)
Journal reference:	Int. J. Control Autom. Syst. 21, 306-317 (2023)
DOI:	10.1007/s12555-021-0466-5
Cite as:	arXiv:2204.02471 [cs.RO]
	(or arXiv:2204.02471v1 [cs.RO] for this version)

Submission history

From: Sergey Pankov
[v1] Tue, 5 Apr 2022 20:11:39 GMT (232kb,D)
