Title: Configuration Path Control

Authors: Sergey Pankov
Abstract: Reinforcement learning methods often produce brittle policies -- policies that perform well during training but generalize poorly beyond their direct training experience, and thus become unstable under small disturbances. To address this issue, we propose a method for stabilizing a control policy in the space of configuration paths. It is applied post-training and relies purely on the data produced during training, as well as on an instantaneous control-matrix estimation. The approach is evaluated empirically on a planar bipedal walker subjected to a variety of perturbations. The control policies obtained via reinforcement learning are compared against their stabilized counterparts. Across different experiments, we find a two- to four-fold increase in stability, measured in terms of the perturbation amplitudes. We also provide a zero-dynamics interpretation of our approach.
Comments: 12 pages, 3 figures, accepted for publication
Subjects: Robotics (cs.RO); Machine Learning (cs.LG); Systems and Control (eess.SY)
Journal reference: Int. J. Control Autom. Syst. 21, 306-317 (2023)
DOI: 10.1007/s12555-021-0466-5
Cite as: arXiv:2204.02471 [cs.RO]
  (or arXiv:2204.02471v1 [cs.RO] for this version)

Submission history

From: Sergey Pankov
[v1] Tue, 5 Apr 2022 20:11:39 GMT (232kb,D)