We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Machine Learning

Title: Learning to Brachiate via Simplified Model Imitation

Abstract: Brachiation is the primary form of locomotion for gibbons and siamangs, in which these primates swing from tree limb to tree limb using only their arms. It is challenging to control because of the limited control authority, the required advance planning, and the precision of the required grasps. We present a novel approach to this problem using reinforcement learning, and as demonstrated on a finger-less 14-link planar model that learns to brachiate across challenging handhold sequences. Key to our method is the use of a simplified model, a point mass with a virtual arm, for which we first learn a policy that can brachiate across handhold sequences with a prescribed order. This facilitates the learning of the policy for the full model, for which it provides guidance by providing an overall center-of-mass trajectory to imitate, as well as for the timing of the holds. Lastly, the simplified model can also readily be used for planning suitable sequences of handholds in a given environment. Our results demonstrate brachiation motions with a variety of durations for the flight and hold phases, as well as emergent extra back-and-forth swings when this proves useful. The system is evaluated with a variety of ablations. The method enables future work towards more general 3D brachiation, as well as using simplified model imitation in other settings.
Comments: 11 pages, 6 figures. Accepted at SIGGRAPH 2022. For videos, supplementary material and code, visit the following URL this https URL
Subjects: Machine Learning (cs.LG); Graphics (cs.GR); Robotics (cs.RO)
DOI: 10.1145/3528233.3530728 10.1145/3528233.3530728 10.1145/3528233.3530728
Cite as: arXiv:2205.03943 [cs.LG]
  (or arXiv:2205.03943v1 [cs.LG] for this version)

Submission history

From: Daniele Reda [view email]
[v1] Sun, 8 May 2022 19:44:19 GMT (2870kb,D)

Link back to: arXiv, form interface, contact.