Co-Imitation: Learning Design and Behaviour by Imitation

Rajani, Chang; Arndt, Karol; Blanco-Mulero, David; Luck, Kevin Sebastian; Kyrki, Ville

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2209

Computer Science > Machine Learning

Title: Co-Imitation: Learning Design and Behaviour by Imitation

Authors: Chang Rajani, Karol Arndt, David Blanco-Mulero, Kevin Sebastian Luck, Ville Kyrki

(Submitted on 2 Sep 2022 (v1), last revised 7 Feb 2023 (this version, v2))

Abstract: The co-adaptation of robots has been a long-standing research endeavour with the goal of adapting both body and behaviour of a system for a given task, inspired by the natural evolution of animals. Co-adaptation has the potential to eliminate costly manual hardware engineering as well as improve the performance of systems. The standard approach to co-adaptation is to use a reward function for optimizing behaviour and morphology. However, defining and constructing such reward functions is notoriously difficult and often a significant engineering effort. This paper introduces a new viewpoint on the co-adaptation problem, which we call co-imitation: finding a morphology and a policy that allow an imitator to closely match the behaviour of a demonstrator. To this end we propose a co-imitation methodology for adapting behaviour and morphology by matching state distributions of the demonstrator. Specifically, we focus on the challenging scenario with mismatched state- and action-spaces between both agents. We find that co-imitation increases behaviour similarity across a variety of tasks and settings, and demonstrate co-imitation by transferring human walking, jogging and kicking skills onto a simulated humanoid.

Comments:	14 pages, 11 figures, accepted for AAAI-23
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Robotics (cs.RO)
Cite as:	arXiv:2209.01207 [cs.LG]
	(or arXiv:2209.01207v2 [cs.LG] for this version)

Submission history

From: Kevin Sebastian Luck [view email]
[v1] Fri, 2 Sep 2022 17:57:32 GMT (7531kb,D)
[v2] Tue, 7 Feb 2023 03:58:21 GMT (6920kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2209.01207

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Co-Imitation: Learning Design and Behaviour by Imitation

Submission history