References & Citations
Computer Science > Robotics
Title: Residual Policy Learning for Shared Autonomy
(Submitted on 10 Apr 2020 (v1), last revised 10 Jul 2020 (this version, v2))
Abstract: Shared autonomy provides an effective framework for human-robot collaboration that takes advantage of the complementary strengths of humans and robots to achieve common goals. Many existing approaches to shared autonomy make restrictive assumptions that the goal space, environment dynamics, or human policy are known a priori, or are limited to discrete action spaces, preventing those methods from scaling to complicated real world environments. We propose a model-free, residual policy learning algorithm for shared autonomy that alleviates the need for these assumptions. Our agents are trained to minimally adjust the human's actions such that a set of goal-agnostic constraints are satisfied. We test our method in two continuous control environments: Lunar Lander, a 2D flight control domain, and a 6-DOF quadrotor reaching task. In experiments with human and surrogate pilots, our method significantly improves task performance without any knowledge of the human's goal beyond the constraints. These results highlight the ability of model-free deep reinforcement learning to realize assistive agents suited to continuous control settings with little knowledge of user intent.
Submission history
From: Charles Schaff [view email][v1] Fri, 10 Apr 2020 16:31:15 GMT (360kb,D)
[v2] Fri, 10 Jul 2020 15:00:42 GMT (360kb,D)
Link back to: arXiv, form interface, contact.