Title: Interactively Picking Real-World Objects with Unconstrained Spoken Language Instructions

Abstract: Comprehension of spoken natural language is an essential component for robots to communicate with humans effectively. However, handling unconstrained spoken instructions is challenging due to (1) complex structures, including the wide variety of expressions used in spoken language, and (2) inherent ambiguity in the interpretation of human instructions. In this paper, we propose the first comprehensive system that can handle unconstrained spoken language and effectively resolve ambiguity in spoken instructions. Specifically, we integrate deep-learning-based object detection with natural language processing technologies to handle unconstrained spoken instructions, and propose a method for robots to resolve instruction ambiguity through dialogue. Through experiments both in a simulated environment and on a physical industrial robot arm, we demonstrate that our system understands natural instructions from human operators effectively, and that an interactive clarification process yields higher success rates on the object picking task.
Comments: 9 pages. International Conference on Robotics and Automation (ICRA) 2018. Accompanying videos are available at the following links: this https URL (the system submitted to ICRA-2018) and this http URL (with improvements after ICRA-2018 submission)
Subjects: Robotics (cs.RO); Computation and Language (cs.CL)
Cite as: arXiv:1710.06280 [cs.RO]
  (or arXiv:1710.06280v2 [cs.RO] for this version)

Submission history

From: Sosuke Kobayashi
[v1] Tue, 17 Oct 2017 13:46:59 GMT (2479kb,D)
[v2] Wed, 28 Mar 2018 03:11:49 GMT (3162kb,D)
