We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.AI

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Artificial Intelligence

Title: Navigation Turing Test (NTT): Learning to Evaluate Human-Like Navigation

Abstract: A key challenge on the path to developing agents that learn complex human-like behavior is the need to quickly and accurately quantify human-likeness. While human assessments of such behavior can be highly accurate, speed and scalability are limited. We address these limitations through a novel automated Navigation Turing Test (ANTT) that learns to predict human judgments of human-likeness. We demonstrate the effectiveness of our automated NTT on a navigation task in a complex 3D environment. We investigate six classification models to shed light on the types of architectures best suited to this task, and validate them against data collected through a human NTT. Our best models achieve high accuracy when distinguishing true human and agent behavior. At the same time, we show that predicting finer-grained human assessment of agents' progress towards human-like behavior remains unsolved. Our work takes an important step towards agents that more effectively learn complex human-like behavior.
Comments: All data collected throughout this study, plus the code to reproduce our analysis and ANTT are available at this https URL
Subjects: Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Journal reference: Proceedings of the 38th International Conference on Machine Learning (ICML), 139:2644-2653, 2021
Cite as: arXiv:2105.09637 [cs.AI]
  (or arXiv:2105.09637v2 [cs.AI] for this version)

Submission history

From: Sam Devlin [view email]
[v1] Thu, 20 May 2021 10:14:23 GMT (22626kb,D)
[v2] Wed, 28 Jul 2021 12:49:43 GMT (6297kb,D)

Link back to: arXiv, form interface, contact.