Title: A Visuospatial Dataset for Naturalistic Verb Learning

Abstract: We introduce a new dataset for training and evaluating grounded language models. Our data is collected within a virtual reality environment and is designed to emulate the quality of language data to which a pre-verbal child is likely to have access: that is, naturalistic, spontaneous speech paired with richly grounded visuospatial context. We use the collected data to compare several distributional semantics models for verb learning. We evaluate neural models based on 2D (pixel) features as well as feature-engineered models based on 3D (symbolic, spatial) features, and show that neither modeling approach achieves satisfactory performance. Our results are consistent with evidence from child language acquisition that emphasizes the difficulty of learning verbs from naive distributional data. We discuss avenues for future work on cognitively inspired grounded language learning, and release our corpus with the intent of facilitating research on the topic.
Comments: 9 pages, 3 figures, starsem 2020
Subjects: Computation and Language (cs.CL)
ACM classes: I.2.7
Cite as: arXiv:2010.15225 [cs.CL]
  (or arXiv:2010.15225v1 [cs.CL] for this version)

Submission history

From: Dylan Ebert
[v1] Wed, 28 Oct 2020 20:47:13 GMT