We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: Machines Getting with the Program: Understanding Intent Arguments of Non-Canonical Directives

Abstract: Modern dialog managers face the challenge of having to fulfill human-level conversational skills as part of common user expectations, including but not limited to discourse with no clear objective. Along with these requirements, agents are expected to extrapolate intent from the user's dialogue even when subjected to non-canonical forms of speech. This depends on the agent's comprehension of paraphrased forms of such utterances. In low-resource languages, the lack of data is a bottleneck that prevents advancements of the comprehension performance for these types of agents. In this paper, we demonstrate the necessity of being able to extract the intent argument of non-canonical directives, and also define guidelines for building paired corpora for this purpose. Following the guidelines, we label a dataset consisting of 30K instances of question/command-intent pairs, including annotations for a classification task for predicting the utterance type. We also propose a method for mitigating class imbalance in the final dataset, and demonstrate the potential applications of the corpus generation method and dataset.
Comments: Submitted to LREC 2020; 9 pages, 2 figures, 4 tables
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:1912.00342 [cs.CL]
  (or arXiv:1912.00342v1 [cs.CL] for this version)

Submission history

From: Won Ik Cho [view email]
[v1] Sun, 1 Dec 2019 07:08:19 GMT (100kb,D)
[v2] Wed, 7 Oct 2020 08:55:30 GMT (199kb,D)

Link back to: arXiv, form interface, contact.