We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: Contextualized Attention-based Knowledge Transfer for Spoken Conversational Question Answering

Abstract: Spoken conversational question answering (SCQA) requires machines to model complex dialogue flow given the speech utterances and text corpora. Different from traditional text question answering (QA) tasks, SCQA involves audio signal processing, passage comprehension, and contextual understanding. However, ASR systems introduce unexpected noisy signals to the transcriptions, which result in performance degradation on SCQA. To overcome the problem, we propose CADNet, a novel contextualized attention-based distillation approach, which applies both cross-attention and self-attention to obtain ASR-robust contextualized embedding representations of the passage and dialogue history for performance improvements. We also introduce the spoken conventional knowledge distillation framework to distill the ASR-robust knowledge from the estimated probabilities of the teacher model to the student. We conduct extensive experiments on the Spoken-CoQA dataset and demonstrate that our approach achieves remarkable performance in this task.
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as: arXiv:2010.11066 [cs.CL]
  (or arXiv:2010.11066v4 [cs.CL] for this version)

Submission history

From: Chenyu You [view email]
[v1] Wed, 21 Oct 2020 15:17:18 GMT (82kb,D)
[v2] Sat, 27 Feb 2021 02:17:26 GMT (83kb,D)
[v3] Mon, 14 Jun 2021 20:04:11 GMT (1768kb,D)
[v4] Thu, 24 Jun 2021 16:32:18 GMT (1991kb,D)

Link back to: arXiv, form interface, contact.