A Data Efficient End-To-End Spoken Language Understanding Architecture

Dinarelli, Marco; Kapoor, Nikita; Jabaian, Bassam; Besacier, Laurent

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2002

Computer Science > Computation and Language

Title: A Data Efficient End-To-End Spoken Language Understanding Architecture

Authors: Marco Dinarelli, Nikita Kapoor, Bassam Jabaian, Laurent Besacier

(Submitted on 14 Feb 2020)

Abstract: End-to-end architectures have been recently proposed for spoken language understanding (SLU) and semantic parsing. Based on a large amount of data, those models learn jointly acoustic and linguistic-sequential features. Such architectures give very good results in the context of domain, intent and slot detection, their application in a more complex semantic chunking and tagging task is less easy. For that, in many cases, models are combined with an external language model to enhance their performance.
In this paper we introduce a data efficient system which is trained end-to-end, with no additional, pre-trained external module. One key feature of our approach is an incremental training procedure where acoustic, language and semantic models are trained sequentially one after the other. The proposed model has a reasonable size and achieves competitive results with respect to state-of-the-art while using a small training dataset. In particular, we reach 24.02% Concept Error Rate (CER) on MEDIA/test while training on MEDIA/train without any additional data.

Comments:	Accepted to ICASSP 2020
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2002.05955 [cs.CL]
	(or arXiv:2002.05955v1 [cs.CL] for this version)

Submission history

From: Laurent Besacier [view email]
[v1] Fri, 14 Feb 2020 10:24:42 GMT (178kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2002.05955

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: A Data Efficient End-To-End Spoken Language Understanding Architecture

Submission history