Low-Resource Compositional Semantic Parsing with Concept Pretraining

Rongali, Subendhu; Sridhar, Mukund; Khan, Haidar; Arkoudas, Konstantine; Hamza, Wael; McCallum, Andrew

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2301

Change to browse by:

Computer Science > Computation and Language

Title: Low-Resource Compositional Semantic Parsing with Concept Pretraining

Authors: Subendhu Rongali, Mukund Sridhar, Haidar Khan, Konstantine Arkoudas, Wael Hamza, Andrew McCallum

(Submitted on 24 Jan 2023 (v1), last revised 30 Jan 2023 (this version, v2))

Abstract: Semantic parsing plays a key role in digital voice assistants such as Alexa, Siri, and Google Assistant by mapping natural language to structured meaning representations. When we want to improve the capabilities of a voice assistant by adding a new domain, the underlying semantic parsing model needs to be retrained using thousands of annotated examples from the new domain, which is time-consuming and expensive. In this work, we present an architecture to perform such domain adaptation automatically, with only a small amount of metadata about the new domain and without any new training data (zero-shot) or with very few examples (few-shot). We use a base seq2seq (sequence-to-sequence) architecture and augment it with a concept encoder that encodes intent and slot tags from the new domain. We also introduce a novel decoder-focused approach to pretrain seq2seq models to be concept aware using Wikidata and use it to help our model learn important concepts and perform well in low-resource settings. We report few-shot and zero-shot results for compositional semantic parsing on the TOPv2 dataset and show that our model outperforms prior approaches in few-shot settings for the TOPv2 and SNIPS datasets.

Comments:	EACL 2023
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2301.09809 [cs.CL]
	(or arXiv:2301.09809v2 [cs.CL] for this version)

Submission history

From: Subendhu Rongali [view email]
[v1] Tue, 24 Jan 2023 04:27:27 GMT (6938kb,D)
[v2] Mon, 30 Jan 2023 20:49:49 GMT (6938kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2301.09809v2

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Low-Resource Compositional Semantic Parsing with Concept Pretraining

Submission history