Unobserved Local Structures Make Compositional Generalization Hard

Bogin, Ben; Gupta, Shivanshu; Berant, Jonathan

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2201

Change to browse by:

Computer Science > Computation and Language

Title: Unobserved Local Structures Make Compositional Generalization Hard

Authors: Ben Bogin, Shivanshu Gupta, Jonathan Berant

(Submitted on 15 Jan 2022 (v1), last revised 22 Oct 2022 (this version, v2))

Abstract: While recent work has convincingly showed that sequence-to-sequence models struggle to generalize to new compositions (termed compositional generalization), little is known on what makes compositional generalization hard on a particular test instance. In this work, we investigate what are the factors that make generalization to certain test instances challenging. We first substantiate that indeed some examples are more difficult than others by showing that different models consistently fail or succeed on the same test instances. Then, we propose a criterion for the difficulty of an example: a test instance is hard if it contains a local structure that was not observed at training time. We formulate a simple decision rule based on this criterion and empirically show it predicts instance-level generalization well across 5 different semantic parsing datasets, substantially better than alternative decision rules. Last, we show local structures can be leveraged for creating difficult adversarial compositional splits and also to improve compositional generalization under limited training budgets by strategically selecting examples for the training set.

Comments:	EMNLP 2022
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2201.05899 [cs.CL]
	(or arXiv:2201.05899v2 [cs.CL] for this version)

Submission history

From: Ben Bogin [view email]
[v1] Sat, 15 Jan 2022 18:03:29 GMT (6601kb,D)
[v2] Sat, 22 Oct 2022 11:30:34 GMT (6510kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2201.05899

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Unobserved Local Structures Make Compositional Generalization Hard

Submission history