What does the Failure to Reason with "Respectively" in Zero/Few-Shot Settings Tell Us about Language Models?

Cui, Ruixiang; Lee, Seolhwa; Hershcovich, Daniel; Søgaard, Anders

(Submitted on 31 May 2023)

Abstract: Humans can effortlessly understand the coordinate structure of sentences such as "Niels Bohr and Kurt Cobain were born in Copenhagen and Seattle, respectively". In the context of natural language inference (NLI), we examine how language models (LMs) reason with respective readings (Gawron and Kehler, 2004) from two perspectives: syntactic-semantic and commonsense-world knowledge. We propose a controlled synthetic dataset, WikiResNLI, and a naturally occurring dataset, NatResNLI, to encompass various explicit and implicit realizations of "respectively". We show that fine-tuned NLI models struggle with understanding such readings without explicit supervision. While few-shot learning is easy in the presence of explicit cues, longer training is required when the reading is evoked implicitly, leaving models to rely on commonsense inferences. Furthermore, our fine-grained analysis indicates that models fail to generalize across different constructions. To conclude, we demonstrate that LMs still lag behind humans in generalizing to the long tail of linguistic constructions.

Comments:	To appear at ACL 2023
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2305.19597 [cs.CL]
	(or arXiv:2305.19597v1 [cs.CL] for this version)

Submission history

From: Ruixiang Cui
[v1] Wed, 31 May 2023 06:45:09 GMT (6930kb,D)
