RobustLR: Evaluating Robustness to Logical Perturbation in Deductive Reasoning

Sanyal, Soumya; Liao, Zeyi; Ren, Xiang

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2205

Computer Science > Computation and Language

Title: RobustLR: Evaluating Robustness to Logical Perturbation in Deductive Reasoning

Authors: Soumya Sanyal, Zeyi Liao, Xiang Ren

(Submitted on 25 May 2022 (v1), last revised 8 Nov 2022 (this version, v2))

Abstract: Transformers have been shown to be able to perform deductive reasoning on a logical rulebase containing rules and statements written in English natural language. While the progress is promising, it is currently unclear if these models indeed perform logical reasoning by understanding the underlying logical semantics in the language. To this end, we propose RobustLR, a suite of evaluation datasets that evaluate the robustness of these models to minimal logical edits in rulebases and some standard logical equivalence conditions. In our experiments with RoBERTa and T5, we find that the models trained in prior works do not perform consistently on the different perturbations in RobustLR, thus showing that the models are not robust to the proposed logical perturbations. Further, we find that the models find it especially hard to learn logical negation and disjunction operators. Overall, using our evaluation sets, we demonstrate some shortcomings of the deductive reasoning-based language models, which can eventually help towards designing better models for logical reasoning over natural language. All the datasets and code base have been made publicly available.

Comments:	Accpeted at EMNLP 2022, code available at this https URL
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG); Logic in Computer Science (cs.LO)
Cite as:	arXiv:2205.12598 [cs.CL]
	(or arXiv:2205.12598v2 [cs.CL] for this version)

Submission history

From: Soumya Sanyal [view email]
[v1] Wed, 25 May 2022 09:23:50 GMT (7243kb,D)
[v2] Tue, 8 Nov 2022 06:14:13 GMT (656kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2205.12598

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: RobustLR: Evaluating Robustness to Logical Perturbation in Deductive Reasoning

Submission history