Interactive Model with Structural Loss for Language-based Abductive Reasoning

Li, Linhao; Xu, Ming; Dong, Yongfeng; Li, Xin; Wang, Ao

Authors: Linhao Li, Ming Xu, Yongfeng Dong, Xin Li, Ao Wang

(Submitted on 1 Dec 2021 (v1), last revised 20 Dec 2022 (this version, v2))

Abstract: The abductive natural language inference task ($\alpha$NLI) aims to infer the most plausible explanation linking a cause and an event. In the $\alpha$NLI task, two observations are given, and the most plausible hypothesis must be selected from a set of candidates. Existing methods model the relation between each candidate hypothesis separately and penalize the inference network uniformly. In this paper, we argue that it is unnecessary to distinguish the reasoning abilities among correct hypotheses; similarly, all wrong hypotheses contribute equally when explaining the observations. Therefore, we propose to group the hypotheses instead of ranking them, and we design a structural loss called ``joint softmax focal loss''. Based on the observation that the hypotheses are generally semantically related, we design a novel interactive language model that exploits the rich interaction among competing hypotheses. We name this new model for $\alpha$NLI the Interactive Model with Structural Loss (IMSL). Experimental results show that IMSL achieves the highest performance on the RoBERTa-large pretrained model, with ACC and AUC improved by about 1\% and 5\%, respectively.
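
The idea of grouping hypotheses instead of ranking them can be illustrated with a small sketch. The abstract does not give the formula for the joint softmax focal loss, so the code below is not the authors' definition. It is only one plausible reading, assuming a softmax over all candidate scores, a loss that depends solely on the total probability assigned to the group of correct hypotheses, and a standard focal modulation term. The function name `grouped_softmax_focal_loss` and the parameter `gamma` are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of raw candidate scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def grouped_softmax_focal_loss(scores, labels, gamma=2.0):
    """Illustrative grouped focal loss (an assumption, not the paper's formula).

    scores: one plausibility score per candidate hypothesis.
    labels: 1 for a correct hypothesis, 0 for a wrong one.
    gamma:  focal exponent; gamma = 0 reduces to plain negative log-likelihood
            of the correct group.
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not any(y == 1 for y in labels):
        raise ValueError("at least one hypothesis must be labeled correct")

    probs = softmax(scores)
    # Grouping step: only the total mass on the correct group matters,
    # so correct hypotheses are not ranked against each other.
    p_correct = sum(p for p, y in zip(probs, labels) if y == 1)
    p_correct = max(p_correct, 1e-12)  # guard against log(0) on underflow
    # Focal term: down-weights examples the model already gets right.
    return -((1.0 - p_correct) ** gamma) * math.log(p_correct)


if __name__ == "__main__":
    # Two correct and two wrong hypotheses for one pair of observations.
    print(grouped_softmax_focal_loss([1.0, 3.0, 0.0, -1.0], [1, 1, 0, 0]))
```

Under this reading, swapping the scores of two correct hypotheses leaves the loss unchanged, which matches the abstract's claim that correct hypotheses need not be distinguished from one another.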

Comments:	The paper is under consideration at Pattern Recognition Letters
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2112.00284 [cs.CL]
	(or arXiv:2112.00284v2 [cs.CL] for this version)

Submission history

From: Ao Wang
[v1] Wed, 1 Dec 2021 05:21:07 GMT (266kb,D)
[v2] Tue, 20 Dec 2022 09:23:32 GMT (266kb,D)
