Title: Explain and Predict, and then Predict Again

Abstract: A desirable property of learning systems is to be both effective and interpretable. Towards this goal, recent models, called explain-then-predict models, first generate an extractive explanation from the input text and then generate a prediction based on just the explanation. These models primarily use the task input as the supervision signal for learning an extractive explanation and do not effectively integrate rationale data as an additional inductive bias to improve task performance. We propose a novel yet simple approach, ExPred, that uses multi-task learning in the explanation generation phase, effectively trading off explanation and prediction losses. We then use another prediction network on just the extracted explanations to optimize task performance. We conduct an extensive evaluation of our approach on three diverse language datasets (fact verification, sentiment classification, and QA) and find that we substantially outperform existing approaches.
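
The two phases described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the weighting parameter `lam`, the 0.5 threshold, and the "." placeholder are all hypothetical, and the model outputs are hard-coded lists standing in for a trained network. It only shows how a prediction loss and a token-level explanation loss might be combined into one multi-task objective, and how the extracted explanation could be passed on as the sole input to a second predictor.

```python
import math

def binary_cross_entropy(probs, targets, eps=1e-12):
    """Mean token-level BCE between rationale probabilities and a 0/1 human rationale mask."""
    total = 0.0
    for p, t in zip(probs, targets):
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        total += -(t * math.log(p) + (1 - t) * math.log(1.0 - p))
    return total / len(probs)

def cross_entropy(class_probs, label, eps=1e-12):
    """Negative log-likelihood of the gold task label."""
    return -math.log(max(class_probs[label], eps))

def joint_loss(class_probs, label, rationale_probs, rationale_mask, lam=1.0):
    """Phase 1 (assumed form): prediction loss plus lam times explanation loss."""
    pred = cross_entropy(class_probs, label)
    expl = binary_cross_entropy(rationale_probs, rationale_mask)
    return pred + lam * expl

def extract_explanation(tokens, rationale_probs, threshold=0.5, placeholder="."):
    """Phase 2 input (assumed form): keep only tokens selected as rationale."""
    return [tok if p >= threshold else placeholder
            for tok, p in zip(tokens, rationale_probs)]

# Hard-coded stand-ins for the outputs of a hypothetical shared encoder.
tokens = ["the", "movie", "was", "truly", "wonderful"]
rationale_probs = [0.1, 0.2, 0.1, 0.8, 0.9]
rationale_mask = [0, 0, 0, 1, 1]   # human-annotated rationale
class_probs = [0.3, 0.7]           # e.g. [negative, positive]

loss = joint_loss(class_probs, 1, rationale_probs, rationale_mask, lam=2.0)
explanation = extract_explanation(tokens, rationale_probs)
print(loss)
print(explanation)  # the second predictor would see only this masked input
```

Setting `lam` to 0 in this sketch reduces the objective to the prediction loss alone, which is one way to see the role the rationale supervision plays as an additional training signal.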
Comments: Accepted at WSDM 2021
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
ACM classes: I.2.m; I.2.7
DOI: 10.1145/3437963.3441758
Cite as: arXiv:2101.04109 [cs.CL]
  (or arXiv:2101.04109v2 [cs.CL] for this version)

Submission history

From: Zijian Zhang
[v1] Mon, 11 Jan 2021 19:36:52 GMT (5706kb,D)
[v2] Thu, 4 Feb 2021 05:19:23 GMT (12916kb,D)
