Title: Towards Improving Selective Prediction Ability of NLP Systems

Abstract: It's better to say "I can't answer" than to answer incorrectly. This selective prediction ability is crucial for NLP systems to be reliably deployed in real-world applications. Prior work has shown that existing selective prediction techniques fail to perform well, especially in the out-of-domain setting. In this work, we propose a method that improves probability estimates of models by calibrating them using prediction confidence and difficulty score of instances. Using these two signals, we first annotate held-out instances and then train a calibrator to predict the likelihood of correctness of the model's prediction. We instantiate our method with Natural Language Inference (NLI) and Duplicate Detection (DD) tasks and evaluate it in both In-Domain (IID) and Out-of-Domain (OOD) settings. In (IID, OOD) settings, we show that the representations learned by our calibrator result in an improvement of (15.81%, 5.64%) and (6.19%, 13.9%) over 'MaxProb' -- a selective prediction baseline -- on NLI and DD tasks respectively.
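
To make the 'MaxProb' baseline named in the abstract concrete, here is a minimal sketch of selective prediction that abstains when the highest class probability falls below a threshold. It illustrates the baseline only, not the proposed calibrator. The function names, the threshold value, and the coverage/risk summary are assumptions chosen for illustration and do not come from the paper.

```python
from typing import Optional, Sequence, Tuple


def maxprob_predict(probs: Sequence[float], threshold: float) -> Optional[int]:
    """Return the argmax label, or None ("I can't answer") when the
    maximum class probability is below the (assumed) threshold."""
    confidence = max(probs)
    if confidence < threshold:
        return None  # abstain
    return list(probs).index(confidence)


def coverage_and_risk(
    all_probs: Sequence[Sequence[float]],
    gold: Sequence[int],
    threshold: float,
) -> Tuple[float, float]:
    """Coverage = fraction of instances answered.
    Risk = error rate among the answered instances."""
    answered = 0
    errors = 0
    for probs, label in zip(all_probs, gold):
        pred = maxprob_predict(probs, threshold)
        if pred is None:
            continue  # abstained instances count against coverage only
        answered += 1
        errors += int(pred != label)
    coverage = answered / len(gold) if gold else 0.0
    risk = errors / answered if answered else 0.0
    return coverage, risk


# Hypothetical softmax outputs for three instances of a 3-class task.
example_probs = [[0.9, 0.05, 0.05], [0.4, 0.35, 0.25], [0.2, 0.7, 0.1]]
example_gold = [0, 1, 2]
coverage, risk = coverage_and_risk(example_probs, example_gold, threshold=0.6)
```

Under the approach the abstract describes, the abstention decision would presumably rest on the calibrator's estimated likelihood of correctness rather than on the raw maximum probability. The thresholding step could stay the same.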
Comments: ACL 2022 RepL4NLP Workshop
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as: arXiv:2008.09371 [cs.CL]
  (or arXiv:2008.09371v3 [cs.CL] for this version)

Submission history

From: Neeraj Varshney
[v1] Fri, 21 Aug 2020 08:46:36 GMT (9044kb,D)
[v2] Tue, 29 Mar 2022 06:35:11 GMT (250kb,D)
[v3] Thu, 7 Apr 2022 00:22:04 GMT (250kb,D)