Title: Generating Natural Language Proofs with Verifier-Guided Search

Abstract: Deductive reasoning over natural language is a challenging problem in NLP. In this work, we focus on proof generation: Given a hypothesis and a set of supporting facts, the model generates a proof tree indicating how to deduce the hypothesis from the supporting facts. Compared to generating the entire proof in one shot, stepwise generation can better exploit compositionality and generalize to longer proofs, but it has achieved limited success on real-world data. Existing stepwise methods struggle to generate proof steps that are both logically valid and relevant to the hypothesis. Instead, they tend to hallucinate invalid steps given the hypothesis. In this paper, we present a novel stepwise method, NLProofS (Natural Language Proof Search), which learns to generate relevant steps by conditioning on the hypothesis. At the core of our approach, we train an independent verifier to check the validity of the proof steps and thereby prevent hallucination. Instead of generating steps greedily, we search for proofs that maximize a global proof score judged by the verifier. NLProofS achieves state-of-the-art performance on EntailmentBank and RuleTaker. Specifically, it improves the correctness of predicted proofs from 27.7% to 33.3% in the distractor setting of EntailmentBank, demonstrating the effectiveness of NLProofS in generating challenging human-authored proofs.
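The idea of searching for a proof that maximizes a global, verifier-judged score (rather than picking each step greedily) can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not taken from the abstract: the `Step` type, the hard-coded `verifier_score` values, the choice of `min` as the way step scores are aggregated into a proof score, and the fixed-point search loop. In a real system the candidate steps would come from a trained generator and the scores from a trained verifier.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, List, Tuple


@dataclass(frozen=True)
class Step:
    """A hypothetical proof step: premises entail a conclusion."""
    premises: Tuple[str, ...]
    conclusion: str
    verifier_score: float  # assumed validity score in [0, 1]


def best_proof_scores(facts: Iterable[str], steps: List[Step]) -> Dict[str, float]:
    """Return the best achievable proof score for every derivable sentence.

    Assumption: a proof's score is the minimum over the verifier scores of
    all steps it contains; supporting facts count as fully trusted (1.0).
    """
    best = {fact: 1.0 for fact in facts}
    changed = True
    while changed:  # repeat until no sentence's score can be improved
        changed = False
        for step in steps:
            if not all(p in best for p in step.premises):
                continue  # some premise has not been derived yet
            score = min([step.verifier_score] + [best[p] for p in step.premises])
            if score > best.get(step.conclusion, 0.0):
                best[step.conclusion] = score
                changed = True
    return best


facts = ["f1", "f2", "f3"]
steps = [
    Step(("f1", "f2"), "i1", 0.9),  # solid intermediate conclusion
    Step(("i1", "f3"), "h", 0.8),   # reaches the hypothesis via i1
    Step(("f2", "f3"), "i2", 0.3),  # weak (possibly hallucinated) step
    Step(("i2",), "h", 0.95),       # looks strong locally, rests on i2
]
best = best_proof_scores(facts, steps)
print(best["h"])
```

In this toy example the final step with the highest local score (0.95) depends on a weak intermediate step, so a greedy choice at the last step would yield a worse proof than the one selected by the global score.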
Comments: EMNLP 2022. Code and models are available at this https URL
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG); Logic in Computer Science (cs.LO)
Cite as: arXiv:2205.12443 [cs.CL]
  (or arXiv:2205.12443v2 [cs.CL] for this version)

Submission history

From: Kaiyu Yang
[v1] Wed, 25 May 2022 02:22:30 GMT (309kb,D)
[v2] Tue, 18 Oct 2022 17:33:26 GMT (317kb,D)
[v3] Fri, 21 Oct 2022 20:08:11 GMT (343kb,D)
