We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: A Hybrid Neural Network Model for Commonsense Reasoning

Abstract: This paper proposes a hybrid neural network (HNN) model for commonsense reasoning. An HNN consists of two component models, a masked language model and a semantic similarity model, which share a BERT-based contextual encoder but use different model-specific input and output layers. HNN obtains new state-of-the-art results on three classic commonsense reasoning tasks, pushing the WNLI benchmark to 89%, the Winograd Schema Challenge (WSC) benchmark to 75.1%, and the PDP60 benchmark to 90.0%. An ablation study shows that language models and semantic similarity models are complementary approaches to commonsense reasoning, and HNN effectively combines the strengths of both. The code and pre-trained models will be publicly available at this https URL
Comments: 9 pages, 3 figures, 6 tables
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:1907.11983 [cs.CL]
  (or arXiv:1907.11983v1 [cs.CL] for this version)

Submission history

From: Xiaodong Liu [view email]
[v1] Sat, 27 Jul 2019 21:51:52 GMT (318kb,D)

Link back to: arXiv, form interface, contact.