We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computation and Language

Title: Mind The Facts: Knowledge-Boosted Coherent Abstractive Text Summarization

Abstract: Neural models have become successful at producing abstractive summaries that are human-readable and fluent. However, these models have two critical shortcomings: they often don't respect the facts that are either included in the source article or are known to humans as commonsense knowledge, and they don't produce coherent summaries when the source article is long. In this work, we propose a novel architecture that extends Transformer encoder-decoder architecture in order to improve on these shortcomings. First, we incorporate entity-level knowledge from the Wikidata knowledge graph into the encoder-decoder architecture. Injecting structural world knowledge from Wikidata helps our abstractive summarization model to be more fact-aware. Second, we utilize the ideas used in Transformer-XL language model in our proposed encoder-decoder architecture. This helps our model with producing coherent summaries even when the source article is long. We test our model on CNN/Daily Mail summarization dataset and show improvements on ROUGE scores over the baseline Transformer model. We also include model predictions for which our model accurately conveys the facts, while the baseline Transformer model doesn't.
Comments: NeurIPS 2019, Knowledge Representation & Reasoning Meets Machine Learning (KR2ML workshop)
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2006.15435 [cs.CL]
  (or arXiv:2006.15435v1 [cs.CL] for this version)

Submission history

From: Chenguang Zhu [view email]
[v1] Sat, 27 Jun 2020 20:06:15 GMT (379kb,D)

Link back to: arXiv, form interface, contact.