Parallel Hierarchical Transformer with Attention Alignment for Abstractive Multi-Document Summarization

Ma, Ye; Zong, Lu

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2208

Computer Science > Computation and Language

Title: Parallel Hierarchical Transformer with Attention Alignment for Abstractive Multi-Document Summarization

Authors: Ye Ma, Lu Zong

(Submitted on 16 Aug 2022)

Abstract: In comparison to single-document summarization, abstractive Multi-Document Summarization (MDS) brings challenges on the representation and coverage of its lengthy and linked sources. This study develops a Parallel Hierarchical Transformer (PHT) with attention alignment for MDS. By incorporating word- and paragraph-level multi-head attentions, the hierarchical architecture of PHT allows better processing of dependencies at both token and document levels. To guide the decoding towards a better coverage of the source documents, the attention-alignment mechanism is then introduced to calibrate beam search with predicted optimal attention distributions. Based on the WikiSum data, a comprehensive evaluation is conducted to test improvements on MDS by the proposed architecture. By better handling the inner- and cross-document information, results in both ROUGE and human evaluation suggest that our hierarchical model generates summaries of higher quality relative to other Transformer-based baselines at relatively low computational cost.

Comments:	A work in 2020. arXiv admin note: substantial text overlap with arXiv:2009.06891
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2208.07845 [cs.CL]
	(or arXiv:2208.07845v1 [cs.CL] for this version)

Submission history

From: Ye Ma [view email]
[v1] Tue, 16 Aug 2022 17:02:48 GMT (543kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2208.07845

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Parallel Hierarchical Transformer with Attention Alignment for Abstractive Multi-Document Summarization

Submission history