Deep Reinforced Self-Attention Masks for Abstractive Summarization (DR.SAS)

Chadha, Ankit; Masoud, Mohamed

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2001

Change to browse by:

Computer Science > Computation and Language

Title: Deep Reinforced Self-Attention Masks for Abstractive Summarization (DR.SAS)

Authors: Ankit Chadha, Mohamed Masoud

(Submitted on 30 Dec 2019)

Abstract: We present a novel architectural scheme to tackle the abstractive summarization problem based on the CNN/DMdataset which fuses Reinforcement Learning (RL) withUniLM, which is a pre-trained Deep Learning Model, to solve various natural language tasks. We have tested the limits of learning fine-grained attention in Transformers to improve the summarization quality. UniLM applies attention to the entire token space in a global fashion. We propose DR.SAS which applies the Actor-Critic (AC) algorithm to learn a dynamic self-attention distribution over the tokens to reduce redundancy and generate factual and coherent summaries to improve the quality of summarization. After performing hyperparameter tuning, we achievedbetter ROUGE results compared to the baseline. Our model tends to be more extractive/factual yet coherent in detail because of optimization over ROUGE rewards. We present detailed error analysis with examples of the strengths and limitations of our model. Our codebase will be publicly available on our GitHub.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2001.00009 [cs.CL]
	(or arXiv:2001.00009v1 [cs.CL] for this version)

Submission history

From: Ankit Chadha Mr. [view email]
[v1] Mon, 30 Dec 2019 01:32:42 GMT (4388kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2001.00009

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Deep Reinforced Self-Attention Masks for Abstractive Summarization (DR.SAS)

Submission history