Automatic Summarization of Russian Texts: Comparison of Extractive and Abstractive Methods

Goloviznina, Valeriya; Kotelnikov, Evgeny

doi:10.28995/2075-7182-2022-21-223-235

Full-text links:

Download:

PDF only

Current browse context:

cs.CL

< prev | next >

new | recent | 2206

Change to browse by:

Computer Science > Computation and Language

Title: Automatic Summarization of Russian Texts: Comparison of Extractive and Abstractive Methods

Authors: Valeriya Goloviznina, Evgeny Kotelnikov

(Submitted on 18 Jun 2022)

Abstract: The development of large and super-large language models, such as GPT-3, T5, Switch Transformer, ERNIE, etc., has significantly improved the performance of text generation. One of the important research directions in this area is the generation of texts with arguments. The solution of this problem can be used in business meetings, political debates, dialogue systems, for preparation of student essays. One of the main domains for these applications is the economic sphere. The key problem of the argument text generation for the Russian language is the lack of annotated argumentation corpora. In this paper, we use translated versions of the Argumentative Microtext, Persuasive Essays and UKP Sentential corpora to fine-tune RuBERT model. Further, this model is used to annotate the corpus of economic news by argumentation. Then the annotated corpus is employed to fine-tune the ruGPT-3 model, which generates argument texts. The results show that this approach improves the accuracy of the argument generation by more than 20 percentage points (63.2% vs. 42.5%) compared to the original ruGPT-3 model.

Comments:	Accepted by Dialogue-2022 conference
Subjects:	Computation and Language (cs.CL)
DOI:	10.28995/2075-7182-2022-21-223-235
Cite as:	arXiv:2206.09253 [cs.CL]
	(or arXiv:2206.09253v1 [cs.CL] for this version)

Submission history

From: Evgeny Kotelnikov [view email]
[v1] Sat, 18 Jun 2022 17:28:04 GMT (1389kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2206.09253

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Automatic Summarization of Russian Texts: Comparison of Extractive and Abstractive Methods

Submission history