Simple Yet Effective Synthetic Dataset Construction for Unsupervised Opinion Summarization

Shen, Ming; Ma, Jie; Wang, Shuai; Vyas, Yogarshi; Dixit, Kalpit; Ballesteros, Miguel; Benajiba, Yassine

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2303

Change to browse by:

Computer Science > Computation and Language

Title: Simple Yet Effective Synthetic Dataset Construction for Unsupervised Opinion Summarization

Authors: Ming Shen, Jie Ma, Shuai Wang, Yogarshi Vyas, Kalpit Dixit, Miguel Ballesteros, Yassine Benajiba

(Submitted on 21 Mar 2023)

Abstract: Opinion summarization provides an important solution for summarizing opinions expressed among a large number of reviews. However, generating aspect-specific and general summaries is challenging due to the lack of annotated data. In this work, we propose two simple yet effective unsupervised approaches to generate both aspect-specific and general opinion summaries by training on synthetic datasets constructed with aspect-related review contents. Our first approach, Seed Words Based Leave-One-Out (SW-LOO), identifies aspect-related portions of reviews simply by exact-matching aspect seed words and outperforms existing methods by 3.4 ROUGE-L points on SPACE and 0.5 ROUGE-1 point on OPOSUM+ for aspect-specific opinion summarization. Our second approach, Natural Language Inference Based Leave-One-Out (NLI-LOO) identifies aspect-related sentences utilizing an NLI model in a more general setting without using seed words and outperforms existing approaches by 1.2 ROUGE-L points on SPACE for aspect-specific opinion summarization and remains competitive on other metrics.

Comments:	EACL 2023 Findings
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2303.11660 [cs.CL]
	(or arXiv:2303.11660v1 [cs.CL] for this version)

Submission history

From: Ming Shen [view email]
[v1] Tue, 21 Mar 2023 08:08:04 GMT (6786kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2303.11660

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Simple Yet Effective Synthetic Dataset Construction for Unsupervised Opinion Summarization

Submission history