Exploiting Unlabeled Data for Target-Oriented Opinion Words Extraction

Wang, Yidong; Wu, Hao; Liu, Ao; Hou, Wenxin; Wu, Zhen; Wang, Jindong; Shinozaki, Takahiro; Okumura, Manabu; Zhang, Yue

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2208

Change to browse by:

Computer Science > Computation and Language

Title: Exploiting Unlabeled Data for Target-Oriented Opinion Words Extraction

Authors: Yidong Wang, Hao Wu, Ao Liu, Wenxin Hou, Zhen Wu, Jindong Wang, Takahiro Shinozaki, Manabu Okumura, Yue Zhang

(Submitted on 17 Aug 2022)

Abstract: Target-oriented Opinion Words Extraction (TOWE) is a fine-grained sentiment analysis task that aims to extract the corresponding opinion words of a given opinion target from the sentence. Recently, deep learning approaches have made remarkable progress on this task. Nevertheless, the TOWE task still suffers from the scarcity of training data due to the expensive data annotation process. Limited labeled data increase the risk of distribution shift between test data and training data. In this paper, we propose exploiting massive unlabeled data to reduce the risk by increasing the exposure of the model to varying distribution shifts. Specifically, we propose a novel Multi-Grained Consistency Regularization (MGCR) method to make use of unlabeled data and design two filters specifically for TOWE to filter noisy data at different granularity. Extensive experimental results on four TOWE benchmark datasets indicate the superiority of MGCR compared with current state-of-the-art methods. The in-depth analysis also demonstrates the effectiveness of the different-granularity filters. Our codes are available at this https URL

Comments:	Accepted by COLING 2022
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2208.08280 [cs.CL]
	(or arXiv:2208.08280v1 [cs.CL] for this version)

Submission history

From: Zhen Wu [view email]
[v1] Wed, 17 Aug 2022 13:19:26 GMT (1148kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2208.08280

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Exploiting Unlabeled Data for Target-Oriented Opinion Words Extraction

Submission history