Enhancing Pre-trained Language Model with Lexical Simplification

Bao, Rongzhou; Wang, Jiayi; Zhang, Zhuosheng; Zhao, Hai

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2012

Change to browse by:

Computer Science > Computation and Language

Title: Enhancing Pre-trained Language Model with Lexical Simplification

Authors: Rongzhou Bao, Jiayi Wang, Zhuosheng Zhang, Hai Zhao

(Submitted on 30 Dec 2020)

Abstract: For both human readers and pre-trained language models (PrLMs), lexical diversity may lead to confusion and inaccuracy when understanding the underlying semantic meanings of given sentences. By substituting complex words with simple alternatives, lexical simplification (LS) is a recognized method to reduce such lexical diversity, and therefore to improve the understandability of sentences. In this paper, we leverage LS and propose a novel approach which can effectively improve the performance of PrLMs in text classification. A rule-based simplification process is applied to a given sentence. PrLMs are encouraged to predict the real label of the given sentence with auxiliary inputs from the simplified version. Using strong PrLMs (BERT and ELECTRA) as baselines, our approach can still further improve the performance in various text classification tasks.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2012.15070 [cs.CL]
	(or arXiv:2012.15070v1 [cs.CL] for this version)

Submission history

From: Rongzhou Bao [view email]
[v1] Wed, 30 Dec 2020 07:49:00 GMT (349kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2012.15070

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Enhancing Pre-trained Language Model with Lexical Simplification

Submission history