CLEAR: Contrastive Learning for Sentence Representation

Wu, Zhuofeng; Wang, Sinong; Gu, Jiatao; Khabsa, Madian; Sun, Fei; Ma, Hao

(Submitted on 31 Dec 2020)

Abstract: Pre-trained language models have proven their unique powers in capturing implicit language features. However, most pre-training approaches focus on the word-level training objective, while sentence-level objectives are rarely studied. In this paper, we propose Contrastive LEArning for sentence Representation (CLEAR), which employs multiple sentence-level augmentation strategies in order to learn a noise-invariant sentence representation. These augmentations include word and span deletion, reordering, and substitution. Furthermore, we investigate the key reasons that make contrastive learning effective through numerous experiments. We observe that different sentence augmentations during pre-training lead to different performance improvements on various downstream tasks. Our approach is shown to outperform multiple existing methods on both SentEval and GLUE benchmarks.
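As a rough illustration of the four augmentation families named in the abstract (word deletion, span deletion, reordering, substitution), the sketch below produces two perturbed views of one sentence, which is the kind of positive pair a contrastive objective would pull together. The abstract does not specify the paper's actual procedures, so everything here is an assumption: the function names, the whitespace tokenization, the rates and span lengths, and the toy synonym table are all hypothetical.

```python
import random

def delete_words(tokens, rate, rng):
    """Word deletion: drop each token independently with probability `rate`."""
    kept = [t for t in tokens if rng.random() >= rate]
    return kept or tokens[:1]  # fall back to one token rather than an empty sentence

def delete_span(tokens, span_len, rng):
    """Span deletion: remove one contiguous run of `span_len` tokens."""
    if len(tokens) <= span_len:
        return list(tokens)
    start = rng.randrange(len(tokens) - span_len + 1)
    return tokens[:start] + tokens[start + span_len:]

def reorder_spans(tokens, span_len, rng):
    """Reordering: swap two non-overlapping spans of `span_len` tokens."""
    if len(tokens) < 2 * span_len:
        return list(tokens)
    i = rng.randrange(len(tokens) - 2 * span_len + 1)        # start of first span
    j = rng.randrange(i + span_len, len(tokens) - span_len + 1)  # start of second span
    return (tokens[:i] + tokens[j:j + span_len] + tokens[i + span_len:j]
            + tokens[i:i + span_len] + tokens[j + span_len:])

def substitute(tokens, synonyms, rate, rng):
    """Substitution: swap a token for a listed alternative with probability `rate`."""
    out = []
    for t in tokens:
        options = synonyms.get(t)
        if options and rng.random() < rate:
            out.append(rng.choice(options))
        else:
            out.append(t)
    return out

def make_views(sentence, synonyms, seed=0):
    """Return two independently augmented views of `sentence` (a positive pair)."""
    rng = random.Random(seed)
    tokens = sentence.split()  # assumption: whitespace tokenization
    ops = [
        lambda t: delete_words(t, 0.2, rng),    # hypothetical rate
        lambda t: delete_span(t, 3, rng),       # hypothetical span length
        lambda t: reorder_spans(t, 2, rng),     # hypothetical span length
        lambda t: substitute(t, synonyms, 0.5, rng),
    ]
    view_a = rng.choice(ops)(tokens)
    view_b = rng.choice(ops)(tokens)
    return " ".join(view_a), " ".join(view_b)

if __name__ == "__main__":
    toy_synonyms = {"quick": ["fast"], "lazy": ["idle"]}  # hypothetical table
    print(make_views("the quick brown fox jumps over the lazy dog", toy_synonyms))
```

In a contrastive setup, the two views of the same sentence would be treated as a positive pair and views of other sentences in the batch as negatives; the encoder and loss are not sketched here because the abstract does not describe them.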

Comments:	10 pages, 2 figures
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2012.15466 [cs.CL]
	(or arXiv:2012.15466v1 [cs.CL] for this version)

Submission history

From: Zhuofeng Wu
[v1] Thu, 31 Dec 2020 06:40:13 GMT (39kb)
