VarMAE: Pre-training of Variational Masked Autoencoder for Domain-adaptive Language Understanding

Hu, Dou; Hou, Xiaolong; Du, Xiyang; Zhou, Mengyuan; Jiang, Lianxin; Mo, Yang; Shi, Xiaofeng

(Submitted on 1 Nov 2022)

Abstract: Pre-trained language models have achieved promising performance on general benchmarks, but underperform when migrated to a specific domain. Recent works perform pre-training from scratch or continual pre-training on domain corpora. However, in many specific domains, the limited corpus can hardly support obtaining precise representations. To address this issue, we propose a novel Transformer-based language model named VarMAE for domain-adaptive language understanding. Under the masked autoencoding objective, we design a context uncertainty learning module to encode the token's context into a smooth latent distribution. The module can produce diverse and well-formed contextual representations. Experiments on science- and finance-domain NLU tasks demonstrate that VarMAE can be efficiently adapted to new domains with limited resources.

Comments:	11 pages, accepted by Findings of EMNLP 2022
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2211.00430 [cs.CL]
	(or arXiv:2211.00430v1 [cs.CL] for this version)

Submission history

From: Dou Hu
[v1] Tue, 1 Nov 2022 12:51:51 GMT (422kb,D)
