MASKER: Masked Keyword Regularization for Reliable Text Classification

Moon, Seung Jun; Mo, Sangwoo; Lee, Kimin; Lee, Jaeho; Shin, Jinwoo

(Submitted on 17 Dec 2020)

Abstract: Pre-trained language models have achieved state-of-the-art accuracies on various text classification tasks, e.g., sentiment analysis, natural language inference, and semantic textual similarity. However, the reliability of fine-tuned text classifiers is an often overlooked performance criterion. For instance, one may desire a model that can detect out-of-distribution (OOD) samples (drawn far from the training distribution) or be robust against domain shifts. We claim that one central obstacle to reliability is the over-reliance of the model on a limited number of keywords, instead of looking at the whole context. In particular, we find that (a) OOD samples often contain in-distribution keywords, while (b) cross-domain samples may not always contain keywords; over-relying on the keywords can be problematic in both cases. In light of this observation, we propose a simple yet effective fine-tuning method, coined masked keyword regularization (MASKER), that facilitates context-based prediction. MASKER regularizes the model to reconstruct the keywords from the rest of the words and to make low-confidence predictions without enough context. When applied to various pre-trained language models (e.g., BERT, RoBERTa, and ALBERT), we demonstrate that MASKER improves OOD detection and cross-domain generalization without degrading classification accuracy. Code is available at this https URL
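
The two regularizers named in the abstract (reconstructing masked keywords, and predicting with low confidence when the context is missing) can be illustrated with a minimal sketch. This is not the authors' released code. The function names, the choice of a KL divergence to the uniform distribution as the low-confidence penalty, and the weights lambda_rec and lambda_ent are all assumptions made for illustration, and the logits are plain Python lists rather than outputs of a real language model.

```python
import math


def log_softmax(logits):
    """Numerically stable log-probabilities for one logit vector."""
    m = max(logits)
    log_total = math.log(sum(math.exp(x - m) for x in logits))
    return [x - m - log_total for x in logits]


def cross_entropy(logits, target):
    """Negative log-probability assigned to the target index."""
    return -log_softmax(logits)[target]


def kl_uniform_to_pred(logits):
    """KL(U || p): zero when the prediction p is uniform, larger as p grows confident."""
    log_probs = log_softmax(logits)
    k = len(log_probs)
    log_u = math.log(1.0 / k)
    return sum((1.0 / k) * (log_u - lp) for lp in log_probs)


def masker_style_loss(cls_logits, label,
                      keyword_logits, keyword_ids,
                      context_masked_logits,
                      lambda_rec=1.0, lambda_ent=1.0):
    """Hypothetical combined objective (weights are illustrative assumptions).

    cls_logits:            class logits on the original input
    keyword_logits:        one vocabulary-sized logit vector per masked keyword
    keyword_ids:           the true token id for each masked keyword
    context_masked_logits: class logits when only keywords are left visible
    """
    # Standard classification loss on the unmodified input.
    loss_cls = cross_entropy(cls_logits, label)
    # Reconstruction term: recover each masked keyword from the remaining words.
    loss_rec = sum(
        cross_entropy(logits, token_id)
        for logits, token_id in zip(keyword_logits, keyword_ids)
    ) / max(len(keyword_ids), 1)
    # Low-confidence term: penalize confident predictions made without context.
    loss_ent = kl_uniform_to_pred(context_masked_logits)
    return loss_cls + lambda_rec * loss_rec + lambda_ent * loss_ent
```

In this sketch, a model that stays uniform on the context-masked input pays no low-confidence penalty, while one that remains confident from keywords alone is penalized in proportion to how far its prediction is from uniform.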

Comments:	AAAI 2021. First two authors contributed equally
Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL)
Cite as:	arXiv:2012.09392 [cs.LG]
	(or arXiv:2012.09392v1 [cs.LG] for this version)

Submission history

From: Sangwoo Mo
[v1] Thu, 17 Dec 2020 04:54:16 GMT (2994kb,D)