Fine-tuning BERT for Low-Resource Natural Language Understanding via Active Learning

Grießhaber, Daniel; Maucher, Johannes; Vu, Ngoc Thang

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2012

Change to browse by:

Computer Science > Computation and Language

Title: Fine-tuning BERT for Low-Resource Natural Language Understanding via Active Learning

Authors: Daniel Grießhaber, Johannes Maucher, Ngoc Thang Vu

(Submitted on 4 Dec 2020)

Abstract: Recently, leveraging pre-trained Transformer based language models in down stream, task specific models has advanced state of the art results in natural language understanding tasks. However, only a little research has explored the suitability of this approach in low resource settings with less than 1,000 training data points. In this work, we explore fine-tuning methods of BERT -- a pre-trained Transformer based language model -- by utilizing pool-based active learning to speed up training while keeping the cost of labeling new data constant. Our experimental results on the GLUE data set show an advantage in model performance by maximizing the approximate knowledge gain of the model when querying from the pool of unlabeled data. Finally, we demonstrate and analyze the benefits of freezing layers of the language model during fine-tuning to reduce the number of trainable parameters, making it more suitable for low-resource settings.

Comments:	COLING'2020
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2012.02462 [cs.CL]
	(or arXiv:2012.02462v1 [cs.CL] for this version)

Submission history

From: Daniel Grießhaber [view email]
[v1] Fri, 4 Dec 2020 08:34:39 GMT (193kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2012.02462

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Fine-tuning BERT for Low-Resource Natural Language Understanding via Active Learning

Submission history