Text Augmentation in a Multi-Task View

Wei, Jason; Huang, Chengyu; Xu, Shiqi; Vosoughi, Soroush

(Submitted on 14 Jan 2021)

Abstract: Traditional data augmentation aims to increase the coverage of the input distribution by generating augmented examples that strongly resemble original samples in an online fashion where augmented examples dominate training. In this paper, we propose an alternative perspective -- a multi-task view (MTV) of data augmentation -- in which the primary task trains on original examples and the auxiliary task trains on augmented examples. In MTV data augmentation, both original and augmented samples are weighted substantively during training, relaxing the constraint that augmented examples must resemble original data and thereby allowing us to apply stronger levels of augmentation. In empirical experiments using four common data augmentation techniques on three benchmark text classification datasets, we find that the MTV leads to higher and more robust performance improvements than traditional augmentation.
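The abstract does not state the training objective, so the following is only a minimal sketch of one way the multi-task view could be written down, not the authors' implementation. It assumes the combined objective is the mean loss on original examples plus a weighted mean loss on augmented examples. The names `mtv_loss`, `pooled_loss`, `aux_weight`, and the `random_deletion` augmenter are all hypothetical and chosen for illustration. `pooled_loss` stands in for traditional augmentation, where originals and augmented copies share one undifferentiated pool.

```python
import math
import random


def cross_entropy(probs, label):
    """Negative log-likelihood of the true label under predicted class probabilities."""
    return -math.log(probs[label])


def random_deletion(tokens, p, rng):
    """Illustrative augmentation: drop each token with probability p (higher p = stronger)."""
    kept = [tok for tok in tokens if rng.random() >= p]
    # Never return an empty sentence; fall back to one randomly chosen token.
    return kept if kept else [rng.choice(tokens)]


def pooled_loss(orig_losses, aug_losses):
    """Assumed single-task baseline: every example, original or augmented, counts equally."""
    all_losses = list(orig_losses) + list(aug_losses)
    return sum(all_losses) / len(all_losses)


def mtv_loss(orig_losses, aug_losses, aux_weight=1.0):
    """Assumed multi-task objective: primary mean loss plus weighted auxiliary mean loss."""
    primary = sum(orig_losses) / len(orig_losses)      # task on original examples
    auxiliary = sum(aug_losses) / len(aug_losses)      # task on augmented examples
    return primary + aux_weight * auxiliary
```

Under these assumptions, when one original example is pooled with four augmented copies, the original accounts for only a fifth of `pooled_loss`. In `mtv_loss` the original examples keep their own term regardless of how many augmented copies are generated, which is one reading of "both original and augmented samples are weighted substantively".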

Comments:	Accepted to EACL 2021
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2101.05469 [cs.CL]
	(or arXiv:2101.05469v1 [cs.CL] for this version)

Submission history

From: Jason Wei
[v1] Thu, 14 Jan 2021 05:59:23 GMT (7355kb,D)
