Understanding Self-Training for Gradual Domain Adaptation

Kumar, Ananya; Ma, Tengyu; Liang, Percy

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2002

Computer Science > Machine Learning

Title: Understanding Self-Training for Gradual Domain Adaptation

Authors: Ananya Kumar, Tengyu Ma, Percy Liang

(Submitted on 26 Feb 2020)

Abstract: Machine learning systems must adapt to data distributions that evolve over time, in applications ranging from sensor networks and self-driving car perception modules to brain-machine interfaces. We consider gradual domain adaptation, where the goal is to adapt an initial classifier trained on a source domain given only unlabeled data that shifts gradually in distribution towards a target domain. We prove the first non-vacuous upper bound on the error of self-training with gradual shifts, under settings where directly adapting to the target domain can result in unbounded error. The theoretical analysis leads to algorithmic insights, highlighting that regularization and label sharpening are essential even when we have infinite data, and suggesting that self-training works particularly well for shifts with small Wasserstein-infinity distance. Leveraging the gradual shift structure leads to higher accuracies on a rotating MNIST dataset and a realistic Portraits dataset.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2002.11361 [cs.LG]
	(or arXiv:2002.11361v1 [cs.LG] for this version)

Submission history

From: Ananya Kumar [view email]
[v1] Wed, 26 Feb 2020 08:59:40 GMT (174kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2002.11361

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Understanding Self-Training for Gradual Domain Adaptation

Submission history