Dynamic Distillation Network for Cross-Domain Few-Shot Recognition with Unlabeled Data

Islam, Ashraful; Chen, Chun-Fu; Panda, Rameswar; Karlinsky, Leonid; Feris, Rogerio; Radke, Richard J.

(Submitted on 14 Jun 2021 (v1), last revised 1 Nov 2021 (this version, v3))

Abstract: Most existing works in few-shot learning rely on meta-learning the network on a large base dataset that is typically from the same domain as the target dataset. We tackle the problem of cross-domain few-shot learning, where there is a large shift between the base and target domains. The problem of cross-domain few-shot recognition with unlabeled target data is largely unaddressed in the literature. STARTUP was the first method to tackle this problem using self-training. However, it uses a fixed teacher pretrained on a labeled base dataset to create soft labels for the unlabeled target samples. As the base dataset and the unlabeled dataset are from different domains, projecting the target images into the class domain of the base dataset with a fixed pretrained model might be sub-optimal. We propose a simple dynamic distillation-based approach to exploit unlabeled images from the novel/base dataset. We impose consistency regularization by calculating predictions from a teacher network on weakly augmented versions of the unlabeled images and matching them with predictions from a student network on strongly augmented versions of the same images. The parameters of the teacher network are updated as an exponential moving average of the parameters of the student network. We show that the proposed network learns representations that can be easily adapted to the target domain even though it has not been trained with target-specific classes during the pretraining phase. Our model outperforms the current state-of-the-art method by 4.4% for 1-shot and 3.6% for 5-shot classification on the BSCD-FSL benchmark, and also shows competitive performance on traditional in-domain few-shot learning tasks.
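The two mechanisms named in the abstract, consistency between teacher and student predictions and the exponential-moving-average teacher update, can be illustrated with a minimal sketch. This is not the authors' implementation. The function names, the use of cross-entropy as the matching loss, and the momentum value are all assumptions made for illustration, and the logits stand in for network outputs on weakly and strongly augmented views of the same unlabeled image.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of floats."""
    m = max(logits)
    log_norm = m + math.log(sum(math.exp(z - m) for z in logits))
    return [z - log_norm for z in logits]


def consistency_loss(teacher_logits_weak, student_logits_strong):
    """Cross-entropy between teacher soft labels (weak view) and
    student predictions (strong view). Cross-entropy is an assumed
    choice; the abstract only says the predictions are matched."""
    soft_labels = [math.exp(lp) for lp in log_softmax(teacher_logits_weak)]
    student_log_probs = log_softmax(student_logits_strong)
    return -sum(t * lp for t, lp in zip(soft_labels, student_log_probs))


def ema_update(teacher_params, student_params, momentum=0.99):
    """Move each teacher parameter toward the student parameter.
    The momentum value 0.99 is a placeholder, not taken from the paper."""
    return [
        momentum * t + (1.0 - momentum) * s
        for t, s in zip(teacher_params, student_params)
    ]


if __name__ == "__main__":
    # Hypothetical logits for one unlabeled image and three classes.
    teacher_out = [2.0, 0.5, -1.0]   # teacher on the weakly augmented view
    student_out = [1.5, 0.7, -0.5]   # student on the strongly augmented view
    print("consistency loss:", consistency_loss(teacher_out, student_out))

    # Toy parameter vectors; the teacher drifts slowly toward the student.
    teacher = ema_update([1.0, 2.0], [0.0, 4.0], momentum=0.9)
    print("updated teacher:", teacher)
```

In this sketch the teacher receives no gradient; it changes only through `ema_update`, which is what makes the soft labels dynamic rather than fixed as in STARTUP.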

Comments:	Accepted to NeurIPS 2021
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2106.07807 [cs.CV]
	(or arXiv:2106.07807v3 [cs.CV] for this version)

Submission history

From: Ashraful Islam [view email]
[v1] Mon, 14 Jun 2021 23:44:34 GMT (2955kb,D)
[v2] Tue, 26 Oct 2021 15:32:54 GMT (2956kb,D)
[v3] Mon, 1 Nov 2021 04:28:04 GMT (2957kb,D)
