One Hyper-Initializer for All Network Architectures in Medical Image Analysis

Shang, Fangxin; Yang, Yehui; Yang, Dalu; Wu, Junde; Wang, Xiaorong; Xu, Yanwu

(Submitted on 8 Jun 2022)

Abstract: Pre-training is essential to deep learning model performance, especially in medical image analysis tasks where limited training data are available. However, existing pre-training methods are inflexible, as the pre-trained weights of one model cannot be reused by other network architectures. In this paper, we propose an architecture-irrelevant hyper-initializer, which can initialize any given network architecture well after being pre-trained only once. The proposed initializer is a hypernetwork that takes a downstream architecture as an input graph and outputs the initialization parameters of the respective architecture. We show the effectiveness and efficiency of the hyper-initializer through extensive experimental results on multiple medical imaging modalities, especially in data-limited fields. Moreover, we prove that the proposed algorithm can be reused as a favorable plug-and-play initializer for any downstream architecture and task (both classification and segmentation) of the same modality.
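
The abstract's core idea, one network that maps an architecture graph to initial weights, can be illustrated with a toy sketch. This is not the authors' implementation: the node features, the single mean-aggregation step, the tiled linear decoder, and every name below (`make_decoder`, `node_features`, `message_pass`, `hyper_init`, the `arch` dictionary layout) are assumptions made purely for illustration, and the decoder here is random rather than pre-trained.

```python
import math
import random


def make_decoder(rows=8, dim=3, seed=0):
    """Stand-in for the hypernetwork's shared weights (random here, not pre-trained)."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(rows)]


def node_features(shape):
    """Hypothetical per-layer features: log fan-out, log fan-in, and a bias term."""
    fan_out, fan_in = shape
    return [math.log(fan_out), math.log(fan_in), 1.0]


def message_pass(feats, edges):
    """One round of mean aggregation over incoming edges (a toy graph step)."""
    out = []
    for v, own in enumerate(feats):
        preds = [u for (u, w) in edges if w == v]
        agg = [0.0] * len(own)
        for u in preds:
            for k, x in enumerate(feats[u]):
                agg[k] += x / len(preds)
        out.append([math.tanh(a + b) for a, b in zip(own, agg)])
    return out


def hyper_init(arch, decoder):
    """Return {layer name: flat weight list} for any architecture graph."""
    feats = [node_features(layer["shape"]) for layer in arch["layers"]]
    embeds = message_pass(feats, arch["edges"])
    params = {}
    for layer, emb in zip(arch["layers"], embeds):
        fan_out, fan_in = layer["shape"]
        scale = 1.0 / math.sqrt(fan_in)  # keep weight magnitude tied to fan-in
        flat = []
        for i in range(fan_out * fan_in):
            row = decoder[i % len(decoder)]  # tile decoder rows to fill the layer
            flat.append(scale * sum(r * e for r, e in zip(row, emb)))
        params[layer["name"]] = flat
    return params


# The same decoder initializes two different (hypothetical) architectures.
decoder = make_decoder()
small = {
    "layers": [{"name": "fc1", "shape": (4, 8)}, {"name": "fc2", "shape": (2, 4)}],
    "edges": [(0, 1)],
}
deep = {
    "layers": [
        {"name": "fc1", "shape": (6, 8)},
        {"name": "fc2", "shape": (6, 6)},
        {"name": "fc3", "shape": (2, 6)},
    ],
    "edges": [(0, 1), (1, 2), (0, 2)],  # includes a skip connection
}
small_params = hyper_init(small, decoder)
deep_params = hyper_init(deep, decoder)
```

The point of the sketch is only the interface: the decoder is built once and then reused unchanged for both `small` and `deep`, mirroring the "pre-trained only once" property claimed in the abstract.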

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2206.03661 [cs.CV]
	(or arXiv:2206.03661v1 [cs.CV] for this version)

Submission history

From: Yehui Yang [view email]
[v1] Wed, 8 Jun 2022 03:18:55 GMT (4422kb,D)
