SimCVD: Simple Contrastive Voxel-Wise Representation Distillation for Semi-Supervised Medical Image Segmentation

You, Chenyu; Zhou, Yuan; Zhao, Ruihan; Staib, Lawrence; Duncan, James S.

Full-text links:

Download:

Current browse context:

cs.CV

< prev | next >

new | recent | 2108

Computer Science > Computer Vision and Pattern Recognition

Title: SimCVD: Simple Contrastive Voxel-Wise Representation Distillation for Semi-Supervised Medical Image Segmentation

Authors: Chenyu You, Yuan Zhou, Ruihan Zhao, Lawrence Staib, James S. Duncan

(Submitted on 13 Aug 2021 (v1), last revised 21 Mar 2022 (this version, v4))

Abstract: Automated segmentation in medical image analysis is a challenging task that requires a large amount of manually labeled data. However, most existing learning-based approaches usually suffer from limited manually annotated medical data, which poses a major practical problem for accurate and robust medical image segmentation. In addition, most existing semi-supervised approaches are usually not robust compared with the supervised counterparts, and also lack explicit modeling of geometric structure and semantic information, both of which limit the segmentation accuracy. In this work, we present SimCVD, a simple contrastive distillation framework that significantly advances state-of-the-art voxel-wise representation learning. We first describe an unsupervised training strategy, which takes two views of an input volume and predicts their signed distance maps of object boundaries in a contrastive objective, with only two independent dropout as mask. This simple approach works surprisingly well, performing on the same level as previous fully supervised methods with much less labeled data. We hypothesize that dropout can be viewed as a minimal form of data augmentation and makes the network robust to representation collapse. Then, we propose to perform structural distillation by distilling pair-wise similarities. We evaluate SimCVD on two popular datasets: the Left Atrial Segmentation Challenge (LA) and the NIH pancreas CT dataset. The results on the LA dataset demonstrate that, in two types of labeled ratios (i.e., 20% and 10%), SimCVD achieves an average Dice score of 90.85% and 89.03% respectively, a 0.91% and 2.22% improvement compared to previous best results. Our method can be trained in an end-to-end fashion, showing the promise of utilizing SimCVD as a general framework for downstream tasks, such as medical image synthesis, enhancement, and registration.

Comments:	IEEE Transactions on Medical Imaging (IEEE-TMI) 2022
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2108.06227 [cs.CV]
	(or arXiv:2108.06227v4 [cs.CV] for this version)

Submission history

From: Chenyu You [view email]
[v1] Fri, 13 Aug 2021 13:17:58 GMT (863kb,D)
[v2] Mon, 16 Aug 2021 15:12:28 GMT (845kb,D)
[v3] Mon, 17 Jan 2022 17:08:26 GMT (892kb,D)
[v4] Mon, 21 Mar 2022 16:43:52 GMT (901kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2108.06227

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: SimCVD: Simple Contrastive Voxel-Wise Representation Distillation for Semi-Supervised Medical Image Segmentation

Submission history