DoubleU-Net: A Deep Convolutional Neural Network for Medical Image Segmentation

Jha, Debesh; Riegler, Michael A.; Johansen, Dag; Halvorsen, Pål; Johansen, Håvard D.

Full-text links:

Download:

Current browse context:

eess.IV

< prev | next >

new | recent | 2006

Electrical Engineering and Systems Science > Image and Video Processing

Title: DoubleU-Net: A Deep Convolutional Neural Network for Medical Image Segmentation

Authors: Debesh Jha, Michael A. Riegler, Dag Johansen, Pål Halvorsen, Håvard D. Johansen

(Submitted on 8 Jun 2020 (v1), last revised 27 Jun 2020 (this version, v2))

Abstract: Semantic image segmentation is the process of labeling each pixel of an image with its corresponding class. An encoder-decoder based approach, like U-Net and its variants, is a popular strategy for solving medical image segmentation tasks. To improve the performance of U-Net on various segmentation tasks, we propose a novel architecture called DoubleU-Net, which is a combination of two U-Net architectures stacked on top of each other. The first U-Net uses a pre-trained VGG-19 as the encoder, which has already learned features from ImageNet and can be transferred to another task easily. To capture more semantic information efficiently, we added another U-Net at the bottom. We also adopt Atrous Spatial Pyramid Pooling (ASPP) to capture contextual information within the network. We have evaluated DoubleU-Net using four medical segmentation datasets, covering various imaging modalities such as colonoscopy, dermoscopy, and microscopy. Experiments on the MICCAI 2015 segmentation challenge, the CVC-ClinicDB, the 2018 Data Science Bowl challenge, and the Lesion boundary segmentation datasets demonstrate that the DoubleU-Net outperforms U-Net and the baseline models. Moreover, DoubleU-Net produces more accurate segmentation masks, especially in the case of the CVC-ClinicDB and MICCAI 2015 segmentation challenge datasets, which have challenging images such as smaller and flat polyps. These results show the improvement over the existing U-Net model. The encouraging results, produced on various medical image segmentation datasets, show that DoubleU-Net can be used as a strong baseline for both medical image segmentation and cross-dataset evaluation testing to measure the generalizability of Deep Learning (DL) models.

Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2006.04868 [eess.IV]
	(or arXiv:2006.04868v2 [eess.IV] for this version)

Submission history

From: Debesh Jha [view email]
[v1] Mon, 8 Jun 2020 18:38:24 GMT (4083kb,D)
[v2] Sat, 27 Jun 2020 15:40:40 GMT (4083kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> eess > arXiv:2006.04868

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Electrical Engineering and Systems Science > Image and Video Processing

Title: DoubleU-Net: A Deep Convolutional Neural Network for Medical Image Segmentation

Submission history