SegTransVAE: Hybrid CNN -- Transformer with Regularization for medical image segmentation

Pham, Quan-Dung; Nguyen-Truong, Hai; Phuong, Nam Nguyen; Nguyen, Khoa N. A.

Authors: Quan-Dung Pham (1), Hai Nguyen-Truong (1, 2 and 3), Nam Nguyen Phuong (1), Khoa N. A. Nguyen (1, 2 and 3) ((1) VinBrain JSC., Vietnam, (2) University of Science, Ho Chi Minh City, Vietnam, (3) Vietnam National University, Ho Chi Minh City, Vietnam)

(Submitted on 21 Jan 2022 (this version), latest version 30 Sep 2023 (v4))

Abstract: Current deep learning methods for medical image segmentation are limited in learning either global semantic information or local contextual information. To address these limitations, this paper proposes a novel network named SegTransVAE. SegTransVAE is built on an encoder-decoder architecture that uses a transformer and adds a variational autoencoder (VAE) branch to the network, so that the input images are reconstructed jointly with segmentation. To the best of our knowledge, this is the first method combining the success of CNN, transformer, and VAE. Evaluation on various recently introduced datasets shows that SegTransVAE outperforms previous methods in Dice Score and $95\%$ Hausdorff Distance while having inference time comparable to that of a simple CNN-based network. The source code is available at: this https URL
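
The two evaluation metrics named in the abstract can be illustrated with a minimal sketch. This is not the authors' evaluation code. It assumes each mask is given as a collection of foreground voxel coordinates, and it uses one common definition of the 95% Hausdorff Distance (the larger of the two directed 95th-percentile nearest-neighbour distances); other definitions, such as restricting the computation to surface voxels, are also in use. The function names `dice_score` and `hd95` are hypothetical.

```python
import math


def dice_score(pred, target):
    """Dice = 2|A ∩ B| / (|A| + |B|) for two sets of foreground coordinates."""
    pred, target = set(pred), set(target)
    if not pred and not target:
        # Assumed convention: two empty masks count as perfect agreement.
        return 1.0
    return 2.0 * len(pred & target) / (len(pred) + len(target))


def _percentile(values, q):
    """Linear-interpolation percentile (q in [0, 100]) of a non-empty list."""
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q / 100.0
    lo, hi = math.floor(pos), math.ceil(pos)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (pos - lo)


def _directed_distances(src, dst):
    """For each point in src, the Euclidean distance to its nearest point in dst."""
    return [min(math.dist(p, q) for q in dst) for p in src]


def hd95(pred, target):
    """Symmetric 95th-percentile Hausdorff distance between two point sets.

    Brute force, O(len(pred) * len(target)); adequate for small examples only.
    """
    pred, target = list(pred), list(target)
    if not pred or not target:
        raise ValueError("hd95 is undefined when either mask is empty")
    forward = _percentile(_directed_distances(pred, target), 95)
    backward = _percentile(_directed_distances(target, pred), 95)
    return max(forward, backward)
```

A higher Dice Score indicates better overlap, while a lower Hausdorff Distance indicates closer boundaries; taking the 95th percentile rather than the maximum makes the distance less sensitive to a few outlier voxels.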

Comments:	Accepted for publication in 2020 IEEE ISBI: 4 pages, 3 figures
Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2201.08582 [eess.IV]
	(or arXiv:2201.08582v1 [eess.IV] for this version)

Submission history

From: Quan Dung Pham
[v1] Fri, 21 Jan 2022 08:02:55 GMT (1082kb,D)
[v2] Wed, 26 Jan 2022 15:36:21 GMT (1082kb,D)
[v3] Fri, 4 Mar 2022 03:48:21 GMT (0kb,I)
[v4] Sat, 30 Sep 2023 07:01:56 GMT (1150kb,D)
