RPLHR-CT Dataset and Transformer Baseline for Volumetric Super-Resolution from CT Scans

Yu, Pengxin; Zhang, Haoyue; Kang, Han; Tang, Wen; Arnold, Corey W.; Zhang, Rongguo

doi:10.1007/978-3-031-16446-0_33

Full-text links:

Download:

Current browse context:

eess.IV

< prev | next >

new | recent | 2206

Electrical Engineering and Systems Science > Image and Video Processing

Title: RPLHR-CT Dataset and Transformer Baseline for Volumetric Super-Resolution from CT Scans

Authors: Pengxin Yu, Haoyue Zhang, Han Kang, Wen Tang, Corey W. Arnold, Rongguo Zhang

(Submitted on 13 Jun 2022)

Abstract: In clinical practice, anisotropic volumetric medical images with low through-plane resolution are commonly used due to short acquisition time and lower storage cost. Nevertheless, the coarse resolution may lead to difficulties in medical diagnosis by either physicians or computer-aided diagnosis algorithms. Deep learning-based volumetric super-resolution (SR) methods are feasible ways to improve resolution, with convolutional neural networks (CNN) at their core. Despite recent progress, these methods are limited by inherent properties of convolution operators, which ignore content relevance and cannot effectively model long-range dependencies. In addition, most of the existing methods use pseudo-paired volumes for training and evaluation, where pseudo low-resolution (LR) volumes are generated by a simple degradation of their high-resolution (HR) counterparts. However, the domain gap between pseudo- and real-LR volumes leads to the poor performance of these methods in practice. In this paper, we build the first public real-paired dataset RPLHR-CT as a benchmark for volumetric SR, and provide baseline results by re-implementing four state-of-the-art CNN-based methods. Considering the inherent shortcoming of CNN, we also propose a transformer volumetric super-resolution network (TVSRN) based on attention mechanisms, dispensing with convolutions entirely. This is the first research to use a pure transformer for CT volumetric SR. The experimental results show that TVSRN significantly outperforms all baselines on both PSNR and SSIM. Moreover, the TVSRN method achieves a better trade-off between the image quality, the number of parameters, and the running time. Data and code are available at this https URL

Comments:	Accepted MICCAI 2022
Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
DOI:	10.1007/978-3-031-16446-0_33
Cite as:	arXiv:2206.06253 [eess.IV]
	(or arXiv:2206.06253v1 [eess.IV] for this version)

Submission history

From: Haoyue Zhang [view email]
[v1] Mon, 13 Jun 2022 15:35:59 GMT (5696kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> eess > arXiv:2206.06253

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Electrical Engineering and Systems Science > Image and Video Processing

Title: RPLHR-CT Dataset and Transformer Baseline for Volumetric Super-Resolution from CT Scans

Submission history