Fourier Image Transformer

Buchholz, Tim-Oliver; Jug, Florian

Computer Science > Computer Vision and Pattern Recognition

Title: Fourier Image Transformer

Authors: Tim-Oliver Buchholz, Florian Jug

(Submitted on 6 Apr 2021 (v1), last revised 19 Apr 2022 (this version, v3))

Abstract: Transformer architectures show spectacular performance on NLP tasks and have recently also been used for tasks such as image completion or image classification. Here we propose to use a sequential image representation, where each prefix of the complete sequence describes the whole image at reduced resolution. Using such Fourier Domain Encodings (FDEs), an auto-regressive image completion task is equivalent to predicting a higher resolution output given a low-resolution input. Additionally, we show that an encoder-decoder setup can be used to query arbitrary Fourier coefficients given a set of Fourier domain observations. We demonstrate the practicality of this approach in the context of computed tomography (CT) image reconstruction. In summary, we show that Fourier Image Transformer (FIT) can be used to solve relevant image analysis tasks in Fourier space, a domain inherently inaccessible to convolutional architectures.
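
The prefix property described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' FDE implementation: it assumes that coefficients are ordered by their distance from the zero-frequency term, and the function name `fourier_prefix_reconstruction` and its parameters are hypothetical. Keeping only the first `prefix_len` coefficients of that ordering and inverting the transform yields a low-pass, reduced-resolution view of the whole image.

```python
import numpy as np


def fourier_prefix_reconstruction(image, prefix_len):
    """Rebuild a 2D image from its `prefix_len` lowest-frequency coefficients.

    Illustrative only: the ordering (by radial distance from the DC term)
    is an assumption, not the encoding used in the paper.
    """
    h, w = image.shape
    # Centre the spectrum so the zero-frequency (DC) term sits at (h//2, w//2).
    spectrum = np.fft.fftshift(np.fft.fft2(image))

    # Distance of every coefficient from the DC term.
    yy, xx = np.indices((h, w))
    radius = np.hypot(yy - h // 2, xx - w // 2)

    # Sequence order: low frequencies first (stable sort keeps ties deterministic).
    order = np.argsort(radius.ravel(), kind="stable")

    # Keep the prefix of the sequence and zero out everything after it.
    keep = np.zeros(h * w, dtype=bool)
    keep[order[:prefix_len]] = True
    truncated = np.where(keep.reshape(h, w), spectrum, 0)

    # Taking the real part is equivalent to symmetrising a prefix that cuts
    # through a conjugate pair of coefficients.
    return np.fft.ifft2(np.fft.ifftshift(truncated)).real


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    img = rng.random((32, 32))
    for n in (1, 64, 256, 1024):
        approx = fourier_prefix_reconstruction(img, n)
        print(n, float(np.mean((approx - img) ** 2)))
```

Under this ordering, a longer prefix never increases the reconstruction error, and the full sequence recovers the image exactly. Predicting the next coefficients of the sequence therefore amounts to predicting a higher-resolution image from a lower-resolution one.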

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
Cite as:	arXiv:2104.02555 [cs.CV]
	(or arXiv:2104.02555v3 [cs.CV] for this version)

Submission history

From: Florian Jug
[v1] Tue, 6 Apr 2021 14:48:57 GMT (3268kb,D)
[v2] Mon, 3 May 2021 10:29:54 GMT (3268kb,D)
[v3] Tue, 19 Apr 2022 15:45:32 GMT (3526kb,D)
