We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

eess.IV

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Electrical Engineering and Systems Science > Image and Video Processing

Title: Fourier Spectrum Discrepancies in Deep Network Generated Images

Abstract: Advancements in deep generative models such as generative adversarial networks and variational autoencoders have resulted in the ability to generate realistic images that are visually indistinguishable from real images, which raises concerns about their potential malicious usage. In this paper, we present an analysis of the high-frequency Fourier modes of real and deep network generated images and show that deep network generated images share an observable, systematic shortcoming in replicating the attributes of these high-frequency modes. Using this, we propose a detection method based on the frequency spectrum of the images which is able to achieve an accuracy of up to 99.2% in classifying real and deep network generated images from various GAN and VAE architectures on a dataset of 5000 images with as few as 8 training examples. Furthermore, we show the impact of image transformations such as compression, cropping, and resolution reduction on the classification accuracy and suggest a method for modifying the high-frequency attributes of deep network generated images to mimic real images.
Comments: 11 pages, 7 figures
Subjects: Image and Video Processing (eess.IV); Machine Learning (cs.LG); Machine Learning (stat.ML)
Journal reference: Neural Information Processing Systems 33 (2020) 3022-3032
Cite as: arXiv:1911.06465 [eess.IV]
  (or arXiv:1911.06465v3 [eess.IV] for this version)

Submission history

From: Tarik Dzanic [view email]
[v1] Fri, 15 Nov 2019 03:55:12 GMT (4808kb,D)
[v2] Sat, 6 Jun 2020 18:57:52 GMT (6399kb,D)
[v3] Thu, 22 Oct 2020 17:29:32 GMT (6401kb,D)

Link back to: arXiv, form interface, contact.