Distortion Robust Image Classification using Deep Convolutional Neural Network with Discrete Cosine Transform

Hossain, Md Tahmid; Teng, Shyh Wei; Zhang, Dengsheng; Lim, Suryani; Lu, Guojun

doi:10.1109/ICIP.2019.8803787

Full-text links:

Download:

Current browse context:

cs.CV

< prev | next >

new | recent | 1811

Change to browse by:

Computer Science > Computer Vision and Pattern Recognition

Title: Distortion Robust Image Classification using Deep Convolutional Neural Network with Discrete Cosine Transform

Authors: Md Tahmid Hossain, Shyh Wei Teng, Dengsheng Zhang, Suryani Lim, Guojun Lu

(Submitted on 14 Nov 2018 (v1), last revised 6 Aug 2020 (this version, v4))

Abstract: Convolutional Neural Network is good at image classification. However, it is found to be vulnerable to image quality degradation. Even a small amount of distortion such as noise or blur can severely hamper the performance of these CNN architectures. Most of the work in the literature strives to mitigate this problem simply by fine-tuning a pre-trained CNN on mutually exclusive or a union set of distorted training data. This iterative fine-tuning process with all known types of distortion is exhaustive and the network struggles to handle unseen distortions. In this work, we propose distortion robust DCT-Net, a Discrete Cosine Transform based module integrated into a deep network which is built on top of VGG16. Unlike other works in the literature, DCT-Net is "blind" to the distortion type and level in an image both during training and testing. As a part of the training process, the proposed DCT module discards input information which mostly represents the contribution of high frequencies. The DCT-Net is trained "blindly" only once and applied in generic situation without further retraining. We also extend the idea of traditional dropout and present a training adaptive version of the same. We evaluate our proposed method against Gaussian blur, motion blur, salt and pepper noise, Gaussian noise and speckle noise added to CIFAR-10/100 and ImageNet test sets. Experimental results demonstrate that once trained, DCT-Net not only generalizes well to a variety of unseen image distortions but also outperforms other methods in the literature.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
DOI:	10.1109/ICIP.2019.8803787
Cite as:	arXiv:1811.05819 [cs.CV]
	(or arXiv:1811.05819v4 [cs.CV] for this version)

Submission history

From: Md Tahmid Hossain [view email]
[v1] Wed, 14 Nov 2018 14:52:06 GMT (4234kb)
[v2] Mon, 19 Nov 2018 11:48:11 GMT (4740kb)
[v3] Thu, 23 Jul 2020 03:07:57 GMT (4887kb)
[v4] Thu, 6 Aug 2020 09:32:41 GMT (6798kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1811.05819

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: Distortion Robust Image Classification using Deep Convolutional Neural Network with Discrete Cosine Transform

Submission history