Privacy-Preserving Image Classification Using Vision Transformer

Qi, Zheng; MaungMaung, AprilPyone; Kinoshita, Yuma; Kiya, Hitoshi

Full-text links:

Download:

Current browse context:

cs.CV

< prev | next >

new | recent | 2205

Change to browse by:

Computer Science > Computer Vision and Pattern Recognition

Title: Privacy-Preserving Image Classification Using Vision Transformer

Authors: Zheng Qi, AprilPyone MaungMaung, Yuma Kinoshita, Hitoshi Kiya

(Submitted on 24 May 2022)

Abstract: In this paper, we propose a privacy-preserving image classification method that is based on the combined use of encrypted images and the vision transformer (ViT). The proposed method allows us not only to apply images without visual information to ViT models for both training and testing but to also maintain a high classification accuracy. ViT utilizes patch embedding and position embedding for image patches, so this architecture is shown to reduce the influence of block-wise image transformation. In an experiment, the proposed method for privacy-preserving image classification is demonstrated to outperform state-of-the-art methods in terms of classification accuracy and robustness against various attacks.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2205.12041 [cs.CV]
	(or arXiv:2205.12041v1 [cs.CV] for this version)

Submission history

From: Zheng Qi [view email]
[v1] Tue, 24 May 2022 12:51:48 GMT (5074kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2205.12041

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: Privacy-Preserving Image Classification Using Vision Transformer

Submission history