We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:


Current browse context:


Change to browse by:


References & Citations

DBLP - CS Bibliography


(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Computer Vision and Pattern Recognition

Title: Privacy-Preserving Image Classification Using Vision Transformer

Abstract: In this paper, we propose a privacy-preserving image classification method that is based on the combined use of encrypted images and the vision transformer (ViT). The proposed method allows us not only to apply images without visual information to ViT models for both training and testing but to also maintain a high classification accuracy. ViT utilizes patch embedding and position embedding for image patches, so this architecture is shown to reduce the influence of block-wise image transformation. In an experiment, the proposed method for privacy-preserving image classification is demonstrated to outperform state-of-the-art methods in terms of classification accuracy and robustness against various attacks.
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2205.12041 [cs.CV]
  (or arXiv:2205.12041v1 [cs.CV] for this version)

Submission history

From: Zheng Qi [view email]
[v1] Tue, 24 May 2022 12:51:48 GMT (5074kb,D)

Link back to: arXiv, form interface, contact.