We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CV

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computer Vision and Pattern Recognition

Title: VT-ADL: A Vision Transformer Network for Image Anomaly Detection and Localization

Abstract: We present a transformer-based image anomaly detection and localization network. Our proposed model is a combination of a reconstruction-based approach and patch embedding. The use of transformer networks helps to preserve the spatial information of the embedded patches, which are later processed by a Gaussian mixture density network to localize the anomalous areas. In addition, we also publish BTAD, a real-world industrial anomaly dataset. Our results are compared with other state-of-the-art algorithms using publicly available datasets like MNIST and MVTec.
Comments: 6 Pages, 4 images, conference published paper
Subjects: Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Journal reference: IEEE 30th International Symposium on Industrial Electronics (ISIE), 2021
DOI: 10.1109/ISIE45552.2021.9576231
Report number: KD-003638
Cite as: arXiv:2104.10036 [cs.CV]
  (or arXiv:2104.10036v1 [cs.CV] for this version)

Submission history

From: Pankaj Mishra [view email]
[v1] Tue, 20 Apr 2021 15:12:30 GMT (10304kb,D)

Link back to: arXiv, form interface, contact.