Title: Industrial Scene Text Detection with Refined Feature-attentive Network

Abstract: Detecting the marking characters on industrial metal parts remains challenging because metal part images exhibit low visual contrast, uneven illumination, corroded character structures, and cluttered backgrounds. As a result, the bounding boxes generated by most existing methods localize low-contrast text areas inaccurately. In this paper, we propose a refined feature-attentive network (RFN) to address this inaccurate localization problem. Specifically, we design a parallel feature integration mechanism that constructs an adaptive feature representation from multi-resolution features, enhancing the perception of multi-scale text at each scale-specific level and yielding a high-quality attention map. An attentive refinement network then uses this attention map to rectify the location deviation of candidate boxes. In addition, a re-scoring mechanism selects the text boxes with the best rectified locations. Moreover, we construct two industrial scene text datasets containing a total of 102156 images and 1948809 text instances that cover various character structures and metal parts. Extensive experiments on our dataset and four public datasets demonstrate that the proposed method achieves state-of-the-art performance.
Subjects: Computer Vision and Pattern Recognition (cs.CV)
DOI: 10.1109/TCSVT.2022.3156390
Cite as: arXiv:2110.12663 [cs.CV]
  (or arXiv:2110.12663v2 [cs.CV] for this version)

Submission history

From: Tongkun Guan
[v1] Mon, 25 Oct 2021 06:23:44 GMT (6314kb,D)
[v2] Tue, 29 Mar 2022 10:55:56 GMT (10480kb,D)
