Context-Free TextSpotter for Real-Time and Mobile End-to-End Text Detection and Recognition

Yoshihashi, Ryota; Tanaka, Tomohiro; Doi, Kenji; Fujino, Takumi; Yamashita, Naoaki

(Submitted on 10 Jun 2021)

Abstract: When deploying scene-text spotting systems on mobile platforms, lightweight models with low computational cost are preferable. In principle, end-to-end (E2E) text spotting suits this purpose because it performs text detection and recognition in a single model. However, current state-of-the-art E2E methods rely on heavy feature extractors, recurrent sequence modeling, and complex shape aligners in pursuit of accuracy, so their computational cost remains high. We explore the opposite direction: how far can we go without bells and whistles in E2E text spotting? To this end, we propose Context-Free TextSpotter, a text-spotting method that consists of simple convolutions and a few post-processing steps. Experiments on standard benchmarks show that Context-Free TextSpotter achieves real-time text spotting on a GPU with only three million parameters, making it the smallest and fastest among existing deep text spotters, at the cost of an acceptable degradation in transcription quality compared with heavier models. Further, we demonstrate that our text spotter can run on a smartphone with affordable latency, which is valuable for building stand-alone OCR applications.

Comments:	To appear in ICDAR2021
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2106.05611 [cs.CV]
	(or arXiv:2106.05611v1 [cs.CV] for this version)

Submission history

From: Ryota Yoshihashi
[v1] Thu, 10 Jun 2021 09:32:52 GMT (29995kb,D)
