We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CV

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computer Vision and Pattern Recognition

Title: A CNN Based Scene Chinese Text Recognition Algorithm With Synthetic Data Engine

Abstract: Scene text recognition plays an important role in many computer vision applications. The small size of available public available scene text datasets is the main challenge when training a text recognition CNN model. In this paper, we propose a CNN based Chinese text recognition algorithm. To enlarge the dataset for training the CNN model, we design a synthetic data engine for Chinese scene character generation, which generates representative character images according to the fonts use frequency of Chinese texts. As the Chinese text is more complex, the English text recognition CNN architecture is modified for Chinese text. To ensure the small size nature character dataset and the large size artificial character dataset are comparable in training, the CNN model are trained progressively. The proposed Chinese text recognition algorithm is evaluated with two Chinese text datasets. The algorithm achieves better recognize accuracy compared to the baseline methods.
Comments: 2 pages, DAS 2016 short paper
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:1604.01891 [cs.CV]
  (or arXiv:1604.01891v1 [cs.CV] for this version)

Submission history

From: Xiaohang Ren [view email]
[v1] Thu, 7 Apr 2016 07:08:25 GMT (82kb,D)

Link back to: arXiv, form interface, contact.