We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CV

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computer Vision and Pattern Recognition

Title: Improving Long Handwritten Text Line Recognition with Convolutional Multi-way Associative Memory

Abstract: Convolutional Recurrent Neural Networks (CRNNs) excel at scene text recognition. Unfortunately, they are likely to suffer from vanishing/exploding gradient problems when processing long text images, which are commonly found in scanned documents. This poses a major challenge to goal of completely solving Optical Character Recognition (OCR) problem. Inspired by recently proposed memory-augmented neural networks (MANNs) for long-term sequential modeling, we present a new architecture dubbed Convolutional Multi-way Associative Memory (CMAM) to tackle the limitation of current CRNNs. By leveraging recent memory accessing mechanisms in MANNs, our architecture demonstrates superior performance against other CRNN counterparts in three real-world long text OCR datasets.
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:1911.01577 [cs.CV]
  (or arXiv:1911.01577v2 [cs.CV] for this version)

Submission history

From: Duc Nguyen [view email]
[v1] Tue, 5 Nov 2019 02:42:09 GMT (409kb,D)
[v2] Wed, 22 Jan 2020 06:46:13 GMT (441kb,D)

Link back to: arXiv, form interface, contact.