Title: Learning Deep Representations for Word Spotting Under Weak Supervision

Abstract: Convolutional Neural Networks (CNNs) have made their mark in various fields of computer vision in recent years. They have achieved state-of-the-art performance in document analysis as well. However, CNNs require a large amount of annotated training data and, hence, great manual effort. We introduce a method that drastically reduces the manual annotation effort while retaining the high performance of a CNN for word spotting in handwritten documents. The model is learned with weak supervision using a combination of synthetically generated training data and a small subset of the training partition of the handwritten data set. We show that the network achieves results highly competitive with the state of the art in word spotting, with shorter training times and a fraction of the annotation effort.
Comments: submitted to DAS 2018
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:1712.00250 [cs.CV]
  (or arXiv:1712.00250v3 [cs.CV] for this version)

Submission history

From: Sebastian Sudholt
[v1] Fri, 1 Dec 2017 09:41:13 GMT (631kb,D)
[v2] Mon, 11 Dec 2017 09:56:58 GMT (631kb,D)
[v3] Fri, 26 Jan 2018 10:42:16 GMT (631kb,D)
