References & Citations
Computer Science > Computer Vision and Pattern Recognition
Title: Learning Deep Representations for Word Spotting Under Weak Supervision
(Submitted on 1 Dec 2017 (v1), last revised 26 Jan 2018 (this version, v3))
Abstract: Convolutional Neural Networks have made their mark in various fields of computer vision in recent years. They have achieved state-of-the-art performance in the field of document analysis as well. However, CNNs require a large amount of annotated training data and, hence, great manual effort. In our approach, we introduce a method to drastically reduce the manual annotation effort while retaining the high performance of a CNN for word spotting in handwritten documents. The model is learned with weak supervision using a combination of synthetically generated training data and a small subset of the training partition of the handwritten data set. We show that the network achieves results highly competitive to the state-of-the-art in word spotting with shorter training times and a fraction of the annotation effort.
Submission history
From: Sebastian Sudholt [view email][v1] Fri, 1 Dec 2017 09:41:13 GMT (631kb,D)
[v2] Mon, 11 Dec 2017 09:56:58 GMT (631kb,D)
[v3] Fri, 26 Jan 2018 10:42:16 GMT (631kb,D)
Link back to: arXiv, form interface, contact.