Training Region-based Object Detectors with Online Hard Example Mining

Shrivastava, Abhinav; Gupta, Abhinav; Girshick, Ross

Full-text links:

Download:

Current browse context:

cs.CV

< prev | next >

new | recent | 1604

Computer Science > Computer Vision and Pattern Recognition

Title: Training Region-based Object Detectors with Online Hard Example Mining

Authors: Abhinav Shrivastava, Abhinav Gupta, Ross Girshick

(Submitted on 12 Apr 2016)

Abstract: The field of object detection has made significant advances riding on the wave of region-based ConvNets, but their training procedure still includes many heuristics and hyperparameters that are costly to tune. We present a simple yet surprisingly effective online hard example mining (OHEM) algorithm for training region-based ConvNet detectors. Our motivation is the same as it has always been -- detection datasets contain an overwhelming number of easy examples and a small number of hard examples. Automatic selection of these hard examples can make training more effective and efficient. OHEM is a simple and intuitive algorithm that eliminates several heuristics and hyperparameters in common use. But more importantly, it yields consistent and significant boosts in detection performance on benchmarks like PASCAL VOC 2007 and 2012. Its effectiveness increases as datasets become larger and more difficult, as demonstrated by the results on the MS COCO dataset. Moreover, combined with complementary advances in the field, OHEM leads to state-of-the-art results of 78.9% and 76.3% mAP on PASCAL VOC 2007 and 2012 respectively.

Comments:	To appear in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016. (oral)
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:1604.03540 [cs.CV]
	(or arXiv:1604.03540v1 [cs.CV] for this version)

Submission history

From: Abhinav Shrivastava [view email]
[v1] Tue, 12 Apr 2016 19:44:13 GMT (649kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1604.03540

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: Training Region-based Object Detectors with Online Hard Example Mining

Submission history