We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CV

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computer Vision and Pattern Recognition

Title: SSD: Single Shot MultiBox Detector

Abstract: We present a method for detecting objects in images using a single deep neural network. Our approach, named SSD, discretizes the output space of bounding boxes into a set of bounding box priors over different aspect ratios and scales per feature map location. At prediction time, the network generates confidences that each prior corresponds to objects of interest and produces adjustments to the prior to better match the object shape. Additionally, the network combines predictions from multiple feature maps with different resolutions to naturally handle objects of various sizes. Our SSD model is simple relative to methods that requires object proposals, such as R-CNN and MultiBox, because it completely discards the proposal generation step and encapsulates all the computation in a single network. This makes SSD easy to train and straightforward to integrate into systems that require a detection component. Experimental results on ILSVRC DET and PASCAL VOC dataset confirm that SSD has comparable performance with methods that utilize an additional object proposal step and yet is 100-1000x faster. Compared to other single stage methods, SSD has similar or better performance, while providing a unified framework for both training and inference.
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:1512.02325 [cs.CV]
  (or arXiv:1512.02325v1 [cs.CV] for this version)

Submission history

From: Wei Liu [view email]
[v1] Tue, 8 Dec 2015 04:46:38 GMT (285kb,D)
[v2] Wed, 30 Mar 2016 21:17:34 GMT (2230kb,D)
[v3] Tue, 8 Nov 2016 18:31:25 GMT (2699kb,D)
[v4] Wed, 30 Nov 2016 09:54:02 GMT (2769kb,D)
[v5] Thu, 29 Dec 2016 19:05:11 GMT (2711kb,D)

Link back to: arXiv, form interface, contact.