Title: Compositional Sketch Search

Abstract: We present an algorithm for searching image collections using free-hand sketches that describe the appearance and relative positions of multiple objects. Sketch-based image retrieval (SBIR) methods predominantly match queries containing a single, dominant object, invariant to its position within an image. Our work exploits drawings as a concise and intuitive representation for specifying entire scene compositions. We train a convolutional neural network (CNN) to encode masked visual features from sketched objects, pooling these into a spatial descriptor that encodes the spatial relationships and appearances of objects in the composition. Training the CNN backbone as a Siamese network under triplet loss yields a metric search embedding for measuring compositional similarity, which may be efficiently leveraged for visual search by applying product quantization.
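
As a rough illustration of the triplet-loss objective mentioned in the abstract, the following is a minimal NumPy sketch and not the authors' implementation. The function name `triplet_loss`, the squared Euclidean distance, and the margin value of 0.2 are all assumptions made for this example; the abstract does not specify them.

```python
import numpy as np

def triplet_loss(anchor, positive, negative, margin=0.2):
    """Hinge-style triplet loss over batches of embeddings (one per row).

    Assumptions for illustration: squared Euclidean distance and
    margin=0.2; neither is stated in the abstract.
    """
    anchor = np.asarray(anchor, dtype=float)
    positive = np.asarray(positive, dtype=float)
    negative = np.asarray(negative, dtype=float)
    # Distance from each anchor to its matching (positive) example.
    d_pos = np.sum((anchor - positive) ** 2, axis=1)
    # Distance from each anchor to its non-matching (negative) example.
    d_neg = np.sum((anchor - negative) ** 2, axis=1)
    # Penalize triplets where the positive is not closer by at least `margin`.
    return float(np.maximum(d_pos - d_neg + margin, 0.0).mean())
```

In a Siamese setup of this kind, the anchor would typically be the embedding of a sketched composition, the positive an image with a matching composition, and the negative a non-matching image, all produced by the same network weights.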
Comments: ICIP 2021 camera-ready version
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2106.08009 [cs.CV]
  (or arXiv:2106.08009v1 [cs.CV] for this version)

Submission history

From: Alexander Black
[v1] Tue, 15 Jun 2021 09:38:09 GMT (9947kb,D)
