Title: VRFP: On-the-fly Video Retrieval using Web Images and Fast Fisher Vector Products

Abstract: VRFP is a real-time video retrieval framework based on short text input queries, in which weakly labeled training samples are obtained from the web after the query is known. Our experiments show that a Fisher Vector is robust to the noise present in web images and compares favorably in accuracy to other standard representations. While a Fisher Vector for a new query can be constructed efficiently, matching against the test set is slow due to its high dimensionality. To perform matching in real time, we present a lossless algorithm for accelerating the computation of dot products between high-dimensional Fisher Vectors. We prove that the expected number of multiplications required is quadratic in the sparsity of the Fisher Vectors. We are not only able to construct and apply query models in real time, but, with the help of a simple re-ranking scheme, we also outperform state-of-the-art automatic retrieval methods by a significant margin on the TRECVID MED13 (3.5%), MED14 (1.3%) and CCV (5.2%) datasets.
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:1512.03384 [cs.CV]
  (or arXiv:1512.03384v2 [cs.CV] for this version)

Submission history

From: Xintong Han
[v1] Thu, 10 Dec 2015 19:50:50 GMT (1833kb,D)
[v2] Thu, 7 Apr 2016 01:25:42 GMT (935kb,D)
[v3] Mon, 10 Apr 2017 17:28:16 GMT (3361kb,D)
