We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CV

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computer Vision and Pattern Recognition

Title: Counting Grid Aggregation for Event Retrieval and Recognition

Abstract: Event retrieval and recognition in a large corpus of videos necessitates a holistic fixed-size visual representation at the video clip level that is comprehensive, compact, and yet discriminative. It shall comprehensively aggregate information across relevant video frames, while suppress redundant information, leading to a compact representation that can effectively differentiate among different visual events. In search for such a representation, we propose to build a spatially consistent counting grid model to aggregate together deep features extracted from different video frames. The spatial consistency of the counting grid model is achieved by introducing a prior model estimated from a large corpus of video data. The counting grid model produces an intermediate tensor representation for each video, which automatically identifies and removes the feature redundancy across the different frames. The tensor representation is subsequently reduced to a fixed-size vector representation by averaging over the counting grid. When compared to existing methods on both event retrieval and event classification benchmarks, we achieve significantly better accuracy with much more compact representation.
Comments: This paper has been withdrawn by the author because this work will be part of another object which will be released soon
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:1604.01109 [cs.CV]
  (or arXiv:1604.01109v3 [cs.CV] for this version)

Submission history

From: Zhanning Gao [view email]
[v1] Tue, 5 Apr 2016 01:38:07 GMT (1254kb,D)
[v2] Fri, 12 Aug 2016 09:01:57 GMT (1996kb,D)
[v3] Tue, 11 Oct 2016 12:11:47 GMT (0kb,I)

Link back to: arXiv, form interface, contact.