We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CV

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Computer Vision and Pattern Recognition

Title: Leaping Into Memories: Space-Time Deep Feature Synthesis

Abstract: The success of deep learning models has led to their adaptation and adoption by prominent video understanding methods. The majority of these approaches encode features in a joint space-time modality for which the inner workings and learned representations are difficult to visually interpret. We propose LEArned Preconscious Synthesis (LEAPS), an architecture-agnostic method for synthesizing videos from the internal spatiotemporal representations of models. Using a stimulus video and a target class, we prime a fixed space-time model and iteratively optimize a video initialized with random noise. We incorporate additional regularizers to improve the feature diversity of the synthesized videos as well as the cross-frame temporal coherence of motions. We quantitatively and qualitatively evaluate the applicability of LEAPS by inverting a range of spatiotemporal convolutional and attention-based architectures trained on Kinetics-400, which to the best of our knowledge has not been previously accomplished.
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2303.09941 [cs.CV]
  (or arXiv:2303.09941v3 [cs.CV] for this version)

Submission history

From: Alexandros Stergiou [view email]
[v1] Fri, 17 Mar 2023 12:55:22 GMT (72185kb,D)
[v2] Mon, 20 Mar 2023 09:07:49 GMT (72185kb,D)
[v3] Wed, 29 Mar 2023 06:14:47 GMT (10451kb,D)

Link back to: arXiv, form interface, contact.