We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:


Current browse context:


Change to browse by:


References & Citations

DBLP - CS Bibliography


(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Computer Vision and Pattern Recognition

Title: Hierarchical Video Generation for Complex Data

Abstract: Videos can often be created by first outlining a global description of the scene and then adding local details. Inspired by this we propose a hierarchical model for video generation which follows a coarse to fine approach. First our model generates a low resolution video, establishing the global scene structure, that is then refined by subsequent levels in the hierarchy. We train each level in our hierarchy sequentially on partial views of the videos. This reduces the computational complexity of our generative model, which scales to high-resolution videos beyond a few frames. We validate our approach on Kinetics-600 and BDD100K, for which we train a three level model capable of generating 256x256 videos with 48 frames.
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2106.02719 [cs.CV]
  (or arXiv:2106.02719v1 [cs.CV] for this version)

Submission history

From: Lluis Castrejon [view email]
[v1] Fri, 4 Jun 2021 21:03:52 GMT (5439kb,D)

Link back to: arXiv, form interface, contact.