We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CV

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computer Vision and Pattern Recognition

Title: Flow-based Spatio-Temporal Structured Prediction of Motion Dynamics

Abstract: Conditional Normalizing Flows (CNFs) are flexible generative models capable of representing complicated distributions with high dimensionality and large interdimensional correlations, making them appealing for structured output learning. Their effectiveness in modelling multivariates spatio-temporal structured data has yet to be completely investigated. We propose MotionFlow as a novel normalizing flows approach that autoregressively conditions the output distributions on the spatio-temporal input features. It combines deterministic and stochastic representations with CNFs to create a probabilistic neural generative approach that can model the variability seen in high dimensional structured spatio-temporal data. We specifically propose to use conditional priors to factorize the latent space for the time dependent modeling. We also exploit the use of masked convolutions as autoregressive conditionals in CNFs. As a result, our method is able to define arbitrarily expressive output probability distributions under temporal dynamics in multivariate prediction tasks. We apply our method to different tasks, including trajectory prediction, motion prediction, time series forecasting, and binary segmentation, and demonstrate that our model is able to leverage normalizing flows to learn complicated time dependent conditional distributions.
Comments: 13 pages, LaTeX; typos corrected, updated, in IEEE Transactions on Pattern Analysis and Machine Intelligence
Subjects: Computer Vision and Pattern Recognition (cs.CV)
DOI: 10.1109/TPAMI.2023.3296446
Cite as: arXiv:2104.04391 [cs.CV]
  (or arXiv:2104.04391v3 [cs.CV] for this version)

Submission history

From: Mohsen Zand [view email]
[v1] Fri, 9 Apr 2021 14:30:35 GMT (934kb,D)
[v2] Fri, 20 May 2022 10:37:55 GMT (1114kb,D)
[v3] Mon, 4 Sep 2023 19:54:59 GMT (1456kb,D)

Link back to: arXiv, form interface, contact.