Deep video representation learning: a survey

Ravanbakhsh, Elham; Liang, Yongqing; Ramanujam, J.; Li, Xin

Full-text links:

Download:

Current browse context:

cs.CV

< prev | next >

new | recent | 2405

Change to browse by:

Computer Science > Computer Vision and Pattern Recognition

Title: Deep video representation learning: a survey

Authors: Elham Ravanbakhsh, Yongqing Liang, J. Ramanujam, Xin Li

(Submitted on 10 May 2024)

Abstract: This paper provides a review on representation learning for videos. We classify recent spatiotemporal feature learning methods for sequential visual data and compare their pros and cons for general video analysis. Building effective features for videos is a fundamental problem in computer vision tasks involving video analysis and understanding. Existing features can be generally categorized into spatial and temporal features. Their effectiveness under variations of illumination, occlusion, view and background are discussed. Finally, we discuss the remaining challenges in existing deep video representation learning studies.

Comments:	Multimedia Tools and Applications (2023) 1-31
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2405.06574 [cs.CV]
	(or arXiv:2405.06574v1 [cs.CV] for this version)

Submission history

From: Elham Ravanbakhsh [view email]
[v1] Fri, 10 May 2024 16:20:11 GMT (1810kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2405.06574

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: Deep video representation learning: a survey

Submission history