Computer Science > Computer Vision and Pattern Recognition
Title: Human Pose Estimation in Space and Time using 3D CNN
(Submitted on 31 Aug 2016 (v1), last revised 19 Oct 2016 (this version, v3))
Abstract: This paper explores the capabilities of convolutional neural networks on a task that humans handle easily: perceiving the 3D pose of a human body from varying angles. In our approach, however, we are restricted to a monocular vision system. To this end, we apply a convolutional neural network to RGB videos and extend it to three-dimensional convolutions. This is done by encoding the time dimension of a video as the third dimension of the convolutional space and directly regressing to human body joint positions in 3D coordinate space. This research shows that such a network can achieve state-of-the-art performance on the selected Human3.6M dataset, thus demonstrating that temporal data can be successfully represented with an additional dimension in the convolutional operation.
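The idea of treating time as a third convolutional axis and regressing directly to joint coordinates can be illustrated with a minimal pure-Python sketch. This is not the authors' architecture: the single-channel toy clip, the 2x2x2 box kernel, the hand-set linear weights, and the function names `conv3d_valid` and `regress_joints` are all assumptions made purely for illustration.

```python
from itertools import product

def conv3d_valid(clip, kernel):
    """Naive 'valid' 3D cross-correlation over a (time, height, width) clip."""
    T, H, W = len(clip), len(clip[0]), len(clip[0][0])
    kt, kh, kw = len(kernel), len(kernel[0]), len(kernel[0][0])
    out = [[[0.0] * (W - kw + 1) for _ in range(H - kh + 1)]
           for _ in range(T - kt + 1)]
    # The kernel slides along time exactly as it slides along height and width.
    for t, y, x in product(range(T - kt + 1), range(H - kh + 1), range(W - kw + 1)):
        out[t][y][x] = sum(
            clip[t + dt][y + dy][x + dx] * kernel[dt][dy][dx]
            for dt, dy, dx in product(range(kt), range(kh), range(kw))
        )
    return out

def regress_joints(features, weights, num_joints):
    """Linear map from flattened features to num_joints (x, y, z) triples."""
    flat = [v for frame in features for row in frame for v in row]
    coords = [sum(w * v for w, v in zip(row, flat)) for row in weights]
    return [tuple(coords[3 * j: 3 * j + 3]) for j in range(num_joints)]

# Toy single-channel clip: 3 frames of 4x4 pixels (a real input would be RGB video).
clip = [[[float(t + y + x) for x in range(4)] for y in range(4)] for t in range(3)]
# Hypothetical 2x2x2 box filter spanning two frames.
kernel = [[[1.0] * 2 for _ in range(2)] for _ in range(2)]
features = conv3d_valid(clip, kernel)  # spatio-temporal feature volume, shape (2, 3, 3)

# Hypothetical regression weights: 2 joints x 3 coordinates = 6 outputs over
# 18 features. Set by hand here; in a trained network they would be learned.
num_joints = 2
weights = [[1.0 if i == j else 0.0 for j in range(18)] for i in range(6)]
joints = regress_joints(features, weights, num_joints)
```

In practice a deep-learning framework's 3D convolution layer would replace the explicit loops, and several such layers would be stacked before the regression output; the sketch only makes the role of the time axis explicit.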
Submission history
From: Amogh Gudi
[v1] Wed, 31 Aug 2016 20:55:26 GMT (7776kb,D)
[v2] Wed, 14 Sep 2016 16:17:15 GMT (7776kb,D)
[v3] Wed, 19 Oct 2016 12:44:15 GMT (1611kb,D)