Exploring Severe Occlusion: Multi-Person 3D Pose Estimation with Gated Convolution

Gu, Renshu; Wang, Gaoang; Hwang, Jenq-Neng

Full-text links:

Download:

Current browse context:

cs.CV

< prev | next >

new | recent | 2011

Change to browse by:

Computer Science > Computer Vision and Pattern Recognition

Title: Exploring Severe Occlusion: Multi-Person 3D Pose Estimation with Gated Convolution

Authors: Renshu Gu, Gaoang Wang, Jenq-Neng Hwang

(Submitted on 31 Oct 2020)

Abstract: 3D human pose estimation (HPE) is crucial in many fields, such as human behavior analysis, augmented reality/virtual reality (AR/VR) applications, and self-driving industry. Videos that contain multiple potentially occluded people captured from freely moving monocular cameras are very common in real-world scenarios, while 3D HPE for such scenarios is quite challenging, partially because there is a lack of such data with accurate 3D ground truth labels in existing datasets. In this paper, we propose a temporal regression network with a gated convolution module to transform 2D joints to 3D and recover the missing occluded joints in the meantime. A simple yet effective localization approach is further conducted to transform the normalized pose to the global trajectory. To verify the effectiveness of our approach, we also collect a new moving camera multi-human (MMHuman) dataset that includes multiple people with heavy occlusion captured by moving cameras. The 3D ground truth joints are provided by accurate motion capture (MoCap) system. From the experiments on static-camera based Human3.6M data and our own collected moving-camera based data, we show that our proposed method outperforms most state-of-the-art 2D-to-3D pose estimation methods, especially for the scenarios with heavy occlusions.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2011.00184 [cs.CV]
	(or arXiv:2011.00184v1 [cs.CV] for this version)

Submission history

From: Gaoang Wang [view email]
[v1] Sat, 31 Oct 2020 04:35:24 GMT (4523kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2011.00184

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: Exploring Severe Occlusion: Multi-Person 3D Pose Estimation with Gated Convolution

Submission history