THUNDR: Transformer-based 3D HUmaN Reconstruction with Markers

Zanfir, Mihai; Zanfir, Andrei; Bazavan, Eduard Gabriel; Freeman, William T.; Sukthankar, Rahul; Sminchisescu, Cristian

(Submitted on 17 Jun 2021)

Abstract: We present THUNDR, a transformer-based deep neural network methodology to reconstruct the 3d pose and shape of people, given monocular RGB images. Key to our methodology is an intermediate 3d marker representation, where we aim to combine the predictive power of model-free-output architectures and the regularizing, anthropometrically-preserving properties of a statistical human surface model like GHUM -- a recently introduced, expressive full body statistical 3d human model, trained end-to-end. Our novel transformer-based prediction pipeline can focus on image regions relevant to the task, supports self-supervised regimes, and ensures that solutions are consistent with human anthropometry. We show state-of-the-art results on Human3.6M and 3DPW, for both the fully-supervised and the self-supervised models, for the task of inferring 3d human shape, joint positions, and global translation. Moreover, we observe very solid 3d reconstruction performance for difficult human poses collected in the wild.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2106.09336 [cs.CV]
	(or arXiv:2106.09336v1 [cs.CV] for this version)

Submission history

From: Andrei Zanfir
[v1] Thu, 17 Jun 2021 09:09:24 GMT (17613kb,D)
