We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CV

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computer Vision and Pattern Recognition

Title: Compressed Volumetric Heatmaps for Multi-Person 3D Pose Estimation

Abstract: In this paper we present a novel approach for bottom-up multi-person 3D human pose estimation from monocular RGB images. We propose to use high resolution volumetric heatmaps to model joint locations, devising a simple and effective compression method to drastically reduce the size of this representation. At the core of the proposed method lies our Volumetric Heatmap Autoencoder, a fully-convolutional network tasked with the compression of ground-truth heatmaps into a dense intermediate representation. A second model, the Code Predictor, is then trained to predict these codes, which can be decompressed at test time to re-obtain the original representation. Our experimental evaluation shows that our method performs favorably when compared to state of the art on both multi-person and single-person 3D human pose estimation datasets and, thanks to our novel compression strategy, can process full-HD images at the constant runtime of 8 fps regardless of the number of subjects in the scene. Code and models available at this https URL .
Comments: CVPR 2020
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2004.00329 [cs.CV]
  (or arXiv:2004.00329v1 [cs.CV] for this version)

Submission history

From: Matteo Fabbri Ing. [view email]
[v1] Wed, 1 Apr 2020 10:37:39 GMT (6925kb,D)

Link back to: arXiv, form interface, contact.