We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CV

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computer Vision and Pattern Recognition

Title: Bottom-Up Human Pose Estimation Via Disentangled Keypoint Regression

Abstract: In this paper, we are interested in the bottom-up paradigm of estimating human poses from an image. We study the dense keypoint regression framework that is previously inferior to the keypoint detection and grouping framework. Our motivation is that regressing keypoint positions accurately needs to learn representations that focus on the keypoint regions.
We present a simple yet effective approach, named disentangled keypoint regression (DEKR). We adopt adaptive convolutions through pixel-wise spatial transformer to activate the pixels in the keypoint regions and accordingly learn representations from them. We use a multi-branch structure for separate regression: each branch learns a representation with dedicated adaptive convolutions and regresses one keypoint. The resulting disentangled representations are able to attend to the keypoint regions, respectively, and thus the keypoint regression is spatially more accurate. We empirically show that the proposed direct regression method outperforms keypoint detection and grouping methods and achieves superior bottom-up pose estimation results on two benchmark datasets, COCO and CrowdPose. The code and models are available at this https URL
Comments: Accepted by CVPR2021. arXiv admin note: text overlap with arXiv:2006.15480
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2104.02300 [cs.CV]
  (or arXiv:2104.02300v1 [cs.CV] for this version)

Submission history

From: Zigang Geng [view email]
[v1] Tue, 6 Apr 2021 05:54:46 GMT (6523kb,D)

Link back to: arXiv, form interface, contact.