Self-Supervision and Spatial-Sequential Attention Based Loss for Multi-Person Pose Estimation

Liu, Haiyang; Luo, Dingli; Du, Songlin; Ikenaga, Takeshi

Full-text links:

Download:

Current browse context:

cs.CV

< prev | next >

new | recent | 2110

Change to browse by:

Computer Science > Computer Vision and Pattern Recognition

Title: Self-Supervision and Spatial-Sequential Attention Based Loss for Multi-Person Pose Estimation

Authors: Haiyang Liu, Dingli Luo, Songlin Du, Takeshi Ikenaga

(Submitted on 20 Oct 2021)

Abstract: Bottom-up based multi-person pose estimation approaches use heatmaps with auxiliary predictions to estimate joint positions and belonging at one time. Recently, various combinations between auxiliary predictions and heatmaps have been proposed for higher performance, these predictions are supervised by the corresponding L2 loss function directly. However, the lack of more explicit supervision results in low features utilization and contradictions between predictions in one model. To solve these problems, this paper proposes (i) a new loss organization method which uses self-supervised heatmaps to reduce prediction contradictions and spatial-sequential attention to enhance networks' features extraction; (ii) a new combination of predictions composed by heatmaps, Part Affinity Fields (PAFs) and our block-inside offsets to fix pixel-level joints positions and further demonstrates the effectiveness of proposed loss function. Experiments are conducted on the MS COCO keypoint dataset and adopting OpenPose as the baseline model. Our method outperforms the baseline overall. On the COCO verification dataset, the mAP of OpenPose trained with our proposals outperforms the OpenPose baseline by over 5.5%.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2110.10734 [cs.CV]
	(or arXiv:2110.10734v1 [cs.CV] for this version)

Submission history

From: Haiyang Liu [view email]
[v1] Wed, 20 Oct 2021 19:13:17 GMT (7330kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2110.10734

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: Self-Supervision and Spatial-Sequential Attention Based Loss for Multi-Person Pose Estimation

Submission history