We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CV

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Computer Vision and Pattern Recognition

Title: Deep Learning Technique for Human Parsing: A Survey and Outlook

Abstract: Human parsing aims to partition humans in image or video into multiple pixel-level semantic parts. In the last decade, it has gained significantly increased interest in the computer vision community and has been utilized in a broad range of practical applications, from security monitoring, to social media, to visual special effects, just to name a few. Although deep learning-based human parsing solutions have made remarkable achievements, many important concepts, existing challenges, and potential research directions are still confusing. In this survey, we comprehensively review three core sub-tasks: single human parsing, multiple human parsing, and video human parsing, by introducing their respective task settings, background concepts, relevant problems and applications, representative literature, and datasets. We also present quantitative performance comparisons of the reviewed methods on benchmark datasets. Additionally, to promote sustainable development of the community, we put forward a transformer-based human parsing framework, providing a high-performance baseline for follow-up research through universal, concise, and extensible solutions. Finally, we point out a set of under-investigated open issues in this field and suggest new directions for future study. We also provide a regularly updated project page, to continuously track recent developments in this fast-advancing field: this https URL
Comments: Accepted for publication in International Journal of Computer Vision (IJCV)
Subjects: Computer Vision and Pattern Recognition (cs.CV)
DOI: 10.1007/s11263-024-02031-9
Cite as: arXiv:2301.00394 [cs.CV]
  (or arXiv:2301.00394v2 [cs.CV] for this version)

Submission history

From: Lu Yang [view email]
[v1] Sun, 1 Jan 2023 12:39:57 GMT (4015kb,D)
[v2] Thu, 14 Mar 2024 02:00:19 GMT (3767kb,D)

Link back to: arXiv, form interface, contact.