Computer Science > Computer Vision and Pattern Recognition
Title: Optimizing Hand Region Detection in MediaPipe Holistic Full-Body Pose Estimation to Improve Accuracy and Avoid Downstream Errors
(Submitted on 6 May 2024 (v1), last revised 11 May 2024 (this version, v2))
Abstract: This paper addresses a critical flaw in MediaPipe Holistic's hand Region of Interest (ROI) prediction, which struggles with non-ideal hand orientations, affecting sign language recognition accuracy. We propose a data-driven approach to enhance ROI estimation, leveraging an enriched feature set including additional hand keypoints and the z-dimension. Our results demonstrate better estimates, with higher Intersection-over-Union compared to the current method. Our code and optimizations are available at this https URL
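The abstract compares ROI estimates using Intersection-over-Union (IoU). As a rough illustration of that metric only, the sketch below computes IoU for two axis-aligned boxes. The function name `box_iou` and the `(x_min, y_min, x_max, y_max)` box layout are assumptions made for this example. The paper's evaluation may use a different ROI representation (for instance rotated boxes), so this is not the authors' implementation.

```python
def box_iou(box_a, box_b):
    """Intersection-over-Union of two axis-aligned boxes.

    Each box is a hypothetical (x_min, y_min, x_max, y_max) tuple;
    this layout is an assumption for illustration, not taken from the paper.
    """
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Width and height of the overlap, clamped to zero when the boxes are disjoint.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    intersection = inter_w * inter_h

    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - intersection

    # Guard against degenerate (zero-area) inputs.
    return intersection / union if union > 0 else 0.0


if __name__ == "__main__":
    predicted_roi = (0.0, 0.0, 2.0, 2.0)
    reference_roi = (1.0, 1.0, 3.0, 3.0)
    print(box_iou(predicted_roi, reference_roi))
```

A value of 1.0 means the predicted ROI matches the reference exactly, and 0.0 means the two do not overlap, so a higher IoU indicates a better ROI estimate.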
Submission history
From: Amit Moryossef
[v1] Mon, 6 May 2024 15:10:16 GMT (623kb,D)
[v2] Sat, 11 May 2024 11:01:21 GMT (888kb,D)