Title: Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks

Abstract: Face detection and alignment in unconstrained environments are challenging due to varied poses, illumination, and occlusions. Recent studies show that deep learning approaches can achieve impressive performance on these two tasks. In this paper, we propose a deep cascaded multi-task framework that exploits the inherent correlation between detection and alignment to boost their performance. In particular, our framework adopts a cascaded structure with three stages of carefully designed deep convolutional networks that predict face and landmark locations in a coarse-to-fine manner. In addition, we propose a new online hard sample mining strategy for the learning process that improves performance automatically, without manual sample selection. Our method achieves superior accuracy over state-of-the-art techniques on the challenging FDDB and WIDER FACE benchmarks for face detection and the AFLW benchmark for face alignment, while maintaining real-time performance.
Comments: Submitted to IEEE Signal Processing Letters
Subjects: Computer Vision and Pattern Recognition (cs.CV)
DOI: 10.1109/LSP.2016.2603342
Cite as: arXiv:1604.02878 [cs.CV]
  (or arXiv:1604.02878v1 [cs.CV] for this version)
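
The abstract mentions online hard sample mining without giving details. As a rough illustration of the general idea only, the sketch below keeps the highest-loss samples in a mini-batch and averages the loss over just those, so that easy samples do not contribute to the update. The function names and the `keep_ratio` default are hypothetical choices made for this example; the abstract does not state them, and this is not presented as the authors' implementation.

```python
from typing import List


def select_hard_samples(losses: List[float], keep_ratio: float = 0.7) -> List[int]:
    """Return the indices of the highest-loss samples in a mini-batch.

    keep_ratio is an assumed, illustrative value; the abstract does not
    specify what fraction of samples is treated as "hard".
    """
    if not losses:
        raise ValueError("losses must not be empty")
    if not 0.0 < keep_ratio <= 1.0:
        raise ValueError("keep_ratio must be in (0, 1]")
    # Always keep at least one sample so the mean below is well defined.
    n_keep = max(1, int(len(losses) * keep_ratio))
    # Rank sample indices from largest loss to smallest.
    order = sorted(range(len(losses)), key=lambda i: losses[i], reverse=True)
    # Return the kept indices in their original batch order.
    return sorted(order[:n_keep])


def hard_sample_loss(losses: List[float], keep_ratio: float = 0.7) -> float:
    """Average the loss over the selected hard samples only."""
    kept = select_hard_samples(losses, keep_ratio)
    return sum(losses[i] for i in kept) / len(kept)


if __name__ == "__main__":
    batch_losses = [0.1, 2.0, 0.5, 1.5]
    print(select_hard_samples(batch_losses, keep_ratio=0.5))
    print(hard_sample_loss(batch_losses, keep_ratio=0.5))
```

In a training loop, the selection would be recomputed for every mini-batch from the current forward-pass losses, which is what makes the mining "online" rather than a separate offline pass over the dataset.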

Submission history

From: Kaipeng Zhang
[v1] Mon, 11 Apr 2016 10:47:14 GMT (832kb)