Title: Towards Realistic Face Photo-Sketch Synthesis via Composition-Aided GANs

Abstract: Face photo-sketch synthesis aims to generate a facial sketch (or photo) conditioned on a given photo (or sketch). It has wide applications, including digital entertainment and law enforcement. Depicting face photos and sketches precisely remains challenging because the output must be both structurally realistic and texturally consistent. While existing methods achieve compelling results, they mostly produce blurred effects and severe deformation of facial components, which makes the synthesized images look unrealistic. To tackle this challenge, we propose to use facial composition information to aid the synthesis of face sketches and photos. Specifically, we propose a novel composition-aided generative adversarial network (CA-GAN) for face photo-sketch synthesis. In CA-GAN, we use paired inputs, consisting of a face photo/sketch and the corresponding pixel-wise face labels, to generate a sketch/photo. In addition, to focus training on hard-to-generate components and delicate facial structures, we propose a compositional reconstruction loss. Finally, we use stacked CA-GANs (SCA-GAN) to further rectify defects and add compelling details. Experimental results show that our method generates visually comfortable and identity-preserving face sketches and photos over a wide range of challenging data. Our method achieves state-of-the-art quality, reducing the best previous Fréchet Inception distance (FID) by a large margin. We also demonstrate that the proposed method has considerable generalization ability. We have made our code and results publicly available: this https URL
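
The abstract names a compositional reconstruction loss but does not give its form. As a rough illustration only, the sketch below assumes one plausible reading: the absolute error is averaged separately within each labeled facial component and the per-component means are then averaged, so that small regions (such as eyes) are not swamped by large ones (such as background). The function names, the toy images, and the two-label layout are hypothetical and are not taken from the authors' code.

```python
def global_l1_loss(target, generated):
    """Plain mean absolute error over all pixels, for comparison."""
    diffs = [abs(t - g)
             for t_row, g_row in zip(target, generated)
             for t, g in zip(t_row, g_row)]
    return sum(diffs) / len(diffs)


def compositional_l1_loss(target, generated, labels, num_components):
    """Mean absolute error per labeled component, averaged over the
    components that actually occur in the label map (assumed form)."""
    sums = [0.0] * num_components
    counts = [0] * num_components
    for t_row, g_row, l_row in zip(target, generated, labels):
        for t, g, c in zip(t_row, g_row, l_row):
            sums[c] += abs(t - g)
            counts[c] += 1
    per_component = [s / n for s, n in zip(sums, counts) if n > 0]
    if not per_component:
        return 0.0
    return sum(per_component) / len(per_component)


# Toy 2x4 grayscale "images": label 0 is background (6 pixels),
# label 1 is a small facial component (2 pixels).
target = [[0.0, 0.0, 0.0, 0.0],
          [0.0, 1.0, 1.0, 0.0]]
generated = [[0.0, 0.0, 0.0, 0.0],
             [0.0, 0.5, 0.5, 0.0]]
labels = [[0, 0, 0, 0],
          [0, 1, 1, 0]]

print(global_l1_loss(target, generated))                    # 0.125
print(compositional_l1_loss(target, generated, labels, 2))  # 0.25
```

In this toy case the error sits entirely in the two-pixel component, so the per-component average (0.25) is twice the global average (0.125). That is the sense in which such a loss could focus training on small, hard-to-generate components.
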
Comments: 10 pages, 8 figures, journal
Subjects: Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:1712.00899 [cs.CV]
  (or arXiv:1712.00899v4 [cs.CV] for this version)

Submission history

From: Fei Gao
[v1] Mon, 4 Dec 2017 04:24:19 GMT (6597kb,D)
[v2] Tue, 10 Jul 2018 06:20:40 GMT (5067kb,D)
[v3] Fri, 21 Dec 2018 08:12:47 GMT (3478kb,D)
[v4] Thu, 9 Jan 2020 03:35:56 GMT (2401kb,D)